TY - JOUR AU - Andreeva, A. AU - Howorth, D. AU - Chothia, C. AU - Kulesha, E. AU - Murzin, A. G. PY - 2014 DA - 2014// TI - SCOP2 prototype: a new approach to protein structure mining JO - Nucleic Acids Res VL - 42 UR - https://doi.org/10.1093/nar/gkt1242 DO - 10.1093/nar/gkt1242 ID - Andreeva2014 ER - TY - JOUR AU - Punta, M. AU - Coggill, P. C. AU - Eberhardt, R. Y. AU - Mistry, J. AU - Tate, J. AU - Boursnell, C. AU - Pang, N. AU - Forslund, K. AU - Ceric, G. AU - Clements, J. AU - Heger, A. AU - Holm, L. AU - Sonnhammer, E. L. AU - Eddy, S. R. AU - Bateman, A. AU - Finn, R. D. PY - 2014 DA - 2014// TI - The Pfam protein families database JO - Nucleic Acids Res VL - 42 UR - https://doi.org/10.1093/nar/gkt1223 DO - 10.1093/nar/gkt1223 ID - Punta2014 ER - TY - JOUR AU - Ekman, D. AU - Bjorklund, A. K. AU - Frey-Skott, J. AU - Elofsson, A. PY - 2005 DA - 2005// TI - Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions JO - J Mol Biol VL - 348 UR - https://doi.org/10.1016/j.jmb.2005.02.007 DO - 10.1016/j.jmb.2005.02.007 ID - Ekman2005 ER - TY - JOUR AU - Forslund, K. AU - Sonnhammer, E. L. PY - 2008 DA - 2008// TI - Predicting protein function from domain content JO - Bioinformatics VL - 24 UR - https://doi.org/10.1093/bioinformatics/btn312 DO - 10.1093/bioinformatics/btn312 ID - Forslund2008 ER - TY - JOUR AU - Itoh, M. AU - Nacher, J. C. AU - Kuma, K. AU - Goto, S. AU - Kanehisa, M. PY - 2007 DA - 2007// TI - Evolutionary history and functional implications of protein domains and their combinations in eukaryotes JO - Genome Biol VL - 8 UR - https://doi.org/10.1186/gb-2007-8-6-r121 DO - 10.1186/gb-2007-8-6-r121 ID - Itoh2007 ER - TY - JOUR AU - Kummerfeld, S. K. AU - Teichmann, S. A. PY - 2009 DA - 2009// TI - Protein domain organisation: adding order JO - BMC Bioinformatics VL - 10 UR - https://doi.org/10.1186/1471-2105-10-39 DO - 10.1186/1471-2105-10-39 ID - Kummerfeld2009 ER - TY - JOUR AU - Pearson, W. R. AU - Sierk, M. L. PY - 2005 DA - 2005// TI - The limits of protein sequence comparison? JO - Curr Opin Struct Biol VL - 15 UR - https://doi.org/10.1016/j.sbi.2005.05.005 DO - 10.1016/j.sbi.2005.05.005 ID - Pearson2005 ER - TY - JOUR AU - Schwende, I. AU - Pham, T. D. PY - 2014 DA - 2014// TI - Pattern recognition and probabilistic measures in alignment-free sequence analysis JO - Brief Bioinform VL - 15 UR - https://doi.org/10.1093/bib/bbt070 DO - 10.1093/bib/bbt070 ID - Schwende2014 ER - TY - JOUR AU - Vinga, S. AU - Almeida, J. PY - 2003 DA - 2003// TI - Alignment-free sequence comparison-a review JO - Bioinformatics VL - 19 UR - https://doi.org/10.1093/bioinformatics/btg005 DO - 10.1093/bioinformatics/btg005 ID - Vinga2003 ER - TY - JOUR AU - Kelil, A. AU - Wang, S. AU - Brzezinski, R. AU - Fleury, A. PY - 2007 DA - 2007// TI - CLUSS: clustering of protein sequences based on a new similarity measure JO - BMC Bioinformatics VL - 8 UR - https://doi.org/10.1186/1471-2105-8-286 DO - 10.1186/1471-2105-8-286 ID - Kelil2007 ER - TY - JOUR AU - Martin, J. AU - Anamika, K. AU - Srinivasan, N. PY - 2010 DA - 2010// TI - Classification of protein kinases on the basis of both kinase and non-kinase regions JO - PLoS One VL - 5 UR - https://doi.org/10.1371/journal.pone.0012460 DO - 10.1371/journal.pone.0012460 ID - Martin2010 ER - TY - JOUR AU - Bhaskara, R. M. AU - Mehrotra, P. AU - Rakshambikai, R. AU - Gnanavel, M. AU - Martin, J. AU - Srinivasan, N. PY - 2014 DA - 2014// TI - The relationship between classification of multi-domain proteins using an alignment-free approach and their functions: a case study with Immunoglobulins JO - Mol Biosyst VL - 10 UR - https://doi.org/10.1039/c3mb70443b DO - 10.1039/c3mb70443b ID - Bhaskara2014 ER - TY - JOUR AU - Ward, J. H. PY - 1963 DA - 1963// TI - Hierarchial grouping to optimize an objective function JO - J Am Stat Assoc VL - 58 UR - https://doi.org/10.1080/01621459.1963.10500845 DO - 10.1080/01621459.1963.10500845 ID - Ward1963 ER - TY - CHAP PY - 2008 DA - 2008// TI - R: A Language and Environment for Statistical Computing BT - R Foundation for Statistical Computing PB - Vienna CY - Austria ID - ref14 ER - TY - JOUR AU - Levandowsky, M. AU - Winter, D. PY - 1971 DA - 1971// TI - Distance between sets JO - Nature VL - 234 UR - https://doi.org/10.1038/234034a0 DO - 10.1038/234034a0 ID - Levandowsky1971 ER - TY - JOUR AU - Goodman, L. A. AU - Kruskal, W. H. PY - 1954 DA - 1954// TI - Measures of association for cross classifications JO - J Am Stat Assoc VL - 49 ID - Goodman1954 ER - TY - JOUR AU - Lin, K. AU - Zhu, L. AU - Zhang, D. Y. PY - 2006 DA - 2006// TI - An initial strategy for comparing proteins at the domain architecture level JO - Bioinformatics VL - 22 UR - https://doi.org/10.1093/bioinformatics/btl366 DO - 10.1093/bioinformatics/btl366 ID - Lin2006 ER - TY - JOUR AU - Larkin, M. A. AU - Blackshields, G. AU - Brown, N. P. AU - Chenna, R. AU - McGettigan, P. A. AU - McWilliam, H. AU - Valentin, F. AU - Wallace, I. M. AU - Wilm, A. AU - Lopez, R. AU - Thompson, J. D. AU - Gibson, T. J. AU - Higgins, D. G. PY - 2007 DA - 2007// TI - Clustal W and Clustal X version 2.0 JO - Bioinformatics VL - 23 UR - https://doi.org/10.1093/bioinformatics/btm404 DO - 10.1093/bioinformatics/btm404 ID - Larkin2007 ER - TY - JOUR AU - Huang, Y. AU - Niu, B. AU - Gao, Y. AU - Fu, L. AU - Li, W. PY - 2010 DA - 2010// TI - CD-HIT Suite: a web server for clustering and comparing biological sequences JO - Bioinformatics VL - 26 UR - https://doi.org/10.1093/bioinformatics/btq003 DO - 10.1093/bioinformatics/btq003 ID - Huang2010 ER - TY - JOUR PY - 2012 DA - 2012// TI - Reorganizing the protein space at the Universal Protein Resource (UniProt) JO - Nucleic Acids Res VL - 40 UR - https://doi.org/10.1093/nar/gkr981 DO - 10.1093/nar/gkr981 ID - ref20 ER - TY - CHAP AU - Sul, S. J. AU - Williams, T. L. PY - 2007 DA - 2007// TI - A Randomized Algorithm for Comparing Sets of Phylogenetic Trees BT - Proceedings of the Asia-Pacific Bioinformatics Conference 2007 ID - Sul2007 ER -