Treffer: A Review on Efficient and Scalable Graph-Based Clustering Algorithms for Protein Complex Identification in PPI Networks.
Original Publication: New York : Alan R. Liss, c1986-
G. D. Bader and C. W. Hogue , “Analyzing Yeast Protein‐Protein Interaction Data Obtained From Different Sources,” Nature Biotechnology 20, no. 10 (2002): 991–997.
H. N. Chua and L. Wong , “Increasing the Reliability of Protein Interactomes,” Drug Discovery Today 13, no. 15–16 (2008): 652–658.
D. J. Watts and S. H. Strogatz , “Collective Dynamics of ‘Small‐World’ Networks,” Nature 393, no. 6684 (1998): 440–442.
J. Moody and J. Coleman , Clustering and Cohesion in Networks: Concepts and Measures (Academia, 2015), 906–912.
L. Lin , R. Li , and T. Jia , “Scalable and Effective Conductance‐Based Graph Clustering,” Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 4 (2023): 4471–4478.
S. Muff , F. Rao , and A. Caflisch , “Local Modularity Measure for Network Clusterizations,” Physical Review E 72, no. 5 (2005): 056107.
J. B. Pereira‐Leal , E. D. Levy , and S. A. Teichmann , “The Origins and Evolution of Functional Modules: Lessons From Protein Complexes,” Philosophical Transactions of the Royal Society, B: Biological Sciences 361, no. 1467 (2006): 507–517.
S. Srihari and H. W. Leong , “Employing Functional Interactions for Characterisation and Detection of Sparse Complexes From Yeast PPI Networks,” International Journal of Bioinformatics Research and Applications 8, no. 3–4 (2012): 286–304.
H. Liu , T. N. Beck , E. A. Golemis , and I. G. Serebriiskii , “Integrating In Silico Resources to Map a Signaling Network,” Gene Function Analysis. Methods in Molecular Biology 1101 (2014): 197–245.
P. C. Havugimana , G. T. Hart , T. Nepusz , et al., “A Census of Human Soluble Protein Complexes,” Cell 150, no. 5 (2012): 1068–1081.
B. P. Kelley , R. Sharan , R. M. Karp , et al., “Conserved Pathways Within Bacteria and Yeast as Revealed by Global Protein Network Alignment,” Proceedings of the National Academy of Sciences 100, no. 20 (2003): 11394–11399.
R. Sharan , T. Ideker , B. P. Kelley , R. Shamir , and R. M. Karp , “Identification of Protein Complexes by Comparative Analysis of Yeast and Bacterial Protein Interaction Data,” in Proceedings of the Eighth Annual International Conference on Research in Computational Molecular Biology (ACM, 2004), 282–289.
M. Li , W. Chen , J. Wang , F. X. Wu , and Y. Pan , “Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks,” BioMed Research International 2014, no. 1 (2014): 375262.
J. A. Marsh , S. A. Teichmann , and J. D. Forman‐Kay , “Probing the Diverse Landscape of Protein Exibility and Binding,” Current Opinion in Structural Biology 22, no. 5 (2012): 643–650.
H. Hegyi , E. Schad , and P. Tompa , “Structural Disorder Promotes Assembly of Protein Complexes,” BMC Structural Biology 7, no. 1 (2007): 1–9.
A. L. Barabási , N. Gulbahce , and J. Loscalzo , “Network Medicine: A Network‐Based Approach to Human Disease,” Nature Reviews Genetics 12, no. 1 (2011): 56–68.
S. Srihari , P. B. Madhamshettiwar , S. Song , et al., “Complex‐Based Analysis of Dysregulated Cellular Processes in Cancer,” BMC Systems Biology 8, no. 4 (2014): 1–15.
A. W. Rives and T. Galitski , “Modular Organization of Cellular Networks,” Proceedings of the National Academy of Sciences 100, no. 3 (2003): 1128–1133.
C. Lin , Y. R. Cho , W. C. Hwang , P. Pei , and A. Zhang , “Clustering Methods in Protein‐Protein Interaction Network,” in Knowledge Discovery in Bioinformatics: Techniques, Methods and Application (Wiley Series in Bioinformatics, 2007), 1–35.
G. D. Bader and C. W. Hogue , “An Automated Method for Finding Molecular Complexes in Large Protein Interaction Networks,” BMC Bioinformatics 4, no. 1 (2003): 1–27.
B. Adamcsek , G. Palla , I. J. Farkas , I. Derenyi , and T. Vicsek , “Cfinder: Locating Cliques and Overlapping Modules in Biological Networks,” Bioinformatics 22, no. 8 (2006): 1021–1023.
M. Altaf‐Ul‐Amin , Y. Shinbo , K. Mihara , K. Kurokawa , and S. Kanaya , “Development and Implementation of an Algorithm for Detection of Protein Complexes in Large Interaction Networks,” BMC Bioinformatics 7, no. 1 (2006): 1–13.
M. Wu , X. Li , C. K. Kwoh , and S. K. Ng , “A Core‐Attachment Based Method to Detect Protein Complexes in PPI Networks,” BMC Bioinformatics 10, no. 1 (2009): 1–16.
M. Haque , R. Sarmah , and D. K. Bhattacharyya , “A Common Neighbor Based Technique to Detect Protein Complexes in PPI Networks,” Journal of Genetic Engineering and Biotechnology 16, no. 1 (2018): 227–238.
A. D. King , N. Przulj , and I. Jurisica , “Protein Complex Prediction via Cost‐Based Clustering,” Bioinformatics 20, no. 17 (2004): 3013–3020.
I. A. Kovacs , R. Palotai , M. S. Szalay , and P. Csermely , “Community Landscapes: An Integrative Approach to Determine Overlapping Network Module Hierarchy, Identify Key Nodes and Predict Network Dynamics,” PLoS One 5, no. 9 (2010): e12528.
T. Nepusz , H. Yu , and A. Paccanaro , “Detecting Overlapping Protein Complexes in Protein‐Protein Interaction Networks,” Nature Methods 9, no. 5 (2012): 471–472.
A. J. Enright , S. Van Dongen , and C. A. Ouzounis , “An Efficient Algorithm for Large‐Scale Detection of Protein Families,” Nucleic Acids Research 30, no. 7 (2002): 1575–1584.
W. Hwang , Y. R. Cho , A. Zhang , and M. Ramanathan , “A Novel Functional Module Detection Algorithm for Protein‐Protein Interaction Networks,” Algorithms for Molecular Biology 1, no. 1 (2006): 1–11.
Y. Y. Ahn , J. P. Bagrow , and S. Lehmann , “Link Communities Reveal Multiscale Complexity in Networks,” Nature 466, no. 7307 (2010): 761–764.
M. Tasgin , A. Herdagdelen , and H. Bingol , “Community Detection in Complex Networks Using Genetic Algorithms,” Physics and Society (2007), Preprint arXiv:07110491.
V. Satuluri , S. Parthasarathy , and D. Ucar , “Markov Clustering of Protein Interaction Networks With Improved Balance and Scalability,” in Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology (ACM, 2010), 247–256.
Y. K. Shih and S. Parthasarathy , “Identifying Functional Modules in Interaction Networks Through Overlapping Markov Clustering,” Bioinformatics 28, no. 18 (2012): i473–i479.
Y. Zhang , “I‐Tasser Server for Protein 3D Structure Prediction,” BMC Bioinformatics 9, no. 1 (2008): 1–8.
S. Zhang , X. Ning , and X. S. Zhang , “Identification of Functional Modules in a PPI Network by Clique Percolation Clustering,” Computational Biology and Chemistry 30, no. 6 (2006): 445–451.
G. Liu , L. Wong , and H. N. Chua , “Complex Discovery From Weighted PPI Networks,” Bioinformatics 25, no. 15 (2009): 1891–1897.
E. Georgii , S. Dietmann , T. Uno , P. Pagel , and K. Tsuda , “Enumeration of Condition Dependent Dense Modules in Protein Interaction Networks,” Bioinformatics 25, no. 7 (2009): 933–940.
B. J. Frey and D. Dueck , “Clustering by Passing Messages Between Data Points,” Science 315, no. 5814 (2007): 972–976.
K. Macropol , T. Can , and A. K. Singh , “Rrw: Repeated Random Walks on Genome‐Scale Protein Networks for Local Cluster Discovery,” BMC Bioinformatics 10, no. 1 (2009): 1–10.
J. Chen and B. Yuan , “Detecting Functional Modules in the Yeast Protein‐Protein Interaction Network,” Bioinformatics 22, no. 18 (2006): 2283–2290.
D. Dotan‐Cohen , A. A. Melkman , and S. Kasif , “Hierarchical Tree Snipping: Clustering Guided by Prior Knowledge,” Bioinformatics 23, no. 24 (2007): 3335–3342.
M. Mete , F. Tang , X. Xu , and N. Yuruk , “A Structural Approach for Finding Functional Modules From Large Biological Networks,” BMC Bioinformatics 9, no. Suppl 9 (2008): 1–14.
S. Asur , D. Ucar , and S. Parthasarathy , “An Ensemble Framework for Clustering Protein‐Protein Interaction Networks,” Bioinformatics 23, no. 13 (2007): i29–i40.
D. Greene , G. Cagney , N. Krogan , and P. Cunningham , “Ensemble Non‐Negative Matrix Factorization Methods for Clustering Protein‐Protein Interactions,” Bioinformatics 24, no. 15 (2008): 1722–1728.
S. Navlakha and C. Kingsford , “Exploring Biological Network Dynamics With Ensembles of Graph Partitions,” Pacific Symposium on Biocomputing 2010 (2010): 166–177.
E. Segal , H. Wang , and D. Koller , “Discovering Molecular Pathways From Protein Interaction and Gene Expression Data,” Bioinformatics 19, no. suppl 1 (2003): i264–i272.
H. Zheng , H. Wang , and D. H. Glass , “Integration of Genomic Data for Inferring Protein Complexes From Global Protein‐Protein Interaction Networks,” IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 38, no. 1 (2008): 5–16.
L. Shi , X. Lei , and A. Zhang , “Protein Complex Detection With Semi‐Supervised Learning in Protein Interaction Networks,” Proteome Science 9, no. 1 (2011): 1–9.
X. L. Li , C. S. Foo , S. H. Tan , and S. K. Ng , “Interaction Graph Mining for Protein Complexes Using Local Clique Merging,” Genome Informatics 16, no. 2 (2005): 260–269.
H. C. Leung , Q. Xiang , S. M. Yiu , and F. Y. Chin , “Predicting Protein Complexes From PPI Data: A Core‐Attachment Approach,” Journal of Computational Biology 16, no. 2 (2009): 133–144.
X. L. Li , C. S. Foo , and S. K. Ng , “Discovering Protein Complexes in Dense Reliable Neighborhoods of Protein Interaction Networks,” Computational Systems Bioinformatics 6 (2007): 157–168.
H. N. Chua , K. Ning , W. K. Sung , H. W. Leong , and L. Wong , “Using Indirect Protein‐Protein Interactions for Protein Complex Prediction,” Computational Systems Bioinformatics 6 (2007): 97–109.
Y. Zhang , H. Lin , Z. Yang , J. Wang , Y. Liu , and S. Sang , “A Method for Predicting Protein Complex in Dynamic PPI Networks,” BMC Bioinformatics 17, no. 7 (2016): 533–543.
E. Hirsh and R. Sharan , “Identification of Conserved Protein Complexes Based on a Model of Protein Network Evolution,” Bioinformatics 23, no. 2 (2007): e170–e176.
C. H. Yong , G. Liu , H. N. Chua , and L. Wong , “Supervised Maximum‐Likelihood Weighting of Composite Protein Networks for Complex Prediction,” BMC Systems Biology 6 (2012): 1–21.
B. Xu , Y. Wang , Z. Wang , J. Zhou , S. Zhou , and J. Guan , “An Effective Approach to Detecting Both Small and Large Complexes From Protein‐Protein Interaction Networks,” BMC Bioinformatics 18, no. 12 (2017): 19–28.
C. H. Yong , O. Maruyama , and L. Wong , “Discovery of Small Protein Complexes From PPI Networks With Size‐Specific Supervised Weighting,” BMC Systems Biology 8 (2014): 1–15.
C. Von Mering , R. Krause , B. Snel , et al., “Comparative Assessment of Large‐Scale Data Sets of Protein‐Protein Interactions,” Nature 417, no. 6887 (2002): 399–403.
L. Giot , J. S. Bader , C. Brouwer , et al., “A Protein Interaction Map of Drosophila melanogaster ,” Science 302, no. 5651 (2003): 1727–1736.
S. Li , C. M. Armstrong , N. Bertin , et al., “A Map of the Interactome Network of the Metazoan C. elegans ,” Science 303, no. 5657 (2004): 540–543.
H. W. Mewes , D. Frishman , U. Guldener , et al., “MIPS: A Database for Genomes and Protein Sequences,” Nucleic Acids Research 30, no. 1 (2002): 31–34.
R. Jansen , N. Lan , J. Qian , and M. Gerstein , “Integration of Genomic Datasets to Predict Protein Complexes in Yeast,” Journal of Structural and Functional Genomics 2, no. 2 (2002): 71–81.
P. Uetz , L. Giot , G. Cagney , et al., “A Comprehensive Analysis of Protein‐Protein Interactions in Saccharomyces cerevisiae ,” Nature 403, no. 6770 (2000): 623–627.
T. Ito , T. Chiba , R. Ozawa , M. Yoshida , M. Hattori , and Y. Sakaki , “A Comprehensive Twohybrid Analysis to Explore the Yeast Protein Interactome,” Proceedings of the National Academy of Sciences 98, no. 8 (2001): 4569–4574.
B. L. Drees , B. Sundin , E. Brazeau , et al., “A Protein Interaction Map for Cell Polarity Development,” Journal of Cell Biology 154, no. 3 (2001): 549–576.
M. Fromont‐Racine , A. E. Mayes , A. Brunet‐Simon , et al., “Genome‐Wide Protein Interaction Screens Reveal Functional Networks Involving SM‐Like Proteins,” Yeast 17, no. 2 (2000): 95–110.
Y. Ho , A. Gruhler , A. Heilbut , et al., “Systematic Identification of Protein Complexes in Saccharomyces cerevisiae by Mass Spectrometry,” Nature 415, no. 6868 (2002): 180–183.
A. C. Gavin , M. Bosche , R. Krause , et al., “Functional Organization of the Yeast Proteome by Systematic Analysis of Protein Complexes,” Nature 415, no. 6868 (2002): 141–147.
A. H. Y. Tong , B. Drees , G. Nardelli , et al., “A Combined Experimental and Computational Strategy to Define Protein Interaction Networks for Peptide Recognition Modules,” Science 295, no. 5553 (2002): 321–324.
M. C. Costanzo , M. E. Crawford , J. E. Hirschman , et al., “Ypd, Pombepd and Wormpd: Model Organism Volumes of Bioknowledge Library, an Integrated Resource for Protein Information,” Nucleic Acids Research 29, no. 1 (2001): 75–79.
R. Oughtred , C. Stark , B. J. Breitkreutz , et al., “The Biogrid Interaction Database: 2019 Update,” Nucleic Acids Research 47, no. D1 (2019): D529–D541.
N. J. Krogan , G. Cagney , H. Yu , et al., “Global Landscape of Protein Complexes in the Yeast Saccharomyces cerevisiae ,” Nature 440, no. 7084 (2006): 637–643.
P. Aloy , B. Bottcher , H. Ceulemans , et al., “Structure‐Based Assembly of Protein Complexes in Yeast,” Science 303, no. 5666 (2004): 2026–2029.
I. Xenarios , L. Salwinski , X. J. Duan , P. Higney , S. M. Kim , and D. Eisenberg , “DIP, the Database of Interacting Proteins: A Research Tool for Studying Cellular Networks of Protein Interactions,” Nucleic Acids Research 30, no. 1 (2002): 303–305.
S. S. Dwight , M. A. Harris , K. Dolinski , et al., “Saccharomyces Genome Database (SGD) Provides Secondary Gene Annotation Using the Gene Ontology (GO),” Nucleic Acids Research 30, no. 1 (2002): 69–72.
L. Kiemer , S. Costa , M. Ueffing , and G. Cesareni , “Wi‐Phi: A Weighted Yeast Interactome Enriched for Direct Physical Interactions,” Proteomics 7, no. 6 (2007): 932–943.
M. Ashburner , C. A. Ball , J. A. Blake , et al., “Gene Ontology: Tool for the Uni Cation of Biology,” Nature Genetics 25, no. 1 (2000): 25–29.
E. L. Hong , R. Balakrishnan , Q. Dong , et al., “Gene Ontology Annotations at Sgd: New Data Sources and Annotation Methods,” Nucleic Acids Research 36, no. 1 (2007): D577–D581.
W. Rungsarityotin , R. Krause , A. Schodl , and A. Schliep , “Identifying Protein Complexes Directly From High‐Throughput Tap Data With Markov Random Fields,” BMC Bioinformatics 8, no. 1 (2007): 1–19.
S. R. Collins , P. Kemmeren , X. C. Zhao , et al., “Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae ,” Molecular and Cellular Proteomics 6, no. 3 (2007): 439–450.
I. Xenarios , D. W. Rice , L. Salwinski , M. K. Baron , E. M. Marcotte , and D. Eisenberg , “Dip: The Database of Interacting Proteins,” Nucleic Acids Research 28, no. 1 (2000): 289–291.
H. W. Mewes , C. Amid , R. Arnold , et al., “MIPS: Analysis and Annotation of Proteins From Whole Genomes,” Nucleic Acids Research 32, no. suppl 1 (2004): D41–D44.
G. Palla , I. Derenyi , I. Farkas , and T. Vicsek , “Uncovering the Overlapping Community Structure of Complex Networks in Nature and Society,” Nature 435, no. 7043 (2005): 814–818.
S. Pu , J. Wong , B. Turner , E. Cho , and S. J. Wodak , “Up‐To‐Date Catalogues of Yeast Protein Complexes,” Nucleic Acids Research 37, no. 3 (2009): 825–831.
P. Jiang and M. Singh , “SPICi: A Fast Clustering Algorithm for Large Biological Networks,” Bioinformatics 26, no. 8 (2010): 1105–1111.
J. Wang , M. Li , J. Chen , and Y. Pan , “A Fast Hierarchical Clustering Algorithm for Functional Modules Discovery in Protein Interaction Networks,” IEEE/ACM Transactions on Computational Biology and Bioinformatics 8, no. 3 (2010): 607–620.
U. Guldener , M. Munsterkotter , M. Oesterheld , et al., “MPact: The Mips Protein Interaction Resource on Yeast,” Nucleic Acids Research 34, no. 1 (2006): D436–D441.
A. Franceschini , D. Szklarczyk , S. Frankild , et al., “STRING v9.1: Protein‐Protein Interaction Networks, With Increased Coverage and Integration,” Nucleic Acids Research 41, no. D1 (2012): D808–D815.
Y. Zhang , H. Lin , Z. Yang , J. Wang , Y. Li , and B. Xu , “Protein Complex Prediction in Large Ontology Attributed Protein‐Protein Interaction Networks,” IEEE/ACM Transactions on Computational Biology and Bioinformatics 10, no. 3 (2013): 729–741.
C. H. Chin , S. H. Chen , C. W. Ho , M. T. Ko , and C. Y. Lin , “A Hub‐Attachment Based Method to Detect Functional Modules From Confidence‐Scored Protein Interactions and Expression Profiles,” BMC Bioinformatics 11, no. 1 (2010): 1–9.
B. Xu and J. Guan , “From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein‐To‐Protein Interaction Networks,” IEEE/ACM Transactions on Computational Biology and Bioinformatics 11, no. 4 (2014): 616–627.
B. P. Tu , A. Kudlicki , M. Rowicka , and S. L. McKnight , “Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes,” Science 310, no. 5751 (2005): 1152–1158.
P. Nymark , P. M. Lindholm , M. V. Korpela , et al., “Gene Expression Profiles in Asbestos‐Exposed Epithelial and Mesothelial Lung Cell Lines,” BMC Genomics 8, no. 1 (2007): 1–14.
A. Ruepp , B. Brauner , I. Dunger‐Kaltenbach , et al., “CORUM: The Comprehensive Resource of Mammalian Protein Complexes,” Nucleic Acids Research 36, no. suppl 1 (2007): D646–D650.
L. Salwinski , C. S. Miller , A. J. Smith , F. K. Pettit , J. U. Bowie , and D. Eisenberg , “The Database of Interacting Proteins: 2004 Update,” Nucleic Acids Research 32, no. suppl 1 (2004): D449–D451.
M. Giurgiu , J. Reinhard , B. Brauner , et al., “CORUM: The Comprehensive Resource of Mammalian Protein Complexes 2019,” Nucleic Acids Research 47, no. D1 (2019): D559–D563.
A. Chatr‐Aryamontri , B. J. Breitkreutz , R. Oughtred , et al., “The Biogrid Interaction Database: 2015 Update,” Nucleic Acids Research 43, no. D1 (2015): D470–D478.
A. Lakizadeh and S. Jalili , “BICAMWI: A Genetic‐Based Biclustering Algorithm for Detecting Dynamic Protein Complexes,” PLoS One 11, no. 7 (2016): e0159923.
L. Ou‐Yang , D. Q. Dai , X. L. Li , M. Wu , X. F. Zhang , and P. Yang , “Detecting Temporal Protein Complexes From Dynamic Protein‐Protein Interaction Networks,” BMC Bioinformatics 15, no. 1 (2014): 1–14.
S. Jalili , S. A. Marashi , and A. lakizadeh , “CAMWI: Detecting Protein Complexes Using Weighted Clustering Coefficient and Weighted Density,” Computational Biology and Chemistry 58 (2015): 231–240.
A. Lakizadeh , S. Jalili , and S. A. Marashi , “PCD‐GED: Protein Complex Detection Considering PPI Dynamics Based on Time Series Gene Expression Data,” Journal of Theoretical Biology 378 (2015): 31–38.
X. F. Zhang , D. Q. Dai , L. Ou‐Yang , and H. Yan , “Detecting Overlapping Protein Complexes Based on a Generative Model With Functional and Topological Properties,” BMC Bioinformatics 15, no. 1 (2014): 1–15.
R. Wang , G. Liu , and C. Wang , “Identifying Protein Complexes Based on an Edge Weight Algorithm and Core‐Attachment Structure,” BMC Bioinformatics 20, no. 1 (2019): 1–20.
M. Wu , Z. Xie , X. Li , C. K. Kwoh , and J. Zheng , “Identifying Protein Complexes From Heterogeneous Biological Data,” Proteins: Structure, Function, and Bioinformatics 81, no. 11 (2013): 2023–2033.
J. Zhao and X. Lei , “Detecting Overlapping Protein Complexes in Weighted PPI Network Based on Overlay Network Chain in Quotient Space,” BMC Bioinformatics 20, no. 25 (2019): 1–12.
D. Szklarczyk , A. Franceschini , S. Wyder , et al., “STRING v10: Protein‐Protein Interaction Networks, Integrated Over the Tree of Life,” Nucleic Acids Research 43, no. D1 (2015): D447–D452.
M. D. McDowall , M. S. Scott , and G. J. Barton , “PIPS: Human Protein‐Protein Interaction Prediction Database,” Nucleic Acids Research 37, no. suppl 1 (2009): D651–D656.
M. Pellegrini , M. Baglioni , and F. Geraci , “Protein Complex Prediction for Large Protein Protein Interaction Networks With the Core&Peel Method,” BMC Bioinformatics 17, no. 12 (2016): 37–58.
A. Maddi and C. Eslahchi , “Discovering Overlapped Protein Complexes From Weighted PPI Networks by Removing Inter‐Module Hubs,” Scientific Reports 7, no. 1 (2017): 1–14.
E. M. Hanna and N. Zaki , “Detecting Protein Complexes in Protein Interaction Networks Using a Ranking Algorithm With a Re Ned Merging Procedure,” BMC Bioinformatics 15, no. 1 (2014): 1–11.
C. C. Friedel , J. Krumsiek , and R. Zimmer , “Bootstrapping the Interactome: Unsupervised Identification of Protein Complexes in Yeast,” Journal of Computational Biology 16, no. 8 (2009): 971–987.
C. Y. Ma , Y. P. P. Chen , B. Berger , and C. S. Liao , “Identification of Protein Complexes by Integrating Multiple Alignment of Protein Interaction Networks,” Bioinformatics 33, no. 11 (2017): 1681–1688.
N. Zaki , D. Efimov , and J. Berengueres , “Protein Complex Detection Using Interaction Reliability Assessment and Weighted Clustering Coefficient,” BMC Bioinformatics 14, no. 1 (2013): 1–9.
W. Peng , J. Wang , B. Zhao , and L. Wang , “Identification of Protein Complexes Using Weighted Pagerank‐Nibble Algorithm and Core‐Attachment Structure,” IEEE/ACM Transactions on Computational Biology and Bioinformatics 12, no. 1 (2014): 179–192.
J. Zhang , C. Zhong , Y. Huang , H. X. Lin , and M. Wang , “A Method for Identifying Protein Complexes With the Features of Joint Co‐Localization and Joint Co‐Expression in Static PPI Networks,” Computers in Biology and Medicine 111 (2019): 103333.
S. Omranian , A. Angeleska , and Z. Nikoloski , “PC2P: Parameter‐Free Network‐Based Prediction of Protein Complexes,” Bioinformatics 37, no. 1 (2021): 73–81.
Q. Liu , J. Song , and J. Li , “Using Contrast Patterns Between True Complexes and Random Subgraphs in PPI Networks to Predict Unknown Protein Complexes,” Scientific Reports 6, no. 1 (2016): 1–15.
Y. Dong , Y. Sun , and C. Qin , “Predicting Protein Complexes Using a Supervised Learning Method Combined With Local Structural Information,” PLoS One 13, no. 3 (2018): e0194124.
D. L. K. Wong , X. L. Li , M. Wu , J. Zheng , and S. K. Ng , “PLW: Probabilistic Local Walks for Detecting Protein Complexes From Protein Interaction Networks,” BMC Genomics 14, no. 5 (2013): 1–15.
C. Stark , B. J. Breitkreutz , T. Reguly , L. Boucher , A. Breitkreutz , and M. Tyers , “Biogrid: A General Repository for Interaction Datasets,” Nucleic Acids Research 34, no. suppl 1 (2006): D535–D539.
M. Wu , X. Li , and C. K. Kwoh , “Algorithms for Detecting Protein Complexes in PPI Networks: An Evaluation Study,” in Proceedings of Third IAPR International Conference on Pattern Recognition in Bioinformatics (PRIB 2008) (Springer‐Verlag, 2008), 15–17.
B. Li and B. Liao , “Protein Complexes Prediction Method Based on Core Attachment Structure and Functional Annotations,” International Journal of Molecular Sciences 18, no. 9 (2017): 1910.
Z. Zhang , J. Song , J. Tang , X. Xu , and F. Guo , “Detecting Complexes From Edge‐Weighted PPI Networks via Genes Expression Analysis,” BMC Systems Biology 12, no. 4 (2018): 29–40.
X. Shen , L. Yi , X. Jiang , et al., “Identifying Protein Complex by Integrating Characteristic of Core‐Attachment Into Dynamic PPI Network,” PLoS One 12, no. 10 (2017): e0186134.
S. Patra and A. Mohapatra , “Protein Complex Prediction in Interaction Network Based on Network Motif,” Computational Biology and Chemistry 89 (2020): 107399.
M. Chellal and I. Benmessahel , “Dynamic Complex Protein Detection Using Binary Harris Hawks Optimization,” Journal of Physics: Conference Series 1642, no. 012019 (2020): 1–6.
Y. R. Cho , W. Hwang , M. Ramanathan , and A. Zhang , “Semantic Integration to Identify Overlapping Functional Modules in Protein Interaction Networks,” BMC Bioinformatics 8, no. 1 (2007): 1–13.
M. Li , J. Chen , J. Wang , B. Hu , and G. Chen , “Modifying the Dpclus Algorithm for Identifying Protein Complexes Based on New Topological Structures,” BMC Bioinformatics 9, no. 1 (2008): 1–16.
G. Geva and R. Sharan , “Identification of Protein Complexes From Co‐Immunoprecipitation Data,” Bioinformatics 27, no. 1 (2011): 111–117.
L. Hu and K. C. Chan , “A Density‐Based Clustering Approach for Identifying Overlapping Protein Complexes With Functional Preferences,” BMC Bioinformatics 16 (2015): 1–16.
Y. Xu , J. Zhou , S. Zhou , and J. Guan , “CPredictor3. 0: Detecting Protein Complexes From PPI Networks With Expression Data and Functional Annotations,” BMC Systems Biology 11, no. 7 (2017): 45–56.
L. Hu , X. Yuan , X. Liu , S. Xiong , and X. Luo , “Efficiently Detecting Protein Complexes From Protein Interaction Networks via Alternating Direction Method of Multipliers,” IEEE/ACM Transactions on Computational Biology and Bioinformatics 16, no. 6 (2018): 1922–1935.
A. SabziNezhad and S. Jalili , “DPCT: A Dynamic Method for Detecting Protein Complexes From Tap‐Aware Weighted PPI Network,” Frontiers in Genetics 11 (2020): 567.
S. Omranian , A. Angeleska , and Z. Nikoloski , “Efficient and Accurate Identification of Protein Complexes From Protein‐Protein Interaction Networks Based on the Clustering Coefficient,” Computational and Structural Biotechnology Journal 19 (2021): 5255–5263.
T. R. Sahoo , S. Vipsita , and S. Patra , “Protein Complex Prediction Based on Dense Sub‐Graph Merging,” International Journal of Data Mining and Bioinformatics 26, no. 3–4 (2021): 129–150.
R. Wang , H. Ma , and C. Wang , “An Ensemble Learning Framework for Detecting Protein Complexes From PPI Networks,” Frontiers in Genetics 13 (2022): 839949.
M. S. Islam , M. R. Islam , and A. S. Ali , “Protein Complex Prediction in Large Protein‐Protein Interaction Network,” Informatics in Medicine Unlocked 30 (2022): 100947.
T. R. Sahoo , S. Vipsita , and S. Patra , “Complex Prediction in Large PPI Networks Using Expansion and Stripe of Core Cliques,” Interdisciplinary Sciences: Computational LIfe Sciences 15, no. 3 (2023): 331–348.
T. R. Sahoo , S. Patra , and S. Vipsita , “Decision Tree Classifier Based on Topological Characteristics of Subgraph for the Mining of Protein Complexes From Large Scale PPI Networks,” Computational Biology and Chemistry 106 (2023): 107935.
L. Hu , J. Zhang , X. Pan , H. Yan , and Z. H. You , “HiSCF: Leveraging Higher‐Order Structures for Clustering Analysis in Biological Networks,” Bioinformatics 37, no. 4 (2021): 542–550.
L. Hu , J. Zhang , X. Pan , X. Luo , and H. Yuan , “An Effective Link‐Based Clustering Algorithm for Detecting Overlapping Protein Complexes in Protein‐Protein Interaction Networks,” IEEE Transactions on Network Science and Engineering 8, no. 4 (2021): 3275–3289.
Y. Yang , G. Li , D. Li , J. Zhang , P. Hu , and L. Hu , “Integrating Fuzzy Clustering and Graph Convolution Network to Accurately Identify Clusters From Attributed Graph,” IEEE Transactions on Network Science and Engineering 12, no. 2 (2025): 1112–1125.
Weitere Informationen
Network clustering is employed in bioinformatics and data mining studies to investigate the structural and functional properties of protein-protein interaction (PPI) networks. In multiple studies over the past two decades, network clustering has proven valuable for uncovering functional modules and elucidating the functions of previously undiscovered proteins. Protein complexes are vital cellular components that play a crucial role in generating biological activity. Experimental techniques have inherent limitations in inferring protein complexes. Given these constraints, numerous computational methods have emerged over the past decade for predicting protein complexes. Typically, these methods take the input PPI data and generate predicted protein complexes as output subnetworks. Most of these methods have shown encouraging outcomes in predicting protein complexes. Prediction is challenging for sparse, small, and overlapping complexes. New strategies should include explicit knowledge about the biological characteristics of proteins to increase performance. Furthermore, specific issues should be considered more effectively in the future while developing new complex prediction algorithms. The bioinformatics community has developed various techniques for clustering PPI networks, which we identified, analyzed, and compared in this paper. This review evaluates various graph clustering algorithms for protein complex identification, facilitating the benchmarking of existing methods, identifying limitations, motivating the development of novel computational tools, and ultimately improving biological insight and therapeutic progress. Through the assessment of strengths and limitations, researchers may develop efficient and scalable algorithms designed explicitly for biological data, integrating graph-based methodologies with machine learning and deep learning approaches. This study is an invaluable tool for new researchers in the area to recognize upcoming trends, including dynamic PPI networks and temporal complex identification.
(© 2025 Wiley Periodicals LLC.)