Increasing confidence of protein interactomes using network topological metrics
Experimental limitations in high-throughput protein-protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches.
We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally-derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction data sets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction data sets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity.
--by Jin Chen, Wynne Hsu, Mong Li Lee and See-Kiong Ng, Bioinformatics, 2006
We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally-derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction data sets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction data sets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity.
--by Jin Chen, Wynne Hsu, Mong Li Lee and See-Kiong Ng, Bioinformatics, 2006
