Mining Large Negative Correlations for High-dimensional Contrasting Analysis
Funding: 2013: $105,000
2014: $105,000
2015: $110,000
Project Member(s): Li, J., Catchpoole, D.
Funding or Partner Organisation: Australian Research Council (ARC Discovery Projects)
Westmead Hospital (Westmead Hospital Partner Funds)
National University of Singapore
Start year: 2013
Summary: Negatively correlated variable groups are studied in many high-dimensional data mining problems. However, the lack of efficient methods for discovering this new type of correlation severely limits its application, for example in gene group contrasting analysis, financial portfolio construction, and coupling behavior detection. This project will develop scalable algorithms to tackle the exponential complexity, and will establish statistical theories to evaluate and rank the correlations discovered from real-life data sets. The research outcomes are expected to advance the knowledge base of data mining substantially and to enable smart information use in bioinformatics and, more broadly, in finance and social network data analysis.
Publications:
Ghosh, S, Li, J, Cao, L & Ramamohanarao, K 2017, 'Septic shock prediction for ICU patients via coupled HMM walking on sequential contrast patterns', Journal of Biomedical Informatics, vol. 66, pp. 19-31.
Liu, Y, Peng, H, Wong, L & Li, J 2017, 'High-speed and high-ratio referential genome compression', Bioinformatics, vol. 33, no. 21, pp. 3364-3372.
Zheng, Y, Ji, B, Song, R, Wang, S, Li, T, Zhang, X, Chen, K, Li, T & Li, J 2016, 'Accurate detection for a wide range of mutation and editing sites of microRNAs from small RNA high-throughput sequencing profiles', Nucleic Acids Research, vol. 44, no. 14, p. e123.
Liu, Q, Ren, J, Song, J & Li, J 2015, 'Co-Occurring Atomic Contacts for the Characterization of Protein Binding Hot Spots', PLOS ONE, vol. 10, no. 12, p. e0144486.
Song, R, Liu, Q, Hutvagner, G, Nguyen, H, Ramamohanarao, K, Wong, L & Li, J 2014, 'Rule discovery and distance separation to detect reliable miRNA biomarkers for the diagnosis of lung squamous cell carcinoma', BMC Genomics, vol. 15.
Li, Z, He, Y, Liu, Q, Zhao, L, Wong, L, Kwoh, CK, Nguyen, H & Li, J 2013, 'Structural analysis on mutation residues and interfacial water molecules for human TIM disease understanding', BMC Bioinformatics, vol. 14, no. Suppl 16, pp. 1-15.
Keywords: data mining, bioinformatics
FOR Codes: Pattern Recognition and Data Mining, Application Tools and System Utilities, Bioinformatics, Bioinformatics and computational biology, Data mining and knowledge discovery, Information systems, technologies and services not elsewhere classified