Balu Bhasuran
Visiting Assistant Professor
I am currently a Visiting Assistant Professor in School of Information (iSchool), College of Communication & Information at Florida State University. Prior to this I was a Post Doctoral Fellow at eHealth Lab at iSchool in FSU working under Prof. Zhe He. Prior to this I was a Post Doctoral Fellow at The Bakar Computational Health Sciences Institute (BCHSI), University of California, San Francisco under Dr. Vivek Rudrapatna in the The Real-World Evidence Lab. I have a Ph.D. in Biomedical Informatics under Prof. Jeyakumar Natarajan in the DRDO-BU Center for Life Sciences, TamilNadu, India. I am a proud member of the eHealth Lab @ FSU, Real World Evidence Lab @UCSF, Data Mining and Text Mining Laboratory @BU and Computational Biology Division @ DRDO-BU CLS.
I am a researcher specializing in Information Extraction and Machine Learning within Clinical Data Science and Biomedicine. My current work centers on Clinical NLP, Machine Learning, and Generative AI. With over 10 years of experience, I have developed various classification and prediction models using biomedical data. My interests include causal inference, literature-based discovery, network visualization, knowledge graphs, and ensuring fairness and explainability in machine learning models.
Education
Florida State University Tallahassee, FL
Postdoctoral researcher Jul ’23 –Sep' 24
Mentor: Zhe He, PhD (zhe.he@cci.fsu.edu)
• Supervised investigation of LLMs response to patient-centric lab test result-related questions.
• Investigation into Retrieval augmented Generation (RAG) based LLM systems for patient-centric questions
• Development of a machine learning prediction model for pediatric transplantation rejection using EHR and
transplant registry data
• Investigation of LLMs repones to clinical case report-based differential diagnosis using lab test data.
• Investigation of the role of seasonality in lab test results of Alzheimer’s and Dementia patients using EHR data.
University of California, San Francisco San Francisco, CA
Postdoctoral researcher Jan ’20 – Jul ’23
Mentor: Vivek Rudrapatna, MD, PhD (Vivek.Rudrapatna@ucsf.edu)
• Trained and clinically tested an early diagnosis prediction model for a rare disease using multicenter
electronic health records (UCSF and UCLA)
• Developed a time-to-event model for Non-alcoholic steatohepatitis (NASH) using EHR
• Developed a natural language processing algorithm for identifying NASH patients from clinical notes
• Developed a text classification algorithm for Mayo endoscopic sub-scores using clinical notes.
• Patent pending for rare disease early diagnosis using multicenter electronic health records
• Patent pending for FDA collaboration on the use of LLMs for adverse event detection from clinical notes.
Bharathiar University India
PhD, Computational Biology Sep ’14 – Dec ’20
Thesis: Biomedical Text Mining Approaches: Applications in Disease Entity Recognition, Gene-Disease Association
Extraction and Knowledge Discovery
Topics: Information Extraction, Biomedical Informatics
Advisor: Prof. Jeyakumar Natarajan, PhD (n.jayakumar@yahoo.co.in)
DRDO-BU Center for Life Sciences India
Junior and Senior Research Fellow Jul ’14 – Jun ’2018
Mentor: Jeyakumar Natarajan, PhD (n.jayakumar@yahoo.co.in)
• Machine learning models text mining discovery of High-Altitude Diseases
Mahatma Gandhi University India
MCA, Masters in Computer Applications Oct ’10 – Mar ’13
Thesis: Data Mining to predict efficient allocations in electrical grids
Relevant Coursework: Data structures, Algorithm Analysis and Design
Research Interests
I am a researcher focuses on Information extraction and Machine learning in Clinical data science and Biomedicine. I have 8+ years of experience in developing classification and prediction models. I am interested in Natural Language Processing, Electronic Health Records, Machine Learning and Information Extraction I am also exploring the recent trends in large langugale models (LLMs), causal inference, literature based discovery (LBD), network visualization, knowledge graphs (KGs), and explainable AI (XAI).
Teaching Interests
Natural Language Processing, Data Mining, Text Mining, Machine Learning
Publications & Research
- Balu Bhasuran, Katharina Schmolly, Yuvraaj Kapoor, Nanditha Lakshmi Jayakumar, Raymond Doan, Jigar Amin, Stephen Meninger, Nathan Cheng, Robert Deering, Karl Anderson, Simon W Beaven, Bruce Wang, Vivek A Rudrapatna. Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning Journal of the American Medical Informatics Association.Full text available at JAMIA
- Vivek A Rudrapatna and Balu Bhasuran. Methods for improving the diagnosis of rare diseases using electronic health records data and systems for same World Intellectual Property Organization Full text available at WIPO
- Balu Bhasuran, Sharanya Manoharan, Oviya Ramalakshmi Iyyappan, Gurusamy Murugesan, Archana Prabahar, Kalpana Raja. Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression Biomedicines.Full text available at Biomedicines
- Zhe He, Balu Bhasuran, Qiao Jin, Shubo Tian, Karim Hanna, Cindy Shavor, Lisbeth Garcia Arguello, Patrick Murray, Zhiyong Lu. Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study Journal of Medical Internet Research26 (2024): e56655.Full text available at JMIR
- Anna L Silverman, Balu Bhasuran, Arman Mosenia, Fatema Yasini, Gokul Ramasamy, Imon Banerjee, Saransh Gupta, Taline Mardirossian, Rohan Narain, Justin Sewell, Atul J Butte, Vivek A Rudrapatna. Accurate, Robust, and Scalable Machine Abstraction of Mayo Endoscopic Subscores from Colonoscopy Reports Inflammatory Bowel Diseases, izae068.Full text available at IBD
- Perseus V Patel, Amy Zhang, Balu Bhasuran, Vignesh G Ravindranath, Melvin B Heyman, Sofia G Verstraete, Atul J Butte, Michael J Rosen, Vivek A Rudrapatna, ImproveCareNow Pediatric IBD Learning Health System. Real-world effectiveness of ustekinumab and vedolizumab in TNF-exposed pediatric patients with ulcerative colitis Journal of Pediatric Gastroenterology and Nutrition, Full text available at JPGN
- Anna L Silverman, Madhumita Sushil, Balu Bhasuran, Dana Ludwig, James Buchanan, Rebecca Racz, Mahalakshmi Parakala, Samer El‐Kamary, Ohenewaa Ahima, Artur Belov, Lauren Choi, Monisha Billings, Yan Li, Nadia Habal, Qi Liu, Jawahar Tiwari, Atul J Butte, Vivek A Rudrapatna, Algorithmic Identification of Treatment-Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease Clinical Pharmacology & Therapeutics, Full text available at CPT
- Balu Bhasuran, Shadera Slatter, Gail Fernandes, Boshu Ru, Joe Yang, Xiao Zhang, Ravi Shankar, Jin Ge, Vivek Rudrapatna, S1394 NASHDetection: A Natural Language Processing Method for Identifying Patients With Non-alcoholic Steatohepatitis Using Clinical Notes The American Journal of Gastroenterology, Full text available at ACG
- Balu Bhasuran, Shadera Slatter, Gail Fernandes, Boshu Ru, Joe Yang, Xiao Zhang, Ravi Shankar, Vivek Rudrapatna,Jin Ge. S1395 Uncontrolled Diabetes and Hypertension Are Associated With the Risk of New-Onset Cirrhosis in Patients With Nonalcoholic Steatohepatitis The American Journal of Gastroenterology, Full text available at ACG
- Abhivyakti Yadav, Balu Bhasuran, IR Oviya. A Novel Approach for Classifying DNA Barcodes Using Ensemble NLP Models 2023 International Conference on Research Methodologies in Knowledge Management, Artificial Intelligence and Telecommunication EngineeringFull text available at IEEE
- Abhiram Kunchapu, IR Oviya, Balu Bhasuran. Precision Enhanced Breast Cancer Prediction Using Deep Learning Models 2023 International Conference on Artificial Intelligence for Innovations in Healthcare Industries (ICAIIHI)Full text available at IEEE
- Balu Bhasuran, and Jeyakumar Natarajan. Automatic extraction of gene-disease associations from literature using joint ensemble learning. PloS one. 2018 Jul 26;13(7):e0200699.Full text available at Plos One
- Balu Bhasuran, Gurusamy Murugesan, Sabenabanu Abdulkadhar, and Jeyakumar Natarajan. "Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases." Journal of biomedical informatics 64 (2016): 1-9.Full text available at JBI
- Balu Bhasuran, and Jeyakumar Natarajan. "DisGeReExT: a knowledge discovery system for exploration of disease–gene associations through large-scale literature-wide analysis study." Knowledge and Information Systems (2023): 1-25.Full text available at KAIS
- Balu Bhasuran, Devika Subramanian, and Jeyakumar Natarajan. "Text Mining and Network Analysis to Find Functional Associations of Genes in High Altitude Diseases." Computational Biology and Chemistry (2018).Full text available at CBAC
- Balu Bhasuran "BioBERT and Similar Approaches for Relation Extraction." Biomedical Text Mining. New York, NY: Springer US, 2022. 221-235.Full text available at Biomedical Text Mining
- Balu Bhasuran "Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries." Biomedical Text Mining. New York, NY: Springer US, 2022. 123-140.Full text available at Biomedical Text Mining
- Silverman, Anna L., Balu Bhasuran, Arman Mosenia, Fatema Yasini, Saransh Gupta, Narbe Mardirossian, Rohan Narain, Justin L. Sewell, Atul Butte, and Vivek A. Rudrapatna. "ACCURATE, ROBUST, AND SCALABLE ABSTRACTION OF MAYO ENDOSCOPIC SUBSCORES FROM COLONOSCOPY REPORTS." In GASTROENTEROLOGY, vol. 162, no. 7, pp. S618-S619. 1600 JOHN F KENNEDY BOULEVARD, STE 1800, PHILADELPHIA, PA 19103-2899 USA: WB SAUNDERS CO-ELSEVIER INC, 2022. Full text available at GASTROENTEROLOGY
- Balu Bhasuran, and Jeyakumar Natarajan.(2019) Distant Supervision for Large-Scale Extraction of Gene–Disease Associations from Literature Using DeepDive. In: Bhattacharyya S., Hassanien A., Gupta D., Khanna A., Pan I. (eds) International Conference on Innovative Computing and Communications. Lecture Notes in Networks and Systems, vol 56. Springer, Singapore. Full text available at ICICC
- Subramanian, Devika, Balu Bhasuran, and Jeyakumar Natarajan. "Genomic analysis of RNA-Seq and sRNA-Seq data identifies potential regulatory sRNAs and their functional roles in Staphylococcus aureus." Genomics (2018).Full text available at Genomics
- Maroli, Nikhil,Balu Bhasuran, Jeyakumar Natarajan, and Ponmalai Kolandaivel. "The potential role of procyanidin as a therapeutic agent against SARS-CoV-2: a text mining, molecular docking and molecular dynamics simulation approach." Journal of Biomolecular Structure and Dynamics (2020): 1-16. Full text available at JBSD
- Abdulkadhar, Sabenabanu, Balu Bhasuran, and Jeyakumar Natarajan. "Multiscale Laplacian graph kernel combined with lexico-syntactic patterns for biomedical event extraction from literature." Knowledge and Information Systems (2020): 1-31.Full text available at KAIS
- Natarajan, Jeyakumar, Balu Bhasuran, and Gurusamy Murugesan. "Big Data Analytics: A Text Mining Perspective and Applications in Biomedicine and Healthcare." Big Data Applications in Industry 4.0. Auerbach Publications 367-408. Full text available at Big Data Analytics
- Gnanasegar, S. M.,Balu Bhasuran, and J. Natarajan. "A long short-term memory deep learning network for MRI based Alzheimer’s disease dementia classification." J Appl Bioinforma Comput Biol 9: 6. doi: 10.37532/jabcb. 2020.9 (6) 187 (2020): 2. Full text available at JABCB
- Maroli, Nikhil, Naveen Kumar Kalagatur, Balu Bhasuran, Achuth Jayakrishnan, Renuka Ramalingam Manoharan, Ponmalai Kolandaivel, Jeyakumar Natarajan, and Krishna Kadirvelu. "Molecular Mechanism of T-2 Toxin-Induced Cerebral Edema by Aquaporin-4 Blocking and Permeation." Journal of chemical information and modeling 59, no. 11 (2019): 4942-4958. Full text available at JCIM
- Gurusamy Murugesan, Sabenabanu Abdulkadhar, Balu Bhasuran, and Jeyakumar Natarajan. "BCC-NER: bidirectional, contextual clues named entity tagger for gene/protein mention recognition." EURASIP Journal on Bioinformatics and Systems Biology 2017, no. 1 (2017): 7. Full text available at JBSB
- Han, Wenshan, Balu Bhasuran*, Victorine Muse, Soren Brunak, Lifeng Lin, Karim Hanna, Yu Huang, Jiang Bian, and Zhe He. "Assessing the Seasonality of Lab Tests Among Patients with Alzheimer's Disease and Related Dementias in OneFlorida Data Trust." medRxiv (2024): 2024-03. AMIA 2024 Annual Symposium (Accepted)
- Balu Bhasuran*, Gurusamy Murugesan, and Jeyakumar Natarajan. "Literature Based Discovery (LBD): Towards Hypothesis Generation and Knowledge Discovery in Biomedical Text Mining." arXiv preprint arXiv:2310.03766 (2023). (Under review in ACM Computing Surveys), 2024
Grants & Awards
International Patent, Methods for improving the diagnosis of rare diseases using electronic health records data and systems for same, Inventor: Vivek RUDRAPATNA, Balu BHASURAN, WO2024049873A1 WIPO (PCT), 2024-03-07
American Transplant Congress Pediatric Poster Award Winner” for this presentation:
He, Z, Bhasuran, B., Wang, X., Gupta, D., & Killian, M. O. (June, 2024). Predicting Organ Rejections for Pediatric Heart Transplantations with a Combined Use of Transplant Registry Data and Electronic Health Records. Poster presented at American Transplant Congress (ATC), Philadelphia, PA.
Best paper award, Data Mining, International Conference on Innovative Computing and Communication (ICICC-2018), Springer, India
Best poster award, Text Mining, International Symposium on Computational Biology and Bioinformatics (BioIndica2016),India
Best poster award, Text Mining, Fifth Edition of National Workshop on Computer Vision, Image Processing Techniques and Data Analytics, India, 2015
Indian Copy Right, Computer Software, SW-15957/2023, D-NER: Disease Name Recognizer from Literature, First Author
Indian Copy Right, Computer Software, SW-15958/2023, DisGeReExt: Disease Gene Relation Extractor from Literature,First Author
Indian Copy Right, Computer Software, SW-15974/2023, GD Miner: Gene-Disease Association Mining From Literature, First Author
Filed Under: The Chairman, Defence Research & Development Organization (DRDO), GOVT OF INDIA, DTE OF ER & IPR