Onkar R. Litake

I am a second-year Master’s student in the Computer Science department at the University of California, San Diego. My research interests lie in Machine Learning, particularly Natural Language Processing. I have experience working on:

  • AI for Healthcare
  • Low Resource NLP
  • Large Scale Data Creation & Curation

Currently, I’m working on my thesis, which focuses on AI for Healthcare (Perioperative Learning using Artificial intelligence for Timely patient Optimization), advised by Dr. Gabriel Rodney and co-advised by Dr. Julian McAuley, with Dr. Ndapa Nakashole as a member of my thesis committee. We work on topics like prompt engineering (Retrieval-Augmented Generation (RAG), ReAct prompting), model interpretability, and fine-tuning Large Language Models. Previously, I worked with Dr. Pengtao Xie on a data-reweighting-based multi-level optimization framework for domain-adaptive paraphrasing in text augmentation. The work has been accepted in Scientific Reports, a Nature Portfolio journal. I’ve also collaborated with Mr. Aman Chadha, curating an extensive dataset for Question-Answering in Hindi and Marathi.

Before coming to UCSD, I did my bachelor’s at PICT, India. I worked on various NLP projects like Machine Translation, Named Entity Recognition, Hate Detection, Emotion Analysis, and Document Summarisation. I created the first major public gold-standard NER dataset in Marathi. I have workshop publications at top NLP/ML conferences like ACL, EMNLP, COLING, AACL, WMT 22 (EMNLP 22), and LREC.

You can learn more about my publications here. You can find my detailed CV here.

I’m excited about connecting with fellow academics! If our research interests align, I’m keen on exploring collaborations or exchanging ideas. Starting March 2024, I’m actively seeking full-time opportunities in the ML/NLP domain. Feel free to contact me via email if there’s a potential match!

News

Jan 02, 2024 Work on text augmentation using domain adaptation accepted in Scientific Reports, a Nature Portfolio journal.
Jan 01, 2024 Started my thesis on AI for Healthcare (Perioperative Learning using Artificial intelligence for Timely patient Optimization).
Nov 30, 2023 Received a scholarship from GRADWic to attend AAAI-24.
Aug 19, 2023 Created a Question-Answering dataset for low-resource Indic languages.
Jun 01, 2023 Started my internship at UCSD Health as an ML Researcher.

Selected Publications

  1. L3Cube-MahaNER: A Marathi Named Entity Recognition Dataset and BERT models
    Onkar Litake, Maithili Ravindra Sabane, Parth Sachin Patil, and 2 more authors
    In Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, Jun 2022
  2. Mono versus Multilingual BERT: A Case Study in Hindi and Marathi Named Entity Recognition
    Onkar Litake, Maithili Sabane, Parth Patil, and 2 more authors
    In Proceedings of 3rd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications: ICMISC 2022, Jun 2023
  3. PICT@DravidianLangTech-ACL2022: Neural Machine Translation On Dravidian Languages
    Aditya Vyawahare, Rahul Tangsali, Aditya Mandke, and 2 more authors
    In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, May 2022
  4. Improving long COVID-related text classification: a novel end-to-end domain-adaptive paraphrasing framework
    Sai Ashish Somayajula, Onkar Litake, Youwei Liang, and 6 more authors
    Scientific Reports, May 2024