ML Researcher — NLP

Job Openings ML Researcher — NLP

About the job ML Researcher — NLP

Role Overview

This is a full time position, involving the development of Adalat AI's Legal Large Language Models and NLP systems. You will be part of the team in charge of the development and deployment of the Legal Large Language Models.

As an early member of the Team, you will:
Work closely with founding team to develop the models that power our legal copilot for Judges and Stenographers
Identify and drive innovative solutions to address the most critical needs of our Users (Judges at Courts of Different Stages in India).
Work in close collaboration with cross-functional partners in design, backend and frontend functions.
Solve complex engineering problems for ML Platform. Build cost effective and scalable systems.

Key Responsibilities

Design and implement NLP and language processing systems using machine learning and deep learning techniques.
Preprocess, annotate, and manage large datasets of legal text for training and testing purposes.
Develop data augmentation techniques to improve model robustness and generalization.
Train and fine-tune large language models and NLP models.
Implement and experiment with state-of-the-art algorithms to optimize model performance.
Conduct rigorous evaluation of NLP models and provide insights for model improvement.
Collaborate with Product and Engineering to understand user requirements and provide technical solutions.
Ensure compliance with data privacy and security regulations in projects.
Stay updated with state-of-the-art techniques in the NLP field and exchange knowledge with colleagues.
Coordinate with internal teams to translate business challenges into data pipelines and model frameworks.
Document research findings, methodologies, and implementation details.
Communicate progress and results to the team and stakeholders effectively.

Qualifications

Dont worry about ticking all boxes

3+ Years working with NLP and Large Language Models
Knowledge of NLP technologies like Large Language Models, Text Classification, Named Entity Recognition, Information Extraction, Question Answering, Text Summarization.
Hands-on experience in building transformer-based models using architectures like BERT, GPT, T5, LLaMA.
Hands-on experience in fine-tuning large language models for domain-specific applications.
Familiarity with machine learning and deep learning libraries such as Scikit Learn, TensorFlow, PyTorch, and Transformers.
Strong programming experience in languages like Python, Shell scripting etc. A Bachelor's or Master's in Computer Science, Electrical Engineering, or a related field from leading institutes. (a plus)
Experience contributing to research communities, including publications at conferences and/or journals (a plus)

What You Will Achieve in a Year

Built the ML Stack for Court Systems in India to cater to 5000+ courtrooms running 8-10 hrs per day in the first year
Take on some of the hardest ML challenges of company, like building feedback loops to improve NLP performance for legal domain and 10+ Indian Languages.
Built the best multilingual NLP models in Translation and Summarisation for Legal domain.