CV
You can find my CV here, with a detailed overview of my academic background, professional experience, technical skills, certifications, and achievements. You can download the PDF version of my complete CV by clicking on the button above. If you'd like to know more about my work, feel free to explore the other sections of this website or get in touch with me directly.
Basics
Name | Sweta Pati |
Label | Master's Student in Computer Science |
spati@gmu.edu | |
Phone | +1-571-567-0267 |
Url | https://github.com/swetapati22 |
Summary | A dedicated Master's student in Computer Science specializing in Machine Learning with extensive research and 4+ years of professional experience in Data Science, Machine Learning and Natural Language Processing. |
Education
-
May, 2016 - May, 2020 Odisha, India
-
Aug, 2023 - May, 2025 Fairfax, VA
Master's in Computer Science with Specialization in Machine Learning
George Mason University
GPA: 3.93
Work
-
2024.08 - Present Virginia, USA
Research Assistant
George Mason University
- Working with Prof. Ziyu Yao in the domain of Information Extraction, more briefly explored generalization in Event Extraction tasks through instruction-tuned LLMs aided by diverse set of annotation guidelines.
- Submitted research paper Instruction-Tuning LLMs for Event Extraction with Annotation Guidelines to ACL 2025.
- Improved F1-score by 10% in EE tasks by synthesizing structured guidelines for 500+ events and 4000+ arguments, and fine-tuning LLaMA-3.1 8B using LoRA and regularization techniques.
- Designed a custom evaluation script to ensure robust benchmarking for Event Argument Extraction (EAE), Event Detection (ED), and End-to-End (E2E) EE tasks.
-
2024.06 - 2024.08 Virginia, USA
Summer Research Intern
George Mason University
- Working with Prof. Ziyu Yao in the domain of Information Extraction, more briefly explored generalization in Event Extraction tasks through instruction-tuned LLMs aided by diverse set of annotation guidelines.
- Reduced hallucinations and enhanced evaluation consistency by developing a preprocessing pipeline that converts unstructured text into structured Python code prompts.
- Enhanced multi-domain generalization by constructing an ontology for event and argument identification, applicable to 3 EE tasks across 8 domains.
- Enhanced model interpretability and performance by generating synthetic annotation guidelines for 500+ events and 4000+ arguments using GPT-4, enabling more consistent event extraction.
-
2020.09 - 2023.08 Odisha, India
Data Scientist
Highradius Technologies Private Limited
Implemented and deployed machine learning models for Fortune 500 companies, enhancing operational efficiency and predictive accuracy.
- Increased dollar recovery rate by 12% implementing imbalanced classification ML model (DVP) for fraud detection with 85% average accuracy for 10+ fortune 500 clients.
- Increased sales acceleration and reduced business disruptions by 20% through the optimization of credit decisioning for order release using predictive models.
- Reduced Days Sales Outstanding (DSO) by 30% by building a regression model for Payment Date Prediction (PDP).
- Accelerated DVP and PDP deployment by 48% and 57%, automating the model building and deployment pipelines.
- Productionized ML models for 10+ Fortune 500 clients like Nestle, Kraft Heinz, Clorox, Uber, Kellogg’s, etc.
- Enhanced stakeholder decision-making by visualizing model insights and business impact using Tableau.
- Developed a digital assistant, Freeda, used by Fortune 500 companies to understand intent behind user queries and provide appropriate responses for multiple Highradius cloud products.
- Mentored over 15 interns in data science and business concepts during the 2021 and 2022 internship programs.
-
2018.06 - 2018.06 West Bengal, India
Research Intern
Indian Institute of Technology, Kharagpur
Developed BCH error correction codes and tested OpenSSL authentication protocols.
- Implemented BCH error correction code and compared existing test suites in OpenSSL with the proposed authentication protocol.
- Published a research paper titled 'An OpenSSL Extension for PUF-based Authentication' in IEEE DSP 2018.
Projects
-
Project: Medi-Buddy - Multi-Agent Framework for Medical Insights
Developed a multi-agent framework using LLaMA-3.3 70B to extract and summarize evidence-based medical insights from PubMed and Wikipedia.
- Integrated multiple data sources including PubMed, Wikipedia, and DuckDuckGo via APIs.
- Leveraged Groq’s inference API and Phidata orchestration to create a streamlined agent workflow.
- Focused on delivering accurate, explainable, and domain-specific responses in the medical domain using cutting-edge LLMs.
-
Research Project: Event Extraction
Enhancing Event Extraction using instruction-tuned LLMs to achieve superior performance on unseen domains.
- Improved F1-score by 10% in EE tasks by synthesizing structured guidelines for 500+ events and 4000+ arguments, and fine-tuning LLaMA-3.1 8B using LoRA and regularization techniques.
- Reduced hallucinations and enhanced evaluation consistency by developing a preprocessing pipeline that converts unstructured text into structured Python code prompts.
- Enhanced multi-domain generalization by constructing an ontology for event and argument identification, applicable to 3 EE tasks across 8 domains.
- Designed a custom evaluation script to ensure robust benchmarking for Event Argument Extraction (EAE), Event Detection (ED), and End-to-End (E2E) EE tasks.
-
Project: Multi-Cloud AI-Driven DevOps
Designed a scalable, AI-powered e-commerce platform using infrastructure and services across AWS, GCP, and Azure.
- Provisioned infrastructure using Terraform with AWS EC2, EKS, and DynamoDB.
- Containerized and deployed frontend/backend apps on AWS EKS using Kubernetes.
- Built CI/CD pipeline with GitHub and AWS CodePipeline for seamless deployments.
- Integrated Amazon Bedrock for product recommendations and OpenAI GPT-4o for customer support.
- Connected AWS Lambda to Google BigQuery and used Azure AI for sentiment analysis.
- Enabled smooth cloud-to-cloud data exchange ensuring system resilience.
-
Project: Gentopia-Mason - Enhancing LLM Agents with Currency Conversion & PDF Reader
Extended Gentopia to support real-time currency conversion and document extraction with LLM-powered agents.
- Built Currency Conversion Agent using Fixer.io API for real-time FX data.
- Developed PDF Reader Agent using PyPDF2 for extracting structured information from documents.
- Enabled financial, document, and general-purpose NLP capabilities using modular agents.
-
Project: Counterhate Arguments - Enhancing Online Discourse Against Hate Speech
Reproduced and extended EMNLP study on counterhate arguments to tackle online hate speech effectively.
- Implemented full NLP pipeline for data preprocessing, model training, and evaluation.
- Trained and fine-tuned models like RoBERTa and Longformer for classification tasks.
-
Project: YouTube Global Statistics Analytics
Developed an interactive web dashboard to visualize global YouTube trends across countries and categories.
- Built six interactive visualizations using Plotly including bar charts, line graphs, heatmaps, and choropleth maps.
- Implemented a global slider filter for dynamic data analysis by view thresholds.
-
Project: Multilingual Sentence Classification for Media Framing and Bias Detection
Fine-tuned BERT for multilingual sentence classification to detect media bias with 15-class taxonomy.
- Achieved 70% classification accuracy using 450+ annotated sentences.
- Applied data augmentation techniques like translation and paraphrasing to boost model robustness.
-
Project: Dynamic Bus Routing System
Developed an algorithmic solution to alleviate congestion on specific routes through dynamic allocation of buses.
- Designed a system that dynamically allocates buses based on capacity (via ticketing system) and demand within each route.
- Implemented YOLO object detection to calculate demand at each bus stop and inside buses.
-
Project: Dynamic Traffic Management System
Created an algorithmic approach to optimize traffic light timing in urban environments.
- Used traffic flow analysis to reduce congestion at key intersections.
Skills
Programming Languages | |
Python | |
SQL | |
PyTorch | |
Numpy | |
Pandas | |
Scikit-learn | |
PySpark |
Data Science and Machine Learning | |
Statistics | |
Supervised | |
Unsupervised | |
Classification | |
Regression | |
Ensemble | |
Bagging | |
Boosting | |
SVM | |
Random Forest | |
A/B Testing | |
Feature Engineering | |
Feature Selection | |
Model Monitoring |
NLP and Deep Learning | |
Transformers | |
Event Extraction | |
Neural Networks | |
RNN | |
CNN |
Large Language Models | |
LLMs (GPT, LLaMA, BERT) | |
Fine-Tuning | |
PEFT | |
Instruction Tuning | |
Prompt Engineering | |
RAG | |
Agentic AI | |
LangChain | |
Vector DBs |
Data Visualization | |
Data Mining | |
Data Analysis | |
Tableau | |
Seaborn | |
Plotly | |
Matplotlib | |
Streamlit |
MLOps and Deployment | |
CI/CD | |
Docker | |
Kubernetes | |
Terraform | |
AWS | |
Google Cloud (GCP) | |
Weights & Biases | |
MLflow |
Tools and Platforms | |
GitHub | |
Jira | |
Hugging Face | |
OpenAI API | |
Groq API | |
HPC Environments |
Certificates
Hugging Face Agents Course - Unit 1 | ||
Hugging Face | 2025-04 |
Google AI Essential | ||
2025-03 |
Event-Driven Agentic Document Workflows | ||
Deep Learning.AI and LlamaIndex | 2025-03 |
Awards
- 2020.2021
Best Performer of the Team
Highradius Technologies
Awarded for exceptional contributions across three quarters.
- 2019.03
Winner, Smart India Hackathon: National Level Hackathon by Govt. of INDIA
Government of INDIA
Recognized for developing an algorithmic solution to alleviate congestion on throughout all routes within a city through dynamic allocation of buses.
Languages
English | |
Fluent |
Hindi | |
Fluent |
Odia | |
Native Speaker |