CV | Sweta Pati's Personal Website

Basics

Name	Sweta Pati
Label	Master's Student in Computer Science
Email	spati@gmu.edu
Phone	+1-571-567-0267
Url	https://github.com/swetapati22
Summary	A dedicated Master's student in Computer Science specializing in Machine Learning with extensive research and 4+ years of professional experience in Data Science, Machine Learning and Natural Language Processing.

Education

May, 2016 - May, 2020

Odisha, India
Bachelor's in Computer Science and Engineering

Kalinga Institute of Industrial Technology

GPA: 3.60
Aug, 2023 - May, 2025

Fairfax, VA
Master's in Computer Science with Specialization in Machine Learning

George Mason University

GPA: 3.93

Work

2024.08 - Present

Virginia, USA
Research Assistant

George Mason University
- Working with Prof. Ziyu Yao in the domain of Information Extraction, more briefly explored generalization in Event Extraction tasks through instruction-tuned LLMs aided by diverse set of annotation guidelines.
- Submitted research paper Instruction-Tuning LLMs for Event Extraction with Annotation Guidelines to ACL 2025.
- Improved F1-score by 10% in EE tasks by synthesizing structured guidelines for 500+ events and 4000+ arguments, and fine-tuning LLaMA-3.1 8B using LoRA and regularization techniques.
- Designed a custom evaluation script to ensure robust benchmarking for Event Argument Extraction (EAE), Event Detection (ED), and End-to-End (E2E) EE tasks.
2024.06 - 2024.08

Virginia, USA
Summer Research Intern

George Mason University
- Working with Prof. Ziyu Yao in the domain of Information Extraction, more briefly explored generalization in Event Extraction tasks through instruction-tuned LLMs aided by diverse set of annotation guidelines.
- Reduced hallucinations and enhanced evaluation consistency by developing a preprocessing pipeline that converts unstructured text into structured Python code prompts.
- Enhanced multi-domain generalization by constructing an ontology for event and argument identification, applicable to 3 EE tasks across 8 domains.
- Enhanced model interpretability and performance by generating synthetic annotation guidelines for 500+ events and 4000+ arguments using GPT-4, enabling more consistent event extraction.
2020.09 - 2023.08

Odisha, India
Data Scientist

Highradius Technologies Private Limited

Implemented and deployed machine learning models for Fortune 500 companies, enhancing operational efficiency and predictive accuracy.
- Increased dollar recovery rate by 12% implementing imbalanced classification ML model (DVP) for fraud detection with 85% average accuracy for 10+ fortune 500 clients.
- Increased sales acceleration and reduced business disruptions by 20% through the optimization of credit decisioning for order release using predictive models.
- Reduced Days Sales Outstanding (DSO) by 30% by building a regression model for Payment Date Prediction (PDP).
- Accelerated DVP and PDP deployment by 48% and 57%, automating the model building and deployment pipelines.
- Productionized ML models for 10+ Fortune 500 clients like Nestle, Kraft Heinz, Clorox, Uber, Kellogg’s, etc.
- Enhanced stakeholder decision-making by visualizing model insights and business impact using Tableau.
- Developed a digital assistant, Freeda, used by Fortune 500 companies to understand intent behind user queries and provide appropriate responses for multiple Highradius cloud products.
- Mentored over 15 interns in data science and business concepts during the 2021 and 2022 internship programs.
2018.06 - 2018.06

West Bengal, India
Research Intern

Indian Institute of Technology, Kharagpur

Developed BCH error correction codes and tested OpenSSL authentication protocols.
- Implemented BCH error correction code and compared existing test suites in OpenSSL with the proposed authentication protocol.
- Published a research paper titled 'An OpenSSL Extension for PUF-based Authentication' in IEEE DSP 2018.

Projects

Project: Medi-Buddy - Multi-Agent Framework for Medical Insights

Developed a multi-agent framework using LLaMA-3.3 70B to extract and summarize evidence-based medical insights from PubMed and Wikipedia.
- Integrated multiple data sources including PubMed, Wikipedia, and DuckDuckGo via APIs.
- Leveraged Groq’s inference API and Phidata orchestration to create a streamlined agent workflow.
- Focused on delivering accurate, explainable, and domain-specific responses in the medical domain using cutting-edge LLMs.
Research Project: Event Extraction

Enhancing Event Extraction using instruction-tuned LLMs to achieve superior performance on unseen domains.
- Improved F1-score by 10% in EE tasks by synthesizing structured guidelines for 500+ events and 4000+ arguments, and fine-tuning LLaMA-3.1 8B using LoRA and regularization techniques.
- Reduced hallucinations and enhanced evaluation consistency by developing a preprocessing pipeline that converts unstructured text into structured Python code prompts.
- Enhanced multi-domain generalization by constructing an ontology for event and argument identification, applicable to 3 EE tasks across 8 domains.
- Designed a custom evaluation script to ensure robust benchmarking for Event Argument Extraction (EAE), Event Detection (ED), and End-to-End (E2E) EE tasks.
Project: Multi-Cloud AI-Driven DevOps

Designed a scalable, AI-powered e-commerce platform using infrastructure and services across AWS, GCP, and Azure.
- Provisioned infrastructure using Terraform with AWS EC2, EKS, and DynamoDB.
- Containerized and deployed frontend/backend apps on AWS EKS using Kubernetes.
- Built CI/CD pipeline with GitHub and AWS CodePipeline for seamless deployments.
- Integrated Amazon Bedrock for product recommendations and OpenAI GPT-4o for customer support.
- Connected AWS Lambda to Google BigQuery and used Azure AI for sentiment analysis.
- Enabled smooth cloud-to-cloud data exchange ensuring system resilience.
Project: Gentopia-Mason - Enhancing LLM Agents with Currency Conversion & PDF Reader

Extended Gentopia to support real-time currency conversion and document extraction with LLM-powered agents.
- Built Currency Conversion Agent using Fixer.io API for real-time FX data.
- Developed PDF Reader Agent using PyPDF2 for extracting structured information from documents.
- Enabled financial, document, and general-purpose NLP capabilities using modular agents.
Project: Counterhate Arguments - Enhancing Online Discourse Against Hate Speech

Reproduced and extended EMNLP study on counterhate arguments to tackle online hate speech effectively.
- Implemented full NLP pipeline for data preprocessing, model training, and evaluation.
- Trained and fine-tuned models like RoBERTa and Longformer for classification tasks.
Project: YouTube Global Statistics Analytics

Developed an interactive web dashboard to visualize global YouTube trends across countries and categories.
- Built six interactive visualizations using Plotly including bar charts, line graphs, heatmaps, and choropleth maps.
- Implemented a global slider filter for dynamic data analysis by view thresholds.
Project: Multilingual Sentence Classification for Media Framing and Bias Detection

Fine-tuned BERT for multilingual sentence classification to detect media bias with 15-class taxonomy.
- Achieved 70% classification accuracy using 450+ annotated sentences.
- Applied data augmentation techniques like translation and paraphrasing to boost model robustness.
Project: Dynamic Bus Routing System

Developed an algorithmic solution to alleviate congestion on specific routes through dynamic allocation of buses.
- Designed a system that dynamically allocates buses based on capacity (via ticketing system) and demand within each route.
- Implemented YOLO object detection to calculate demand at each bus stop and inside buses.
Project: Dynamic Traffic Management System

Created an algorithmic approach to optimize traffic light timing in urban environments.
- Used traffic flow analysis to reduce congestion at key intersections.

Skills

	Programming Languages
	Python
	SQL
	PyTorch
	Numpy
	Pandas
	Scikit-learn
	PySpark

	Data Science and Machine Learning
	Statistics
	Supervised
	Unsupervised
	Classification
	Regression
	Ensemble
	Bagging
	Boosting
	SVM
	Random Forest
	A/B Testing
	Feature Engineering
	Feature Selection
	Model Monitoring

	NLP and Deep Learning
	Transformers
	Event Extraction
	Neural Networks
	RNN
	CNN

	Large Language Models
	LLMs (GPT, LLaMA, BERT)
	Fine-Tuning
	PEFT
	Instruction Tuning
	Prompt Engineering
	RAG
	Agentic AI
	LangChain
	Vector DBs

	Data Visualization
	Data Mining
	Data Analysis
	Tableau
	Seaborn
	Plotly
	Matplotlib
	Streamlit

	MLOps and Deployment
	CI/CD
	Docker
	Kubernetes
	Terraform
	AWS
	Google Cloud (GCP)
	Weights & Biases
	MLflow

	Tools and Platforms
	GitHub
	Jira
	Hugging Face
	OpenAI API
	Groq API
	HPC Environments

Certificates

	Hugging Face Agents Course - Unit 1
	Hugging Face	2025-04

	Google AI Essential
	Google	2025-03

	Event-Driven Agentic Document Workflows
	Deep Learning.AI and LlamaIndex	2025-03

	Oracle Cloud Infrastructure 2024 Generative AI Certified Professional
	Oracle	2024-08

Awards

2020.2021

Best Performer of the Team

Highradius Technologies

Awarded for exceptional contributions across three quarters.
2019.03

Winner, Smart India Hackathon: National Level Hackathon by Govt. of INDIA

Government of INDIA

Recognized for developing an algorithmic solution to alleviate congestion on throughout all routes within a city through dynamic allocation of buses.

Languages

	English
	Fluent

	Hindi
	Fluent

	Odia
	Native Speaker

Basics

Education

Kalinga Institute of Industrial Technology

GPA: 3.60

George Mason University

GPA: 3.93

Work

George Mason University

George Mason University

Highradius Technologies Private Limited

Implemented and deployed machine learning models for Fortune 500 companies, enhancing operational efficiency and predictive accuracy.

Indian Institute of Technology, Kharagpur

Developed BCH error correction codes and tested OpenSSL authentication protocols.

Projects

Developed a multi-agent framework using LLaMA-3.3 70B to extract and summarize evidence-based medical insights from PubMed and Wikipedia.

Enhancing Event Extraction using instruction-tuned LLMs to achieve superior performance on unseen domains.

Designed a scalable, AI-powered e-commerce platform using infrastructure and services across AWS, GCP, and Azure.

Extended Gentopia to support real-time currency conversion and document extraction with LLM-powered agents.

Reproduced and extended EMNLP study on counterhate arguments to tackle online hate speech effectively.

Developed an interactive web dashboard to visualize global YouTube trends across countries and categories.

Fine-tuned BERT for multilingual sentence classification to detect media bias with 15-class taxonomy.

Developed an algorithmic solution to alleviate congestion on specific routes through dynamic allocation of buses.

Created an algorithmic approach to optimize traffic light timing in urban environments.

Skills

Certificates

Awards

Highradius Technologies

Awarded for exceptional contributions across three quarters.

Government of INDIA

Recognized for developing an algorithmic solution to alleviate congestion on throughout all routes within a city through dynamic allocation of buses.

Languages