I'm a Data Scientist & ML Engineer based in Gurugram, India, currently working at SkillUpTech. I specialize in end-to-end AI solutions — from NLP pipelines and LLM fine-tuning to cloud deployments on AWS and GCP.
I hold a B.Tech in Computer Engineering (Data Science) from J.C. Bose University, YMCA Faridabad. I'm driven by the challenge of turning messy, real-world data into clear, actionable insights.
Outside of work, I contribute to open-source game mods, build personal projects, and always have something new in the queue to learn.
Python
Machine Learning
Deep Learning
NLP / spaCy
SQL
Node.js
Git
Tableau / Power BI
WebSockets
Spearheaded end-to-end design and deployment of production-grade AI solutions, driving measurable business impact across NLP, LLM fine-tuning, and intelligent automation.
Fine-tuned a Qwen 2.5 0.5B instruction model on local hardware using QLoRA (4-bit quantization + LoRA adapters via PEFT/TRL) on the DialogSum conversational summarisation dataset. Raised BERTScore F1 from 0.8381 to 0.8805, a relative gain of about 5% (+0.0424 absolute), demonstrating resource-efficient LLM specialization.
Designed and deployed a multi-agent password reset workflow in n8n with OTP generation and token auth. Separately, built a Gemini-based RAG agent on GCP Vertex AI with vector search — significantly reducing LLM hallucinations.
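For readers unfamiliar with the OTP-plus-token pattern mentioned above, here is a minimal Python sketch of the general idea. It is not the n8n workflow itself, which is built from visual nodes. The function names, the 6-digit length, and the 5-minute validity window are all illustrative assumptions.

```python
import hashlib
import hmac
import secrets
import time

OTP_TTL_SECONDS = 300  # assumed validity window (5 minutes), not from the project


def generate_otp(digits: int = 6) -> str:
    """Return a random numeric code from a cryptographically secure source."""
    return "".join(secrets.choice("0123456789") for _ in range(digits))


def hash_otp(otp: str, salt: bytes) -> str:
    """Keyed hash of the code, so the plain OTP is never stored."""
    return hmac.new(salt, otp.encode(), hashlib.sha256).hexdigest()


def issue_otp(now=None):
    """Create an OTP plus the record a server would keep for later checks."""
    issued_at = time.time() if now is None else now
    otp = generate_otp()
    salt = secrets.token_bytes(16)
    record = {
        "salt": salt,
        "digest": hash_otp(otp, salt),
        "expires_at": issued_at + OTP_TTL_SECONDS,
    }
    return otp, record


def verify_otp(candidate: str, record: dict, now=None):
    """Return a one-time reset token if the code is valid and unexpired, else None."""
    current = time.time() if now is None else now
    if current > record["expires_at"]:
        return None
    expected = record["digest"]
    actual = hash_otp(candidate, record["salt"])
    # Constant-time comparison avoids leaking information through timing.
    if not hmac.compare_digest(actual, expected):
        return None
    return secrets.token_urlsafe(32)


if __name__ == "__main__":
    otp, record = issue_otp()
    token = verify_otp(otp, record)
    print("reset token issued:", token is not None)
```

In a real deployment the record would live in a datastore, the code would be delivered over email or SMS, and each OTP would be invalidated after its first successful use.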
ML web application predicting insurance claim approvals using Decision Trees and Random Forest. Achieved 89% accuracy — a 20% improvement over baseline predictions. Visualized with Matplotlib.
Live messaging platform with bidirectional WebSocket communication via Socket.IO. Flask backend, dynamic HTML/JS frontend. Message latency consistently under 200ms.
CNN-based MNIST digit classifier achieving 97% test accuracy. Used TensorFlow/Keras with Logistic Regression and SVM baselines. Supports real-time input recognition.
NLP model detecting depressive/negative sentiment in social media posts. NLTK preprocessing, Logistic Regression, WordCloud visualizations. 82% accuracy on 10,000+ tweets.
EDA on historical sales data using Pandas, Seaborn, and Matplotlib. Interactive dashboards built in Tableau and Power BI. Resulted in a 15% forecast accuracy improvement.
I'm open to data science, ML engineering, and software development roles. Have a project or opportunity in mind? Let's talk.