"95% of Executives Say Data Drives Business Strategy — So I Build Solutions That Matter"

Hi, I'm Arsalan Anwar

AI Product Developer

Transforming complex data into actionable insights and scalable AI solutions. Experienced in building end-to-end ML pipelines, GenAI applications, and data-driven products that deliver measurable business impact.

4+ Years in AI/ML
15+ Projects Delivered
Ideas & Energy

Education

New York University

Master of Science in Computer Science

New York, NY

Sept 2023 - May 2025

Bangalore Institute of Technology

Bachelor of Technology in Computer Science

Bangalore, India

Aug 2016 - Aug 2020

Leadership & Extracurricular

Beyond technical skills - leadership, community impact, and personal growth

Judging Panel
Judge: NYU 51st Undergraduate Research Conference

New York University

May 2025

Served as a judge for Panel 18 at NYU's 51st URC, evaluating interdisciplinary research in neuroscience, data science, and climate innovation.

Key Skills:

Academic Leadership, Research Evaluation, Mentorship, Interdisciplinary Thinking

Judging Panel
Judge: NYU 50th Undergraduate Research Conference

New York University

May 2024

Evaluated student research in Poster Group 11 (Computer Science), providing feedback and engaging in discussions across diverse STEM fields.

Key Skills:

Academic Leadership, Research Communication, STEM Engagement, Mentoring

Teaching and Academia
Graduate Adjunct Faculty, Dept. of CS

New York University, Courant Institute

Jan 2024 - May 2025

Designed and delivered graduate-level lectures and labs in data science, fostering academic rigor and practical learning.

Key Skills:

Graduate Teaching, Curriculum Design, Academic Mentorship, Instruction

Presentations
Speaker: AI/ML and its Applications

Bangalore Institute of Technology

2021

Spoke to 100+ undergraduates about real-world AI/ML use cases, inspiring exploration in applied machine learning.

Key Skills:

Guest Lecturing, Mentorship, ML Applications, Education

Cultural Leadership
Core Committee Member

Kannada Samskruti Club

2018 - 2020

Led cultural programs, coordinated events, and promoted language-based inclusion for 500+ students.

Key Skills:

Leadership, Event Coordination, Community Building, Teamwork

Teaching and Academia
Adjunct Faculty: Data Science and SQL

Intellipaat

Jan 2023 - Aug 2023

Taught hands-on data science and SQL concepts to 300+ learners across diverse backgrounds through live sessions.

Key Skills:

Instruction, SQL, Data Science, Curriculum Delivery

Presentations
Speaker: ChatGPT & In-Demand AI Skills

UpGrad

May 2023

Explained ChatGPT’s rise and its potential for transforming careers, research, and productivity in AI domains.

Key Skills:

Public Speaking, Generative AI, Career Guidance, Education

Presentations
Speaker: ChatGPT for Research and Innovation

UpGrad

May 2023

Highlighted the role of ChatGPT in accelerating innovation and enabling new research paradigms in AI.

Key Skills:

Public Speaking, AI Tools, Research Applications, Mentoring

Community Leadership
Founder & Researcher

Data Science Crew

Sept 2020 - Present

Founded and scaled a 1,000+ member global community for learning and sharing knowledge in AI/ML domains.

Key Skills:

Community Building, Leadership, Technical Mentoring, Content Strategy

Community Leadership
Author (The Startup & Analytics Vidhya)

Medium

2020 - Present

Authored articles on Reinforcement Learning and ROS, gaining thousands of reads and helping early-career ML enthusiasts.

Key Skills:

Technical Writing, Tutorial Development, ML Education, ROS

Presentations
Speaker: A General Introduction to Data Science and Machine Learning

West Pharmaceutical Services

2022

Delivered an internal talk to West teams, simplifying key ML concepts and bridging knowledge gaps across departments.

Key Skills:

Data Science Education, Public Speaking, Cross-functional Communication, Leadership

Presentations
Speaker: Image Classification with Azure ML

Microsoft User Group Hyderabad

Nov 2020

Conducted a technical session on deploying image classification models using Azure ML, engaging with a broad developer audience.

Key Skills:

Public Speaking, Computer Vision, Azure ML, Community Engagement

Professional Experience

Building innovative solutions across industry-leading organizations

Medidata Solutions (a Dassault Systèmes company)

Data Science Intern

Feb 2025 - Jun 2025
  • Developed an automated validation framework to evaluate model performance for clinical form extraction and edit check generation, reducing manual QA effort.
  • Partnered with cross-functional teams to map historical edit check metadata to study protocols using Snowflake, improving data traceability.
  • Co-designed a POC for the next-gen ML Prediction Service (Coder+) for intelligent medical coding, using a RAG architecture with vector-embedded MedDRA and WHODrug data stored in Mongo Vector DB.

New York University

Data Scientist (Research Assistant)

Jan 2024 - Present
  • Built NYU Mate, a GenAI academic assistant using LLaMA 2 and a LangChain-based RAG pipeline, improving student support satisfaction by 18%.
  • Developed retrieval-augmented search over NYU’s academic and policy documents, achieving over 92% accuracy in answer relevancy.
  • Collaborated with university departments and implemented a scalable backend with FastAPI and vector search (FAISS), reducing student query response time by 65%.

Medidata Solutions (a Dassault Systèmes company)

Data Science Intern

May 2024 - Aug 2024
  • Automated clinical trial consent workflows using NLP and Large Language Models (LLMs), cutting setup time by 45% and costs by $6 million annually.
  • Boosted document processing efficiency by 20% with a custom algorithm for multilingual text extraction, eliminating manual language annotation.
  • Achieved 98% accuracy in form-field mapping using AWS Bedrock and Claude AI, reducing annotation time by 87% (from 15 min to 2 min).

C5i

Data Scientist

Feb 2023 - Aug 2023
  • Built an Apriori-based market basket model to drive upsell strategies for a Fortune 500 client, boosting revenue opportunity identification by 26%.
  • Developed high-value customer segmentation using clustering and purchase behavior, achieving 94% targeting accuracy.
  • Led development of an insights automation pipeline for weekly business reporting, reducing manual workload by 80%.

West Pharmaceutical Services

Data Scientist

Aug 2022 - Feb 2023
  • Deployed real-time computer vision models (XceptionNet) to detect cosmetic defects in syringe products, increasing inspection accuracy to 92%.
  • Automated root-cause defect analysis using time-series clustering, reducing quality investigation time by 40%.
  • Collaborated with manufacturing teams across 3 continents, aligning ML solutions with FDA and ISO 13485 compliance.

West Pharmaceutical Services

Associate Data Scientist

Jul 2020 - Jul 2022
  • Developed an NLP-based document parsing system for internal West documents, improving keyword mapping and retrieval accuracy by 23%.
  • Built robust ELT pipelines on Azure Data Factory for 6+ data sources, reducing data preparation time by 40%.
  • Used Azure ML Studio for model deployment in GxP environments, ensuring reproducibility and traceability in production pipelines.

West Pharmaceutical Services

Graduate Software Trainee

Jan 2020 - Jun 2020
  • Engineered an image classification pipeline for particulate detection in injectables using Azure ML, achieving 97% accuracy.
  • Implemented batch scoring services in Python with REST APIs, improving system scalability across 4 production lines.
  • Documented model interpretability workflows using SHAP to meet regulatory transparency standards.