The Most Visited Job Board For Women For Over 20 Years
LOG IN OR REGISTER TO CONTACT ME
This button will open the login/register page in a new tab. After logging in, come back to this page and refresh your browser.
) Jan - Jun
- Optimized ETL (Extract Transform Load) process from SQL Server to Big Query, reducing data migration time by %.
- Integrated legacy data for states with + features using Python, to facilitate the transition to a new data infrastructure.
- Developed a real-time analytics platform using GCP Looker, delivering insights for risk assessment and policy pricing.
- Leveraged regression analysis using Python to identify high-risk counties, which led to a % reduction in claim forecasts.
- Enhanced forecasting accuracy by % in the Tableau dashboard by incorporating seasonal trends in time-series analysis.
Tata Motors Limited - Senior Manager (Pune, India) Aug - Jun
- Developed data pipelines with Airflow, streamlining ETL processes, resulting in saving $, annually in operational costs.
- Implemented Spark-based analytics for real-time vehicle performance data, supporting the development of new vehicle models.
- Built ETL pipelines for . TB daily vehicle telematics data, enhancing data availability for predictive maintenance analysis.
- Transformed data reporting solutions by connecting AWS Redshift and Tableau with SQL, elevating stakeholder experience.
- Streamlined data collection process using Python from + dealers, boosting customer retention rate by .% by timely analysis.
Hyster Yale Group - Data Analysis Intern (Pune, India) May - Jul
- Created a Python-based tool for automating data cleaning to reduce weekly manual data preparation time by hours.
- Performed supply chain cost analysis using Python, identifying vendor selection process inefficiencies and saving cost by %.
- Consolidated SQL, Excel, and cloud data into Power BI to enhance stakeholder accessibility and data consistency by %.
TECHNICAL SKILLS
Programming Languages & Frameworks: Python (NumPy, Pandas, Matplotlib), Apache Spark, R (tidyverse, ggplot)
Databases: MySQL, Big Query, NoSQL
Data Visualization Tools: Tableau, Power BI, Looker (Google Data Studio), Advanced Excel
Machine Learning Algorithms: Linear Regression, Decision Trees, Clustering
Cloud Services: Google Cloud Services (GCP), Amazon Web Services (AWS), MS Azure
Other Tools & Skills: Apache Airflow, Alteryx, Confluence, Jira, Agile
Certifications: Google Analytics
EDUCATION
Northeastern University, Boston, MA Dec
Master of Science in Engineering Management and Graduate Certificate in Data Analytics GPA - ./.
Courses: Data Mining, Statistical Methods, Probability, Data Management for Analytics, Visualization for Analytics
Extracurriculars: Course Assistant - Operation Research, Secretary - NU American Society for Engineering Management
COEP Technological University, Pune, India Jun
Bachelor of Technology in Mechanical Engineering GPA - .
Courses: Statistics, Numerical Methods and Computer Programming, Project Management
Extracurriculars: President - Student Association, State-Level Swimmer, Rowing and Kayaking at th Regatta, Dance and Art
PROJECTS
Credit Card Fraud Detection (Python) Sep - Dec
- Performed exploratory data analysis and feature engineering, identifying correlated columns with values greater than ..
- Applied different classification models including Random Forest, KNN, etc to reach the best F score of ..
- Developed dashboards to visualize real-time fraud detection results, allowing stakeholders to respond to credit card fraud swiftly.
Predictive Analytics: Advanced Regression Techniques (Python, R) Sep - Dec
- Applied forward selection and correlation analysis to refine model features, enhancing predictability in complex datasets.
- Built Linear Regression and SVR models using scikit learn, focusing on cross-validation and performance metrics.
- Achieved an R-squared value of . with SVR, demonstrating effective model tuning and generalization to new data.
Bike Renting Company Data Analysis (R, Excel, Canva) Jan - Apr
- Suggested advancements in the existing framework by analyzing data of k+ customers, + routes, and + stations.
- Supported claims by plotting + advanced visualizations and conducting network analysis using ggplot, dplyr, and igraph.
- Scrutinized data to find seasonal trends to recommend strategies to efficiently plan future demands and scheduled maintenance.