Santa Barbara, CA • Available June 2025

Data ScienceSpecialist

Statistics and Data Science student at UC Santa Barbara with hands-on experience in healthcare analytics, AI research, and data processing. Passionate about transforming complex datasets into actionable insights through machine learning and statistical modeling.

Natalie Lubich - Data Science Specialist

About Me

With a passion for data-driven decision making, I specialize in extracting meaningful insights from complex datasets to solve real-world business challenges.

Data Analysis

Expert in statistical analysis, data visualization, and exploratory data analysis using Python, R, and SQL to uncover hidden patterns.

Machine Learning

Building predictive models and implementing ML algorithms for classification, regression, and clustering problems across various industries.

Business Intelligence

Creating interactive dashboards and reports that translate complex data into actionable business insights for stakeholders.

Resume

Statistics and Data Science student with proven experience in healthcare analytics and AI research

Download Resume PDF
Education

University of California, Santa Barbara

Bachelor of Statistics and Data Science

GPA: 3.7

Expected Graduation: June 2025

Relevant Coursework:

Machine Learning
Regression Analysis
Time Series
Mathematical Statistics
Data Science with R
Intermediate Python
Linear Algebra
SAS Base Programming
Work Experience

Data Analyst Intern

LeanTaaS

June 2024 - Aug 2024

  • Contributed to a growth-stage company transforming hospital and infusion center operations using lean principles, predictive and prescriptive analytics, and machine learning
  • Refactored over 4,500 lines of legacy Python ELT preprocessing code using Git version control, enhancing data management and readability for sensitive hospital patient data, significantly benefiting 3 key customers
  • Collaborated with Marketing team to analyze 2023-2024 campaign performance, creating Python visualizations that uncovered insights and optimized strategies to meet organizational KPIs
  • Managed complex data integration processes and communicated technical concepts to non-technical stakeholders in healthcare sector

AI Research Intern

Keywords Studios

Jun 2023 - Mar 2024

  • Conducted research in AI-powered natural language processing solutions
  • Orchestrated production of extensive dataset comprising over 3,000 recordings essential for training and fine-tuning AI models
  • Conducted rigorous Quality Assurance processes, ensuring precision and reliability of AI systems
  • Thrived in fast-paced, fully-remote environment while collaborating with multidisciplinary teams

Data Processing Intern

SF Pathways to Citizenship

Jan 2020 - May 2020

  • Boosted runtime of automation tests by 200% for core survey platform
  • Manipulated user data in SQL database and created interactive Tableau dashboard
  • Taught QA principles to fellow interns, enabling increased automation development
Technical Skills

Languages

Python
R
SQL
Java
C
Swift
SAS
JavaScript
HTML/CSS
Bash

Libraries & Frameworks

Pandas
NumPy
BeautifulSoup
Selenium
Flask
Spring Boot
PyTest
JUnit
Bootstrap 5

Tools & Platforms

Git
GitHub Actions
AWS
Tableau
MongoDB
SQLite
Splunk
Figma
Jupyter
Leadership & Volunteering

STEM Subject Tutor

The Village Project SF

Jan 2019 - Mar 2020

  • Empowered students from SF marginalized communities in STEM subjects
  • Mentored students in Math and Computer Science, fostering technical skills and study strategies

Diversity in Tech Advocate

  • Teaching Data Science principles and Algorithms to underrepresented groups
  • Helping empower women and people of color in STEM fields
  • Boosting leadership and public speaking skills in students

Technical Skills

Proficient in modern data science tools and technologies

Programming

Python
R
SQL
JavaScript
Scala

Libraries & Frameworks

Pandas
NumPy
Scikit-learn
TensorFlow
PyTorch

Visualization

Tableau
Power BI
Matplotlib
Seaborn
Plotly

Tools & Platforms

AWS
Google Cloud
Docker
Git
Jupyter

Featured Projects

Academic and personal projects showcasing data science and software development skills

GradeBook Manager
GradeBook | Grade Manager with Auto Grouping
Comprehensive grade management system with automatic statistics calculation and data categorization
  • • Automatically calculates statistics and categorizes course data by coursename, subject
  • • Command line interface with full CRUD operations
  • • Persistent storage using serialization
  • • Comprehensive PyTest test suite for reliability
Python
PyTest
Data Structures
CLI
Maze Solver
MazeSolver | Large-Scale Maze Generation & Solving
High-performance maze solver capable of handling massive 7400x7400 mazes using optimized algorithms
  • • Solves mazes as large as 7400x7400 with optimized data structures
  • • Recursive algorithms and stack-based maze solving
  • • Pre-order traversal with BFS for node discovery
  • • Multiple difficulty settings and storage capabilities
Python
Algorithms
Data Structures
BFS
PyTest

Let's Work Together

Ready to turn your data into actionable insights? Let's discuss your next project.