Available for Data Analyst Roles

Sai Thanish Voore

Data Analyst turning complex datasets into decisions — through SQL, Python, Power BI, and ETL pipelines across healthcare, government, and enterprise domains.

5+ Years Experience
35%+ Query Performance Gain
20% Cost Reduction @ Cigna
1M+ Rows Analyzed
// Core proficiency
SQL / T-SQL: 96%
Power BI / DAX: 90%
Python / Pandas: 88%
ETL Pipelines: 85%
Statistical Analysis: 82%
01 — About

The analyst behind the numbers

I'm a Data Analyst with a Master's in Computational Data Science from Purdue University, currently working at the State of Louisiana – TRS in Baton Rouge, where I manage and analyze pension datasets at scale.

My background spans healthcare analytics at Cigna, enterprise data engineering at Cognizant, and early-career healthcare data work — giving me a rare breadth across clinical, financial, and operational domains.

I believe the best analysis isn't the most complex — it's the one that actually changes a decision. I focus on delivering insights that stakeholders can act on, not dashboards that collect dust.

I also published research on machine learning classification in the International Research Journal of Engineering and Technology (IRJET, 2022).

EDU
Education
MS Computational Data Science — Purdue
DOM
Domains
Healthcare · Government · Logistics · BI
NOW
Currently
DBA Data Analyst @ State of Louisiana TRS
PUB
Published Researcher
IRJET, Vol. 9(3), 2022 — ML Classification
02 — Experience

Where I've built real impact

DBA Data Analyst
State of Louisiana – TRS · Baton Rouge, LA
Apr 2025 – Present
  • Analyze 5M+ row pension datasets using advanced SQL and EDA, reducing report generation time by 30%
  • Optimized 40+ complex SQL queries and redesigned indexing, improving query performance by 35%+
  • Built and deployed 6 Power BI dashboards with DAX measures for real-time KPI visibility
  • Led Oracle → SQL Server migration, validating 20+ transformation scripts for full data consistency
  • Delivered structured analytical reports on bi-weekly sprint cycles across 3 cross-functional teams
Data Science Analyst
The Cigna Group · Indianapolis, IN
Mar 2024 – Dec 2024
  • Led AI-driven EHR optimization using Python, scikit-learn, and Pandas — cut operational costs by 20%
  • Managed and queried 10M+ patient records in MySQL via DBeaver; improved retrieval times by 40%
  • Automated data cleaning pipelines in Python, reducing processing time by 35%
  • Delivered weekly analytical reports translating model outputs into clinical care pathway decisions
Data Solutions Engineer
Cognizant Technology Solutions · Hyderabad, India
Jan 2022 – Dec 2022
  • Built ASP.NET + React.js workflows for a food platform processing 50K+ daily transactions
  • Architected SQL Server schemas for 3M+ records, reducing query latency by 25%
  • Containerized data pipelines with Docker, cutting release cycles by 40% across 4 environments
  • Deployed real-time order-tracking dashboard that reduced support tickets by 15%
Data Analyst
Exposys Data Labs · Remote, India
Jun 2020 – Dec 2021
  • Performed statistical analysis and predictive modeling on healthcare datasets covering 100K+ patient records
  • Executed 30+ SQL scripts resolving data discrepancies, improving data quality by 20%
  • Built Tableau templates adopted across 5 business units for standardized reporting
03 — Skills

The full technical toolkit

Languages & Tools
Python · Pandas · NumPy · scikit-learn · SQL / T-SQL · PL/SQL · R · C# · VBA / Macros · Docker · Git
Analytics & Reporting
Statistical Analysis · Hypothesis Testing · Data Mining · EDA · KPI Tracking · Predictive Modeling · Forecasting · A/B Testing
Visualization
Power BI · DAX · Power Query · Tableau · Executive Dashboards · Operational BI
Data Platforms
SQL Server · Oracle · MySQL · Snowflake · SSIS · SSRS · DBeaver
Domains
Healthcare Analytics · Claims & Clinical · Government / Pension · Logistics & Ops · ETL Pipelines · Data Governance
Process & Methodology
Agile / Scrum · SDLC · CI/CD · Data Modeling · Root Cause Analysis · Stakeholder Reporting
04 — Projects

Things I've built and measured

01
Machine Learning · Python
Customer Behavior Prediction Model

End-to-end ML pipeline for customer segmentation using Python and scikit-learn. Includes feature engineering, PCA dimensionality reduction, cross-validation, and pipeline optimization on a 500K+ row dataset.

Python · scikit-learn · Pandas · PCA · XGBoost
82% Classification Accuracy
500K+ Rows Processed
30% Faster Training
02
Business Intelligence · Power BI
Sales & Customer Insights Dashboard

Interactive Power BI dashboard with 15+ complex DAX measures tracking customer retention, revenue trends, and real-time KPIs across 3 product lines. Identified actionable churn patterns leading to measurable retention improvements.

Power BI · DAX · Power Query · KPI Design
15+ DAX Measures
3 Product Lines
12% Retention Uplift
03
Research · Machine Learning
Student Query Classification — Published Research

Applied machine learning classification techniques to categorize student queries, contributing to intelligent academic support systems. Published in the International Research Journal of Engineering and Technology (IRJET), Vol. 9(3), 2022.

NLP · Classification · Python · Published
IRJET
Published Journal
2022
Vol. 9, Issue 3
04
Healthcare Analytics · ETL
EHR Optimization System @ Cigna

AI-driven Electronic Health Records optimization system built during tenure at The Cigna Group. Combined predictive modeling with automated ETL pipelines to improve clinical decision-making and reduce operational overhead.

Python · scikit-learn · MySQL · ETL · Healthcare
20% Cost Reduction
40% Faster Retrieval
10M+ Records Managed
05 — Education & Certifications

Academic foundation

Master of Science
Computational Data Science — Purdue University
Dec 2024 · GPA 3.4 / 4.0
Bachelor of Engineering
Computer Science — SCSVMV University
Jun 2022 · GPA 3.73 / 4.0
IBM
SQL for Data Science
Amazon Web Services
AWS Cloud Practitioner
DeepLearning.AI
Neural Networks & Deep Learning
06 — Contact

Let's work together

I'm actively looking for Data Analyst roles. If you're hiring or know someone who is — let's talk.

saithanishvoore@gmail.com