cv
Basics
| Name | Shirin Shahabi |
| Phone | 365-883-4747 |
| Email | shirin.shahabinejad@gmail.com |
Work
-
2024.05 - Present Hamilton, Canada
Machine Learning Engineer
- Led the development of a verifiable LLM built on Qwen 2.5 with no computational overhead, and designed automated testing using Model Context Protocol (MCP) sampling.
- Led the quantization and post-training team and the infrastructure for open-source LLMs.
- Developed the first Verifiable Agentic AI with proof of inference, enabling verifiable model predictions to meet legal and insurance liability requirements.
- Engineered an automated SR&ED audit documentation pipeline using Google's Flan-T5, integrating GitHub, Slack messages, and Jira to streamline employee documentation.
- Designed and implemented a ranking system based on DeepFM and BERT, using Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO), containerized with Docker.
- Developed an autonomous index agent that optimizes liquidity and slippage.
- Improved inference time by 23% by developing a Commit-Reveal strategy on Kubernetes using GCP instances.
- Developed Cosmos SDK modules for decentralized AI inference platforms.
- Developed a Retrieval-Augmented Generation (RAG) system using LLaMA 3 for the R&D team specializing in Zero-Knowledge Cryptography, deployed on Google Vertex AI.
- Developed a verifiable ray-tracing physical simulation based on the NVIDIA Warp, Google DeepMind MuJoCo, and Autodesk XLB frameworks.
- Applied structured pruning and quantization to time series prediction models, using NVIDIA's pruning methods on 8-billion-parameter LLaMA and ResNet models without loss of accuracy.
-
2022.12 - 2024.09 Hamilton, Canada
Machine Learning Developer
- Developed a trading bot with enhanced risk mitigation strategies to prevent market crashes, leveraging FastDQN reinforcement learning to outperform human traders by 185%.
- Implemented a multi-objective, bi-strategy optimization of Western America's rail-highway transportation network on a high-performance computing cluster, using Gurobi multi-threading and shell scripting, with a focus on class-based Supply Chain Management.
- Implemented a multi-objective, bi-strategy optimization model using hierarchical reinforcement learning, improving on conventional Multi-Armed Bandit approaches by 13% with Meta's PEARL framework.
- Instructed 'Statistical Data Analysis in Healthcare' using Python (EHealth 705), with materials available at: github.com/shirin-shahabi/PythonTutorial_Ehealth705.
-
2022.02 - 2023.01 Remote
Data Scientist
- Developed a scalable, robust, and interpretable ensemble tree-based algorithm for clustering over 4 million customers, leveraging Spark for distributed processing of transaction data; MAU increased from 4M to 8.5M in under a year.
- Led cross-team collaboration with the Product Team to design an end-to-end Marketing Data Modeling and ETL Pipeline using Django and ORM Design.
- Defined 50+ financial KPIs and identified company priorities and targeted measures for strategic growth.
- Used WebEngage for Customer Journey Mapping, achieving a 35% increase in conversion rate.
- Conducted market research and customer survey analysis, driving a 10% increase in user engagement.
- Collaborated on a Power BI and EDA course for managers, promoting dashboard adoption for data-driven decision-making.
- Conducted Instagram and Twitter sentiment analysis, extracting key insights to inform marketing strategies, resulting in a 30% increase in retention rate.
-
2022.01 - 2022.02 Tehran, Iran
Financial Analyst
- Designed a comprehensive business plan for a local hotel construction project valued at over $12 million.
- Conducted cash flow projections and performed a feasibility study.
-
2020.10 - 2021.12 Remote, Canada
Data Analyst
- Contributed to a visionary product with over $100 million in funding for a fully automated shopping experience (Low-Cost Amazon Go), emphasizing indoor location tracking, in collaboration with Rocket Internet.
- Optimized inventory reordering for 15 hypermarkets with a fully automated algorithm covering 8,000+ SKUs.
- Collaborated on loyalty, fraud detection, retention, targeted marketing, and churn prediction programs.
- Mentored two interns on the Data Team, both now successful data scientists: one is pursuing a PhD at the University of Michigan and the other a Master of Computer Science at the University of Waterloo.
- Delivered 100+ optimized SQL queries, benefiting the Marketing, Operations, and Finance teams.
- Established real-time operational KPIs with automated email reports for timely insights, ensuring data accuracy.
- Defined packaging strategies and presented enhanced user experiences, resulting in a direct 10% increase in revenue.
Education
Skills
| Programming Languages | Python, R, SQL (MySQL, Postgres, MongoDB) |
| Cloud Platforms | Google Cloud Platform (Kubernetes, Vertex AI Engine), AWS (Lambda, EC2, SageMaker) |
| Machine Learning Frameworks | TensorFlow, PyTorch, Scikit-learn, Hugging Face |
| AI and NLP Tools | LangChain, NLTK, SpaCy, ONNX |
| MLOps and DevOps Tools | Docker, Git, CI/CD (Argo workflow, Redis, Spark) |
| Web and API Technologies | REST APIs, Streamlit, Shiny App |
| Data Engineering | ETL Pipeline Development, Data Modeling, Data Streaming, Distributed Computing |
| Expertise | Reinforcement Learning, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Natural Language Processing (NLP), Inference Engineering, Compression Techniques, Pre-training and Fine-Tuning, Predictive Modeling, Statistical Analysis, Fraud Detection, Verifiable Inference, Agentic AI |
| Operations Research | Mathematical Optimization, Integer Programming, Stochastic Modeling, Discrete-event Simulation, Meta-heuristic Search Algorithms, Process Mining |