Skip to content
View pngo1997's full-sized avatar

Block or report pngo1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pngo1997/README.md

Hi there, I'm Mai Ngo! πŸ‘‹

A passionate Data Scientist with a strong foundation in Machine Learning, NLP, data visualization, and Big Data Processing. I truly believe that data can drive impactful business decisions and foster societal growth. My goal is to extract meaningful insights, develop scalable models, and create data-driven solutions that make a difference to both technical and non-technical audiences.

πŸŽ“ Education

  • Master’s in Data Science (Computational Methods) | DePaul University.
  • Bachelor’s in International Business, Finance, and Economics | University of Wisconsin-Superior.

πŸš€ About Me

  • πŸ“Š Expertise in: Data Science & Statistical Analysis, Machine Learning, Recommender Systems, NLP, Deep Learning (RNN, LSTMs, Transformers), Data Mining & Visualization, Cloud Computing (AWS, Hadoop), and Power BI/Tableau.
  • πŸ“Œ Industry Experience: Data Science & Statistical Analysis, Data Mining & Visualization, Data Warehouse, Business Intelligence, Recommender Systems, Natural Language Processing, Machine Learning Models, Deep Learning, Programming, Compliance, Project Management, Customer Service, Supervision.
  • πŸ”₯ Passionate About: AI, Scalable ML Systems, LLMs, and Data-Driven Decision Making.

πŸ›  Technical Skills

πŸš€ Machine Learning & AI: TensorFlow PyTorch Scikit-Learn Keras Hugging Face
πŸ“Š Data Analytics & Visualization: Pandas NumPy Power BI Tableau
πŸ’Ύ Databases & Cloud: AWS Hadoop SAP S/4HANA SQL
πŸ›  Programming Languages: R Python SQL SAS
πŸ“‚ Other Tools: Jupyter Notebook Salesforce Microsoft Office

πŸ”Ž Key Projects

1️⃣ 🏑 Semantic-driven Hybrid Recommender System for Chicago Airbnb Listings

  • Built a system leveraging embeddings, sentiment analysis, and proximity to train stations to enhance Airbnb recommendations.

2️⃣ 🍽️ Item-based Collaborative Recommender for Yelp Establishments

  • Developed collaborative filtering models to recommend establishments based on shared characteristics.

3️⃣ πŸ“ˆ Financial Data Analysis - AXA Underwriting Insights

  • Power BI Framework for Underwriting Analytics – Built a custom reporting and analysis framework in Power BI to analyze AXA underwriting performance.

4️⃣ πŸ“° Fake News Detection with NLP

  • Designed an NLP-based misinformation classification model using TF-IDF and LSTMs.

5️⃣ πŸ€– Building N-gram Language Models & Retrieval Augmented Generation (RAG)

  • Trained Mistral 7B & GPT-3.5 Turbo to evaluate perplexity and retrieval efficiency.

🌱 Currently Exploring

  • Business analytics framework and data warehouse.
  • Expanding my expertise in LLMs (Mistral, T5), Vector Search, and RAG.
  • Exploring MLOps, Databricks, and scalable ML deployment.
  • Actively seeking new opportunities in Data Science & AI.

πŸ“« Connect with Me

Looking forward to connect with you! ⚑

Popular repositories Loading

  1. Yelp-Business-Recommender-System Yelp-Business-Recommender-System Public

    Building an item-based collaborative recommendation system using embeddings for establishments from the Yelp dataset.

    Jupyter Notebook 1

  2. Sephora-Return-Policy-Evaluation-Data-Warehouse Sephora-Return-Policy-Evaluation-Data-Warehouse Public

    Evaluates Sephora's return policy using data warehousing principles.

    1

  3. Astrophysical-Objects-Classification Astrophysical-Objects-Classification Public

    Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).

    Jupyter Notebook 1

  4. N-gram-Language-Models N-gram-Language-Models Public

    Builds N-gram language modes and applies text generation.

    Jupyter Notebook 1

  5. Chicago-Airbnb-Hybrid-Recommender-System Chicago-Airbnb-Hybrid-Recommender-System Public

    Develops a hybrid recommender system for Chicago Airbnb listings using data from Inside Airbnb.

    Jupyter Notebook 1

  6. AXA-XL-Insurance-BI-Dashboard AXA-XL-Insurance-BI-Dashboard Public

    Provides a comprehensive analysis of insurance submissions, approvals, compliance rates, and profitability for AXA XL Insurance.

    1