share-CV
Machine Learning Engineer

Pham Van Hoang Phi

📷 Information

  • Name: PHAM VAN HOANG PHI
  • Github: https://github.com/hoangphi1311
  • Email: hoangphi1311.gpa@gmail.com

🏃 Experience

Antsomi - CDP 365 (The Data Outsource Company) (May 2023 - Present)

Data Analyst - Data Integration - Data Scientist Specialist

  • Define requirements and select data
    • Communicate with stakeholders to define business requirements and business rules.
    • Work with the Business Analyst team to understand the data.
    • Work with the Marketing team to plan and launch campaigns.
    • Define and select the data needed for analysis.
    • Build the attributes and metrics needed for analysis.
  • Data Integration and ETL
    • Use DBeaver and PostgreSQL to connect to the Business Database.
    • Transform data using SQL and Python.
    • Integrate data into the CDP platform using Airflow.
  • Data Analysis and Data Visualization
    • Analyze data using Google BigQuery and Python.
    • Build reports using Google Looker Studio and Power BI.
    • Check and verify the data to make sure each report is ready for presentation.
  • Communicate and help the business make decisions
    • Prepare slides and present insights to leadership.
    • Present findings to and communicate with business teams.
  • Automate and Monitor
    • Track daily reports and optimize data flows.
    • Automate processes for efficiency.
    • Coordinate with business teams to address data-related issues.
  • Key Projects Undertaken
    • Daily Operational Dashboard.
    • Customer Segmentation using the RFM model and K-Means clustering.
    • Cohort Retention Analysis.
    • A/B Testing for marketing campaigns.

🖥 Skills

  • Languages: Python, SQL, Google BigQuery, Airflow
  • Frameworks: Hadoop, PySpark
  • Libraries: Matplotlib, Seaborn, Google Data Studio, Tableau, Power BI
  • Databases: PostgreSQL
  • Tools: DBeaver, SSIS, LaTeX, Adobe Photoshop, Adobe Premiere

🗨️ English

📑 Project

Daily Operational Dashboard

  • A dashboard that gives an overview of the store's year-to-date (YTD) business performance and sales metrics.
  • Includes customer segmentation analysis based on RFM metrics.
  • Customer cohort analysis.
  • Forecasting and returned goods prediction.
  • Tools: SSIS (ETL), SQL, Power BI & Tableau (visualization), Python (data cleaning & prediction).

Customer Segmentation

  • Segments customers using RFM metrics and the K-Means clustering algorithm.
  • Includes data cleaning and exploratory data analysis.
  • Calculates an RFM score for each customer.
  • Data preprocessing: transformation and standardization of the data.
  • Customer clustering with the K-Means algorithm, followed by analysis of the clusters.
  • Tools: Python
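
The steps listed above (RFM calculation, standardization, K-Means clustering) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `ORDERS` data, the `SNAPSHOT` date, and the helper names `rfm_table`, `standardize`, and `kmeans` are all hypothetical. It uses raw recency, frequency, and monetary values rather than binned scores, and a small hand-written Lloyd's algorithm with only the standard library so that it runs without extra dependencies.

```python
from collections import defaultdict
from datetime import date
from statistics import mean, pstdev

# Hypothetical transactions: (customer_id, order_date, amount).
ORDERS = [
    ("A", date(2024, 1, 5), 120.0),
    ("A", date(2024, 3, 20), 80.0),
    ("A", date(2024, 3, 28), 150.0),
    ("B", date(2023, 11, 2), 30.0),
    ("C", date(2024, 3, 25), 500.0),
    ("C", date(2024, 3, 30), 450.0),
    ("D", date(2023, 10, 15), 20.0),
    ("D", date(2023, 12, 1), 25.0),
]
SNAPSHOT = date(2024, 4, 1)  # assumed analysis date


def rfm_table(orders, snapshot):
    """Return {customer: (recency_days, frequency, monetary)}."""
    by_customer = defaultdict(list)
    for customer, day, amount in orders:
        by_customer[customer].append((day, amount))
    table = {}
    for customer, rows in by_customer.items():
        last_order = max(day for day, _ in rows)
        table[customer] = (
            (snapshot - last_order).days,      # recency
            len(rows),                         # frequency
            sum(amount for _, amount in rows), # monetary
        )
    return table


def standardize(rows):
    """Z-score each column so no single feature dominates the distances."""
    columns = list(zip(*rows))
    stats = [(mean(col), pstdev(col) or 1.0) for col in columns]
    return [tuple((v - m) / s for v, (m, s) in zip(row, stats)) for row in rows]


def kmeans(points, k, iterations=20):
    """Lloyd's algorithm, seeded with the first k points (deterministic)."""
    centroids = list(points[:k])
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                centroids[j] = tuple(mean(col) for col in zip(*members))
    return labels


rfm = rfm_table(ORDERS, SNAPSHOT)
customers = sorted(rfm)
features = standardize([rfm[c] for c in customers])
segments = dict(zip(customers, kmeans(features, k=2)))
print(segments)
```

On this toy data the recent, frequent buyers (A and C) land in one cluster and the lapsed, low-spend customers (B and D) in the other. With real data, a library implementation with several random initializations would probably be a safer choice than the fixed seeding used here.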