"Don't let what you think you canโt do interfere with what you can do."
Apr 2024 - now
- Analyst, Business Insights
, Accounting, Tax & Finance - Hudson's Bay Company - Toronto, Ontario, Canada ๐๐จ๐ฆ
- Responsible for initiatives, applications and realizations of technologies and languages for multiple US & Canada teams: Indirect Tax, Corporate Tax, Accounting, Operations Logistics, and management of involved member & development, testing and production phases
- Develop, upgrade, scale up Alteryx workflows to enable automation on complex reporting processes from manual 2-5 working days to 15-30 minutes of running/processing time (in average)
- Preprocess data in Alteryx, integrate with visualization tool, build up Tableau dashboards to visualize audit of US & Canada business entities' Reconciliation summaries based on locations and states/provinces across multiple dimensions, tax categories
- Enhance, modify workflows which checking and refactoring codes, configurations, functions, abnormalities to audit and optimize Alteryx performance, Oracle & Snowflake SQL analytical queries and Python codes on monthly basis for different projects: Account Payable Tax Recovery, Provincial Tax Rate Adjustment, Courier's Freight Fee and Tax Reconciliation, Business Unit Tax Computation, Cost Below Sales Analysis, Cash Forecast, Tax Returns, Corporate Tax Provision, Card-Related Loyalty Recoveries, Vendors' Tax Code Compliance, State-wise Tax Validation, etc
- Integrate, combine Alteryx nodes with Python's (Extract-Transform-Load) ETL, Machine Learning, Natural Language Processing (NLP) using multiple libraries, models, techniques, decision tree diagrams on multi-classifying US & Canada SKU Tax Codes, Use Tax Rates at SKU, POS, product, category, invoice levels
- Accomplish, perform automatable Alteryx processes to replicate human's traditional accounting tasks, automatically generate sophisticated reports and deliver insights & stories while exchanging discussions with stakeholders across departments, teams and borders
- Cooperate with IT Teams to access and enable Robotic Processing Automation (RPA) flows using UiPath powered by AI Assistant and business intelligence dashboards for Tax-related activities
- Skills: Alteryx, Dataiku, Power BI, Tableau, SQL, Python, Machine Learning
Jan 2023 - Apr 2024
- Alteryx Administrator
, AWS Cloud Ops Data Migration - Billennium IT Inc for Roche (Swiss BioTech), Data Engineering - Integration, Data Services & Insights Foundational Domain - Toronto, Ontario, Canada ๐๐จ๐ฆ
- Translate business needs to technical requirements & synthesize insights, solutions through ServiceNow to technical & non-technical global Roche stakeholders while researching new tools & technologies in business intelligence & ETL areas
- Perform maintenance, upgrade, backup, installation of Designer workflows from Roche on Alteryx Server, MongoDB, AWS
- Develop batch scripting, Rest API in Python, R with complex, efficient SQL queries & extract insights from Tableau reports
- Alteryx User ID 288253: working towards Alteryx Designer Expert Exam and participating in multiple weekly challenges
Jan 2021 - Aug 2022
- Business Insights & Analytics Post-Graduate Program
- Humber College - Toronto, Ontario, Canada ๐๐จ๐ฆ
Jan 2021 - now
- Data Science Intern
(remote) - Cohost AI (founded in San Francisco, USA, based in Ha Noi, Viet Nam) - Toronto, Ontario, Canada ๐๐จ๐ฆ
- Completed 2 projects using
Python, SQL, Power BI, Alteryx, Excel
to research hidden patterns, trends, insights for travel agencies & refactor the code base by developing functions, discussing with stakeholders for each topic to optimize reuse & navigation by 90% - Validated 95% of new metrics: Inventory, Room Night & Sold, Average Daily, Occupancy, RevPAR specialized on each topic for pricing recommendations, & growing seasonal sales, analysis of sources, properties, booking behavior, stay period & length
Jan'22 - Apr 2022
- Data Analyst Intern
- iRestify Inc. (based in Toronto, Canada) - Toronto, Ontario, Canada ๐๐จ๐ฆ
- Supported cross-departmental projects by
Power BI
to analyze insights, & performance from customer surveys & end-user discussion with customized charts, tables, reports usingDAX, MDX
queries with engineered KPIs, ratios, & conditional features - Identified & reported trends, & patterns, measured stakeholdersโ compliance, early, late, and on-time completeness for the operations & customer success to increase productivity by 70% for users to reuse, update, & interact on
Power BI Service App
Aug-Dec 2021
- Data Engineering & Analytics Intern
(remote) - Center of Talent in AI (CoTAI, based in Ho Chi Minh City, Viet Nam) - Toronto, Ontario, Canada ๐๐จ๐ฆ
- Brainstormed with AI Scientist & developed Data Engineering pipeline & database structure in
Python, SQLAlchemy, SQL
to retrieve, preprocess big data in million rows, & generate it from API faster by 10h per load, & function loops in Python for Sentiment Analysis - Built 90% new Tableau charts & metrics to discover driven factors & intentions, minimize complaints, & negative feedback
- Compiled Machine & Deep Learning classifiers tackling imbalanced datasets to detect fraud for Bankingโs Marketing Targets
Topic | more projects available on GitHub & Tableau Public |
---|---|
IEEE-CIS Fraud Detection (Capstone, Humber College) | - Preprocessed data in Python , designed architecture solution, analyzed performance between ML classifiers to determine the best performers on the imbalanced dataset, Balanced Random Forest with ROC AUC around 0.9 & Random Forest with ROC AUC, Precision around 0.9 |
Safe Roads 2022 Competition - Toronto Police Service | - Used Power BI, Python, Azure Machine Learning to analyze geospatial datasets, provide interpretation, conduct A/B testing , determine factors, recommend on road conditions, awareness, top fatal intersections to enhance traffic safety, prevent fatal accidents, achieve prediction using Random Forest โs ROC AUC & Precision around 0.8 |
Sentiment Analysis | - Conducted Sentiment Analysis on customerโs comments & analyzed data generated from a system using Natural Language Processing through API on Fan Pagesโ dialogs of diet products & participated in Data Operations, ETL in Python , SQL in MySQL , Azure , Visualization in Tableau to determine top customers, top efficient fan pages, most crucial intentions & demand entities, peak effective contact hours, peak periods of confirmations, common complaints |
Banking Dataset โ Marketing Targets | - Used classification methods of ML, DL in Python to predict more accurately filing a claim while avoiding overfitting on an imbalanced dataset; - RUS Boost had the highest Balanced Accuracy, Geometric Mean, F1 scores & best Confusion Matrix among classifiers |
SQL Murder Mystery | - Determined the extract murder and killing planner with the shortest-possible SQL queries from basic to intermediate querying skills & approaches using: INNER/LEFT JOIN, GROUP BY, WITH, WHERE, Sub-Queries |
Porto Seguroโs Safe Driver Prediction | - Used classification methods of ML, DL in Python to predict more accurately auto insurance policy holders filing a claim (predict the probability) while avoiding overfitting on imbalanced dataset - RUS Boost had the highest Balanced Accuracy, Geometric Mean, F1 scores & best Confusion Matrix among classifiers |
Acquisition & Merger Analysis | - Compared techniques between loading dataset in Pythonโs SQL Alchemy to MySQL & loading it in SQL to Hadoop , investigated & identified organizations for the most profitable merger and acquisition by examining accumulated data sets in terms of Sales, Revenue, Product Line in SQL on Zeppelin , visualized charts in Tableau , Power BI |
Pharma Portfolio Predictive Analysis | - Coded in Python and AzureML to analyze time-series pharmaceutical sales data and forecast the key pharma product and predict the patterns in the future |
Annual Sales Analysis & Visualization | - Applied EDA in Python , visualized 200K datapoints to answer Revenue questions - Visualized & compared results between charts in Tableau & Power BI to determine that the variables which caused the highest Sales Value: December, San Francisco, peak hours placing orders, top sold products, correlation between Prices & Volumes |
Income Analysis & Classification | - Preprocessed, analyzed the Income background of all records in Python , SQL & visualized key variables in Tableau / Power BI to determine highlights, trends & predictions of Income types with ML, DL Classifiers |
Eden Hotels & Resorts Group | - Created a Sales Incentive Plan in Java : input, check password, calculate Salespersons, Revenues & export reports, calculated Hotel Revenueโs metrics in Excel to analyze, visualize different types of KPIs - Designed Database and inserted sample data into tables of hotels, guests, employees & bookings in SQL queries |
University Admission | - Led a team & built a Java program (< 150 coding lines) to store information of the newly admitted students, prompted user to enter the student name & high school grades, calculated GPA & assigned to the Universityโs schools |
Investment Analysis of Shopify and Lightspeed in Canada | - Managerial Finance & Accounting Report |
Governance & Ethics in Data | - Gained the highest grade of 95% in all Professor's classes analyzing ethics & governance models about data manipulated in Cybersecurity, COVID-19, Vaccination, etc. - Analyzed 3 aspects of the ethics model, data governance to mitigate potential challenges in the chosen context |
TD Bank's Porterโs Value Chain Analysis (available for being shown only in a section) | - Conducted an analysis of TD Bank over history, vision, mission, strategic and financial objectives, External environment based on PESTEL and Five Forces analysis, Internal environment based on SWOT-analysis, resource and capability analysis, and a value chain analysis, the current strategic approach and its various strategic actions, the staffing practices and strategy execution, Organizational structure. |
Better Working Word - EY, NASA, Microsoft | - Using Python , Machine Learning , Azure Studio , Azure Machine Learning in 3 challenges for 3 months to help locate and protect the biodiversity of frogs by discovering and counting local and global frogs on weather data sampled over space and time (spatiotemporal sampling) with given preliminary F1 score. |
US Medicaid Pharmacy Pricing Analysis | - Establishing tables by nodes and Graph on Neo4j in Cypher, and on Azure in SQL to predict future prices/quantities and important pharmaceutical products of US Medicaid datasets in Python, AzureML |
Home Credit Default Risk | - Connected, transformed datasets, conducted EDA in SQL , Scala on Hive , Zeppelin on customized datasets on the to analyze the loan applicants' background and help expanding to those unable to access financial services - Determined on Zeppelin/ Tableau / Power BI the most significant background check of applicants who got most loan approvals |
Courses | Details |
---|---|
Data Analytics Tools โ | SAS, SPSS Modeler, SPSS, Excel, Cognos |
Managerial Finance & Accounting โ | Excel (Investment Analysis of Shopify and Lightspeed in Canada) |
Big Data โ | Hadoop, R, Neo4j, Cypher, Graph |
Quantitative Research Methods I & II โ | Descriptive & Inferential Statistics, Probability, Normal Distribution, Estimation, Hypothesis Testing |
Database & SQL โ | SQL, ERD, Normalization |
Governance & Ethics in Data โ | Reflection & Integration of Knowledge: Governance & Ethics of Analytics in in Data, AI & Technology - only available from hyperlink in my Resume - (graded 95/100 & feedbacked by Professor. Kathleen Mcginn ๐ง : "My goodness Phuong,Thank you for sharing this with me. It is indeed a very deep, intelligent and meaningful piece of writing that deserves an excellent grade - 95 (!) - the highest grade I have given so far. Congratulations - you have truly earned it." ) |
Canadian Business & Strategy โ | TD Bank's Porterโs Value Chain Analysis & Nucor Corporation Analysis |
Marketing โ | |
Predictive Analytics โ | linear and multiple regression, decision trees, linear programming, factor analysis, cluster analysis, modelling |
Machine Learning and Programming 1 & 2 โ | Python: Data Mining, Data Science, Data Visualization, Dimension Reduction, CRM, Evaluation Predictive Performance, Multiple Linear Regression, K-NN, Naives Bayes Classifier, Classification, Regression Trees, Logistic Regression, Cluster Analysis |
Communication & Data Visualization โ | Excel, Tableau |
Business Intelligence โ | Power BI |
Machine Learning and Programming 2 โ | Python: Time Series Forecasting, Market Basket Analysis, Natural Language Processing |
Capstone Course โ | IEEE-CIS Fraud Detection (Capstone, Humber College) |
Project Management โ | Boeing Aviation Case Report of Sales and Supply Boost |
Criteria | Details |
---|---|
Programming | Certified SQL, Python (Pandas, Numpy, Matplotlib, Keras, SkLearn), Tensorflow Developer (in progress), T-SQL, PL/pgSQL, Java, Scala, R, HTML |
Viz & ETL | Certified Power BI, Tableau Desktop, Alteryx Advanced Designer, Alteryx Designer Cloud Advanced, Alteryx Machine Learning Fundamentals, Tableau Prep, SPSS (Modeler, Statistics), SAS (Studio, Enterprise Miner), Cognos, Qlik |
Big Data | Certified Azure Data Fundamentals, Azure AI Fundamentals, Alteryx Server Administration, Databricks Accredited Lakehouse Fundamentals, AWS (ML & Data Analytics), Azure (ML, Synapse), MySQL, MongoDB, MS SQL, Oracle, PostgreSQL, Hadoop (Hive, Zeppelin), Neo4j, Splunk |
Collaboration wiki | Atlassian Confluence, Jira, Trello |
Languages | English ๐บ๐ฒ (fluent), Vietnamese (native), French ๐จ๐ฆ๐จ๐ต (basic overall, intermediate reading), German ๐ฉ๐ช (basic overall, intermediate reading) |
Others | Certified Six Sigma White Belt, Excel (Solver, GoalSeek, Macros), GDPR, ServiceNow, Confluence, Jira, Trello, Machine & Deep Learning, AI, Teamwork, Statistics, Probability, Sales, Accounting, Finance, Project Management, Hospitality, Presentation, Communication, Marketing |
Earned ๐ | Details |
---|---|
ProtonX | Tensorflow Developer (Statistics, Probability, Algebra, Machine Learning, Deep Learning, AI) |
Center of Talent in AI | Python, Machine Learning, Deep Learning, AI, Reinforcement Learning |
Nordic Coder | Python, Tableau |
DataCamp | SQL Intermediate |
Microsoft Office Specialist | Word, Excel, Powerpoint |
Udemy | Power BI for Business Intelligence |