EDUCATION
MS in Information Management
University of Illinois Urbana-Champaign
Expected Graduation: May 2020
Coursework: Distributed Systems, Data Visualization, Statistical Learning, Econometrics
GPA: 3.83/4
BS in Agricultural Chemistry
National Taiwan University
2012
WORK EXPERIENCE
Data Scientist
Meta
Remote, USA
May 2022 - Present
- Developed and implemented KPI metrics for data center construction and operation, leveraging probability, inferential statistics, and machine learning models. Evaluated construction progress, vendor performance, and energy usage for decision making and strategy development.
- Optimized data center performance and improved business efficiency on energy consumption, sustainability, and carbon emission goals by reframing requirements into data questions and providing analysis and dashboards in collaboration with cross-functional stakeholders.
- Delivered ad hoc analyses across various data sources using statistical models and complex SQL window functions.
Data Scientist
Cloud Imperium Games
Austin, TX
Sep 2020 - May 2022
- Initiated the first analytics schemes for user engagement and defined metrics to help executives understand user behaviors and sales patterns, increasing user engagement on events by 15% and supporting the development of revenue-boosting products.
- Established end-to-end analytics pipelines from data sources to reports, using the following tools: ElasticSearch, MySQL, EMR, S3, Hive, Athena, Spark, Shell, Python, Kubernetes, QuickSight, and Tableau.
- Created visuals and dashboards to help stakeholders identify business insights at a glance.
- Designed Tableau dashboard templates to improve readability, consistency, and user experiences.
- Mentored junior analysts in statistical analysis, SQL, and Python programming through 1:1 meetings.
Analytics Intern
Chegg
Santa Clara, CA
Jun 2019 - Aug 2019
- Designed and built an end-to-end streaming data pipeline that collected data from multiple sources and transmitted it to a dashboard using DynamoDB, Spark Structured Streaming, Redshift, S3, and Django. Deployed and automated the pipeline on EC2 using Jenkins.
- Developed an interactive dashboard using D3.js to showcase the real-time data highlights and demonstrate the potential of the team's new service.
Data Scientist/Information Designer
Freelancer
Taipei, Taiwan
Aug 2017 - Aug 2018
- Developed MongoDB and Python scripts to identify key metrics and potential paying customers for 8 Interactive, a chatbot startup in Taiwan, which improved trial-to-paid conversions by 35%.
- Built performance-monitoring dashboards on Redash for 8 Interactive.
- Conducted trend and regression analysis for Cathay Financial Holdings on sustainable energy and inclusion topics, and presented the findings as infographics in their corporate social responsibility report.
Instructor
Freelancer
Taipei, Taiwan
Feb 2018 - Jul 2018
- Instructed a data visualization and journalism course at NCCU, a top-5 university in Taiwan.
- Served as a mentor at a four-day internal data-science-in-R camp at BenQ.
Business Intelligence Engineer
Taiwan Star Telecom
Taipei, Taiwan
Sep 2016 - Jul 2017
- Initiated, designed, and implemented a scalable, easy-to-modify database schema and ETL process for daily cross-channel KPI reports, using Oracle and MySQL. Automated a previously manual report generation process, saving 28 labor-hours per day.
- Optimized SQL scripts, reducing script runtimes from hours to seconds.
Research Assistant
MIT Media Lab
Cambridge, MA
Feb 2016 - Jul 2016
- [Project: Multi-layer interactive maps of cities' topics of interest] Led a team of 3 undergraduate students to analyze civic data through statistics and machine learning methods, from data collection to interactive visualization. Scraped 400k records (reviews, posts, and photos) from social media using Python. Produced descriptive statistics in R for an overview, then applied text mining and computer vision using NumPy, Pandas, NLTK, and TensorFlow to better understand topics of interest.
- [Project: Sightseeing hotspots by time] Processed 200 GB of call detail record data with shell scripts and Python, and visualized it as minute-scale movement on a map in D3.js.
Project Manager & Data Analyst
BigObject
Taipei, Taiwan
Nov 2014 - Jan 2016
- Led pre-sales teams of data and engineering professionals to solve clients’ pain points. Conducted interviews with clients, analyzed their requirements, cleaned and explored their data, and established a self-service analysis and recommendation system pipeline for them. Successfully acquired the startup's first paying customer.
- Built an automated system that collected real-time tweets and piped them to a database via Fluentd for a demo.
- Identified and categorized customers by transactions, user data, and web logs to target consumers for an e-commerce client using R, Python, and BigObject (an analytic database).
Channel Sales
Philips
Taipei, Taiwan
Jan 2013 - May 2014
- Marketed Sonicare products to targeted clinics and increased sales by 20% year over year in 2013.
COMMUNITY
Founder
Cicadata
The first data journalism & visualization community in Taiwan
Sep 2016 - Present
An economics popularization blog
Organizer
Taiwan R User Group
Talks & Events
- May 2014 Study Group: Machine Learning for Hackers, R Ladies Taipei
- Oct 2016 Coordinator, Data Visualization Afternoon Tea by Cicadata
- Dec 2016 Talk: Data Journalism 101, Taiwan R User Group
- Dec 2016 Coordinator & Speaker, Data Visualization - GIS workshop by Cicadata
- Dec 2016 Tech Mentor, Facebook #SheMeansBusiness Workshop Taiwan
- Mar 2017 Coordinator & Mentor, Data Visualization - ggplot2 Workshop by Cicadata
- May 2017 Talk: Data Journalism & Visualization, R Ladies Taipei
- Aug 2017 Coordinator, Women & Data Science Joint Meetup by R Ladies Taipei, PyLadies Taipei & Girls in Tech
- Sep 2017 Talk: Exploratory Data Analysis & Prototyping in PowerBI, R Ladies Taipei
- Jan 2018 Talk: ggplot2 Basic, Taiwan R User Group
- Jan 2018 Coordinator & Mentor, Tableau Day, R Ladies Taipei
- Mar 2018 Panelist, Women in Data Science Conference Taipei 2018
- Aug 2018 Mentor, Kaggle Competition, R Ladies Taipei
- May 2019 Organizer, Visualization Workshop, NATSA Annual Conference
- Jun 2019 Moderator, Women in Silicon Valley, Cafe Philio @ Bay Area