Chee-Keong Choo

​I’m a ​Data Scientist/Analyst with a background in Statistics and Agronomy. My current job focus is on data cleaning and integrate databases with customers' ERP systems. I am also interested in Artificial Intelligence using NLP and while I am not doing that, I enjoy riding my bicycle, watching movies and reading some good books.

Experience

Senior I.T. Analyst/Technician Technician

UPS

Provided data EDA service to customers prior to database integration. The most I have done was 50k packages shipped in a week after EDA. Delivered BI analysis for customer if they have questions about the discrepencies between their invoices and and the shipping charges from the shipping system. Supported UPS's internal and external IT needs throughout the entire Southern AZ. Work well under tight deadlines and good with time management.

May 2008 - Present

Operation/Quality Assurance Supervisor

An-An Disposable Sdn. Bhd.

Assigned and supervised the quality and operation departments. Making data driven decisions and helped startup the company within 2 months of deployment. Hired entire factory staff and workers with help from HR department and Industrial Engineering department.

December 1997 - December 1999

Management Trainee

Fuburg Industrial Co.

My main goal as trainee per the CEO was to acquire industrial cetificates such as 5S, TPM, TQM, 6 sigma and perform BI analysis for the operation and QC department.

July 1995 - December 1997

Lab Assistant

National Taiwan University Lab of Quantitative Genetics

Collaborated with PI Dr. Hwu and worked on watermelon and sorghum species identification using genetic markers and performed designed of experiment analysis on the results.

September 1993 - July 1995

Projects

Loan Marketing Data EDA

Performed data cleaning on a messy bank loan marketing data.
Demonstrated methods to reveal missing values and outliers and
how to deal with them. Visualized charateristics of end results

• Skills/Tools: Google Colab, Pandas, Matplotlib, Seaborn, Numpy

• Picture from Freepik

Handling Outliers and EDA in Pima Native American Dataset

Performed data cleaning on Pima Indian diabetes dataset. There were some
values in the dataset that were invalid if the test subject is
alive, namely bloodpressure 0 and skin thickness 0. We attempt to explain this
scenario and correct them with reasonable values. After that we use boxplot
and other visualization tools to review charateristics of the dataset.

• Skills/Tools: Seaborn, Pandas, IQR, boxplot, univariate and and multivariate plot, Z-Score

• Picture from Freepik

Pima Indian Diabetes Data Visualization

There's a saying, a picture worth a thousand words. In this section we
will be using visualization tools to display the data and paint a picture
of the data and let the data tell us stories of these women

• Skills/Tools: Tableau, Google Data Studio

• Picture from Freepik

Pima Indian Diabetes Classification

Performed diabetes classification on patients from Pima County.
Utilized several classification algorithms to classify diabetes
patients. Discussed ways of handling small imbalanced dataset.
Talked about data leakage and the reason behind up/down sampling.

• Skills/Tools: Pandas, Numpy, Matplotlib, Seaborn,
Pandas Profiling, LogisticRegression, RandomForestClassfier,
XGBClassifier, KNeighborsClassifier, GridSearchCV,

• Picture from Freepik

What causes Native American women to become diabetic?

Performed data cleaning on a messy bank loan marketing data.
Demonstrated methods to reveal missing values and outliers and
how to deal with them. Visualized charateristics of end results

• Skills/Tools: Google Colab, Pandas, Matplotlib, Seaborn, Numpy

• Picture from Freepik

Under Construction

Stay tuned, researching ideas and looking for good stuff to present.

• Skills/Tools: NA

• Picture from Freepik

Education

Thomas Edison State University

Master of Science
Data Science and Analytics

GPA: 4.0

November 2019 - March 2022

A.T. Still University School of Osteopathic Medicine

Master of Science
Medical Informatics

GPA: 4.0

August 2001 - May 2003

National Taiwan University

Bachelor of Science
Agronomy and Statistics

GPA: 3.0

August 1990 - May 1994

Skills

Programming Languages & Tools
Workflow
Download Resume Here

Interests

Apart from being a data scientist/analyst, I enjoy most of my time being outdoors. I enjoy riding my bicycle in the loop (Tucson bike path), watching movies, and reading some good books.

When forced indoor, I spend a large amount of my free time exploring the latest technolgy advancements in the data science world.

Awards & Certifications