Data Cleaning

Data cleaning is the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. It is a key part of data science, and it can be deeply frustrating. What should we do to the missing values? Why the dates are not in the correct format? How to clean up inconsistent data entry? These are some of the problems that we will learn to tackle in this course.

August 19, 2023 · 10 min · Kean Teng Blog

Time Series

The most common application of machine learning in the real world is forecasting. For example, businesses forecasting product demand, governments forecasting economic growth and meteorologists forecasting the weather.

August 19, 2023 · 25 min · Kean Teng Blog

Data Visualization

In this course, we will explore on data visualization using seaborn, a Python package to visualize data with a variety of plot types. The package is powerful yet easy to use, check out the below images on the plot types that seaborn is able to generate

August 18, 2023 · 5 min · Kean Teng Blog

Pandas

In this course, we will explore on the Python pandas module which is a popular library for data analysis. With pandas, we can use it to create data and also work or manipulate the existing data.

August 18, 2023 · 11 min · Kean Teng Blog

A Streamlit App For Flood Analysis

In this project, I make use of Streamlit, which is an open-source Python library that allows us to build and deploy powerful apps with speed and efficiency. It also offers a cloud deployment feature for you to host the Streamlit app that you created online publicly through Streamlit Community Cloud.

August 13, 2023 · 5 min · Kean Teng Blog

Flood Risk Modeling With Logistic Regression

In econometrics, the ordinary least square (OLS) model is widely used to estimate the parameter of a linear regression model.

August 13, 2023 · 7 min · Kean Teng Blog

Webscraping On Xpath

Google Sheet IMPORTXML() function is a wonderful tool to allow you to scrape for website information by just identifying the corresponding Xpath. But it has a limit, when you have hundreds or thousands of data items to be gathered, your Sheet’s cell will get stuck at endless loading.

August 13, 2023 · 4 min · Kean Teng Blog

Geospatial Analysis

Geospatial analysis is the gathering, display and manipulation of imagery, GPS, satellite photography and historical data, described explicitly in terms of geographic coordinates. This course will learn on methods to visualize geospatial data and perform some analysis concerning a particular geographic location or region.

April 19, 2023 · 9 min · Kean Teng Blog

Computer Vision

Computer vision literally means computer able to see and recognize stuff. Applications such as Google Lens and Google Image Search are some good examples of where computer vision is being used in our daily life. In this course, we will explore some technique used to empower computer with the power of seeing:

March 20, 2023 · 7 min · Kean Teng Blog

Intro to SQL

For examining and assessing huge datasets and databases, SQL or structured programming language skills plays a vital role to enable us to design and manage data. Some common keywords used in SQL as follows: SELECT, WHERE FROM GROUP BY, HAVING, COUNT ORDER BY AS, WITH JOIN. Date Preparation Since we will be

March 15, 2023 · 7 min · Kean Teng Blog