- Thanks for popping by! As an avid learner, bold builder, curious explorer, and driven doer with a bias towards action, I enjoy seeking and solving meaningful problems with data and tech while having fun at the same time.
- I welcome you to join me on a journey of data science discovery! Follow me on GitHub, Medium, and LinkedIn to stay updated with more engaging and practical content.
- You can find my data science portfolio here, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to connect for a chat (coffee or virtual) to discuss shared interests and topics!
- Computer Vision
- Database Management
- Data Extraction and Web Scraping
- Data Science Certification Guides
- Data Science Toolkit
- Data Science in the Real World
- Generative AI and Agentic AI
- Insights from Data Science Talks
- Machine Learning
- MLOps
- Natural Language Processing
- Networks and Graphs
- Responsible AI
- Sports Analytics
- Visualization
- Web Development
- Web3 and Metaverse
- Writing for DataCamp
- Writing Tips
Projects with โญ are my personal favourites, so do check them out!
| Title | Article | Repo |
|---|---|---|
| Classifying Images of Alcoholic Beverages with fast.ai v2 | ๐ | ๐ |
| Russian Car Plate Detection with OpenCV and TesseractOCR | ๐ | ๐ |
| Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER) | ๐ | ๐ |
| Top Python libraries for Image Augmentation in Computerย Vision | ๐ | ๐ |
| โญ PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet | ๐ | ๐ |
| Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| โญ Definitive Guide to Creating a SQL Database on Cloud with AWS and Python | ๐ | ๐ |
| PyMySQLโ-โConnecting Python andย SQL for Data Science | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| Using OneMap API to extract Singapore postal codes, coordinates and travel distance | - | ๐ |
| A Detailed Web Scraping Walkthrough Using Python and Selenium | ๐ | ๐ |
| โญ How to Web Scrape Wikipedia using LangChain Agents and Tools with OpenAI's LLMs and Functionย Calling | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| 3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks | ๐ | ๐ |
| 3 Steps to Get Tableau Desktop Certified in 2 Weeks | ๐ | - |
| โญ No-Frills Guide to Passing the AWS Certified Machine Learning Specialty Exam | ๐ | - |
| Microsoft Certified: Azure AI Engineer Associate - Study Notes | - | ๐ |
| Title | Article | Repo |
|---|---|---|
| Common Python codes for Data Wrangling | - | ๐ |
| Enhance your Python codeโs readability with pycodestyle | ๐ | - |
| Free Resources for Generating Realistic Fake Data | ๐ | - |
| Most Starred and Forked GitHub Repos for Data Science and Python | ๐ | - |
| Most Starred and Forked GitHub Repos for Data Science and R | ๐ | - |
| Automatically Generate Machine Learning Code with Just a Few Clicks | ๐ | - |
| Read and Modify Image Metadata withย Python | ๐ | ๐ |
| Top Tips to Google Search Like a Seasoned Data Scientist | ๐ | - |
| How to Swap Day and Month of Incorrectly Formatted Excel Dates | ๐ | - |
| Title | Article | Repo |
|---|---|---|
| Exploring Illegal Drugs in Singapore โ A Data Perspective | ๐ | ๐ |
| Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution | - | ๐ |
| Healthcareโs AI Future โ In Conversation with Andrew Ng and Fei-Fei Li | ๐ | - |
| Real-World Data Science Use Cases in the Insurance Industry | ๐ | - |
| โญ Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| Generative AI Pharmacist - Macy | ๐ | ๐ |
| โญ ChatPod - Q&A over your Podcasts with Whisper, FAISS, and LangChain | ๐ | ๐ |
| โญ Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Documentย Q&A | ๐ | ๐ |
| โญ Finance-LLMs - Comprehensive Compilation of Real-World LLM Implementation in Financial Services | ๐ | ๐ |
| โญ Text-to-Audio Generation with Bark, Clearly Explained | ๐ | ๐ |
| Guide to ChatGPT's Advanced Settings โ Top P, Frequency Penalties, Temperature, and More | ๐ | - |
| Inside the Leaked System Prompts of GPT-4, Gemini 1.5, Claude 3, andย More | ๐ | - |
| โญ Exposing Jailbreak Vulnerabilities in LLM Applications with ARTKIT | ๐ | ๐ |
| How to Benchmark DeepSeek-R1 Distilled Models on GPQA Using Ollama and OpenAI's simple-evals | ๐ | ๐ |
| Using Googleโs LangExtract and Gemma for Structured Data Extraction | ๐ | ๐ |
| โญ How Agent Handoffs Work in Multi-Agent Systems | ๐ | ๐ |
| How Agents Plan Tasks with To-Do Lists | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| Bridging AIโs Proof-of-Concept to Production Gap โ Insights from Andrew Ng | ๐ | - |
| Title | Article | Repo |
|---|---|---|
| Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis | ๐ | ๐ |
| Using Ensemble Regressors to Predict Condominium Rental Prices | ๐ | ๐ |
| The Dying ReLU Problem, Clearly Explained | ๐ | - |
| Why Bootstrapping Actually Works | ๐ | - |
| โญ Assumptions of Logistic Regression, Clearly Explained | ๐ | ๐ |
| Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish | ๐ | ๐ |
| Credit Card Fraud Detection with AutoXGB | ๐ | ๐ |
| โญ Micro, Macro & Weighted Averages of F1 Score, Clearly Explained | ๐ | - |
| Principal Component Regression - Clearly Explained and Implemented | ๐ | ๐ |
| โญ Feature Selection with Simulated Annealing in Python, Clearly Explained | ๐ | ๐ |
| Quick Primer on Types of Missing Data and Imputation Techniques | ๐ | - |
| Imputation of Missing Data in Tables withย DataWig | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| Key Learning Points from MLOps SpecializationโโโCourse 1/4 | ๐ | ๐ |
| Key Learning Points from MLOps SpecializationโโโCourse 2/4 | ๐ | ๐ |
| Key Learning Points from MLOps SpecializationโโโCourse 3/4 | ๐ | ๐ |
| Key Learning Points from MLOps SpecializationโโโCourse 4/4 | ๐ | ๐ |
| โญ End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell | ๐ | ๐ |
| โญ How to Dockerize Machine Learning Applications Built with H2O, MLflow, FastAPI, and Streamlit | ๐ | ๐ |
| โญ Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| COVID-19 Vaccine โ Whatโs the Public Sentiment? | ๐ | ๐ |
| Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| โญ Network Analysis and Visualization of Drug-Drug Interactions | ๐ | ๐ |
| How to Deploy Interactive Pyvis Network Graphs on Streamlit | ๐ | ๐ |
| A No-Code Approach to Building Knowledge Graphs | ๐ | ๐ |
| โญ Text-to-SQL with GraphRAG on Neo4j Knowledge Graph Semantic Representation of SQL Databases | Coming Soon | ๐ |
| Title | Article | Repo |
|---|---|---|
| Responsible AI Masterclass (for Institute of Banking and Finance Singapore) | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| โญ Analyzing English Premier League VAR Football Decisions | ๐ | ๐ |
| Combining Python and R for FIFA Football World Ranking Analysis | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash) | - | ๐ |
| Visualizing Fortune 500 Companies in a Bar Chart Race | ๐ | ๐ |
| How to Easily Draw Neural Network Architecture Diagrams | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| โญ Post COVID-19 Vaccination Wait-Time Tracker (with Python Flask) | ๐ | ๐ |
| From HTTP to HTTPS โ Easily Secure Flask Web Apps With Talisman | ๐ | - |
| โญ Food King Directory (in collaboration with Night Owl Cinematics) | ๐ | ๐ |
| Title | Article | Repo |
|---|---|---|
| The Web3 / Metaverse Glossary โ A Keyword Guide to the Tech Future | ๐ | - |
| Title | Article | Repo |
|---|---|---|
| โญ What Mature Data Infrastructure Looks Like | ๐ | - |
| Democratizing Data in Government Agencies | ๐ | - |
| A Survey Into Data Governance Tools | ๐ | - |
| Scaling Data Science With Data Governance | ๐ | - |
| 3 Reasons Why All Teams Should Learn SQL | ๐ | - |
| 3 Reasons Why All Teams Should Learn R | ๐ | - |
| How Tableau Helps Your Organization Achieve Greater Data Insights | ๐ | - |
| How PowerBI Helps Your Organization Achieve Greater Data Insights | ๐ | - |
| Title | Article | Repo |
|---|---|---|
| Create a Clickable Table of Contents for Your Medium Posts | ๐ | - |