Project Repo

个人项目

Menu 菜单

Menu

菜单

  • About
  • 关于我
  • CV & Resume
  • 个人简历
  • Project Repo
  • 个人项目
  • Resources
  • 资源

Project Repo

个人项目

This page is a repository of my side-projects, including both data science related works and SDE related works.

本页用于展示我过去做过的一些个人认为有趣的项目。本页目前仍在创建中,敬请期待!

Fuzzy Matching At Scale

Last Updated: 4/20/2023

Oftentimes data analysts need to merge datasets with identifiers that are close but not exactly the same. When the generic Python fuzzy-matching modules cannot handle the O(n2) time complexity operation of comparison, we need to do something else.

PandasTF-IDF vectorization

Cosine SimilarityNLP

Learn more

mySQL Learning Series

Last Updated: 7/4/2023

An active recording of my progress on self-learning mySQL.

mySQLData Modeling

Learn more