>

RPA Career Talk

I was invited by the Rafflesian Parents Association to give a talk on being a data scientist (as part of a five speaker panel) earlier today. Here are the slides. Was not totally happy as I didn’t realise that Google Sheets speaker notes covers the actual slides when I was screen sharing over zoom. Also, this is the first time I’ve actually given a career talk, and realised it was 15 years ago when I was sitting in the audience on the other side. Time flies! ...

August 30, 2020 · 1 min · Shen Ting

Rating Systems (1): Elo and its limitations

Introduction This is going to be a new series on rating systems, which is a vastly underrated (pun intended) area of statistics and data science. Rating systems has actually been a part of my life (and probably yours), from early days in chess and then games with matchmaking like CS:GO and Valorant, to now thinking if contract bridge should also have one. Historical Context Having been around for centuries, chess is a game which people have wasted much time on arguing/debating who is the best player. It’s probably slightly surprising then that the first modern rating systems only appeared around or after the end of World War 2. The first systems (Ingo and Harkness) were quite simple and used the idea of the average rating of opponents with adjustments for the results. ...

August 14, 2020 · 4 min · Shen Ting

DataScience SG Talk on Sample Count and Podcast

A while ago, I wrote about the GE2020 sample count. Together with Yong Sheng,[] we gave a talk about this at DataScience SG last night (Youtube link)](https://www.youtube.com/watch?v=U9-zax0mMrw). Do also check out the second talk as reinforcement learning is always an interesting subject - props to Siddarth for giving that quick summary of RL! Also, Symbolic Connection’s episode featuring me is now live! Thanks to Koo Ping Shung for organizing both of the above. ...

July 8, 2020 · 1 min · Shen Ting

How Much Should You Trust the Sample Count?

Election season is upon us again here in Singapore and Polling Day is this Friday. The last election in 2015 introduced the sample count. What is the sample count? From the ELD Website: From the votes cast at each polling station, a counting assistant picks up a random bundle of 100 ballot papers (in front of the candidates and counting agents present) and counts the number of votes for each candidate (or group of candidates in the case of a GRC). ...

July 8, 2020 · 3 min · Shen Ting

How Much Should You Trust the Sample Count?

(Note: This was originally posted on Facebook) TL;DR While the sample counts of this year’s GE saw some huge deviations, they are mostly within expectation. The two hour extension brought about some online groans from friends who didn’t want to stay up late to follow the results. To help alleviate some of this anxiety, I decided to set up a tracking sheet for the sample counts with estimated win probabilities to let people decide if the final count was worth waiting for. ...

July 8, 2020 · 4 min · Shen Ting

Organizing Data Science Projects

In the past 8 months, I’ve probably worked on close to 10 different projects. While half of these consists of not more than a few Jupyter notebooks, the others consist of intermediate data and different notebooks for preprocessing and modelling. Cookiecutter seems to be a good solution and framework: https://drivendata.github.io/cookiecutter-data-science/ Refactoring those projects will take some effort, but I believe it will be well worth the time to do so.

January 18, 2020 · 1 min · Shen Ting