[Day 879] A script to keep track of state-of-the-art AI research

While doing research, I often find myself wondering: “What’s the state-of-the-art result for XYZ right now?” I just want a tool that returns the summary of the latest SOTA research. My usual go-to place is Google, but I quickly realize that Google often returns:

  • papers that have a lot of citations, which means they are old.
  • articles that discuss SOTA systems. Again, these articles can be a couple of years old.
  • irrelevant results that share the same name as XYZ.

At best, I have to skim those results. At worst, I don’t find what I’m looking for at all (I might if I go to the next page results but who does that?) [See the rest of the post and more information on the script that I wrote on huyenchip.com]

Advertisements
[Day 879] A script to keep track of state-of-the-art AI research

[Day 626] I just gave a lecture to 400 students

Today, I was invited to give a guest lecture for the Stanford class CS224N: Natural Language Processing with Deep Learning. I was pretty excited about the opportunity. First, I’d never given a lecture to such a big audience before – there are 400+ students in the class. Second, it’s Richard Socher‘s class. He’s hands down one of the most chill professors I know. For some reason, he always looks like he’s just got out of bed and we occasionally catch him biking down the stairs to the classroom. Third, I’d always heard that speaking at NVIDIA Auditorium is lit and I want to try it out before graduating. Continue reading “[Day 626] I just gave a lecture to 400 students”

[Day 626] I just gave a lecture to 400 students

[Day 276] Detailed instruction on how to do Style Transfer

If you don’t already know, style transfer is the cool, hip thing that has been taking the recreational AI community by storm. It’s so cool that even Kristen Stewart co-authored a paper about it. To quote one researcher who has done extensive work in style transfer that I’ve got a chance to talk to, “it is an utterly unremarkable paper that wouldn’t have been published otherwise [if Kristen Stewart’s name is not on it]. That’s a publicity stunt.”

kristen stewart style transfer.jpgSome background on why I’m doing this: I’m teaching the course CS 20SI: “TensorFlow for Deep Learning Research” and for the assignment about convolution neural networks, I thought it’d be fun for students to do style transfer as their exercise at home. They, after all, showed a lot of enthusiasm when we did Deep Dream in class.

Continue reading “[Day 276] Detailed instruction on how to do Style Transfer”

[Day 276] Detailed instruction on how to do Style Transfer

[Day 136] Meet Richard Stebbing–my unbelievable boss

Last summer, I worked under Richard Stebbing and he is kinda a genius. I googled him the other day and found out that he finished his undergrad engineering degree in  3 years with straight A-plus. He then became a Rhodes scholar at Oxford, finishing his PhD also in 3 years. Every time I see him code, I’m like: “Wow, you can do that?” You can check out his GitHub. But whenever he wasn’t blowing me away with his coding skills, he made a sport out of making fun of me.

Continue reading “[Day 136] Meet Richard Stebbing–my unbelievable boss”

[Day 136] Meet Richard Stebbing–my unbelievable boss

[Day 24] We built an automated essay grading system

For our CS224D’s final project, Lucio and I took on Kaggle’s Automated Essay Scoring competition. We tried to build a model that can automatically grade your essay. You input an essay and voila, it outputs the score for it. The dataset we have is for essays grade 7 to 10, but the model is easily scalable. It can be used to grade SAT/ACT practice essays or any kind of essays, as long as we have enough training data.

Continue reading “[Day 24] We built an automated essay grading system”

[Day 24] We built an automated essay grading system