Tumblr

Tumblr is a New York City startup that empowers creators and explorers. Founded by David Karp in 2007, Tumblr is now home to more than 90 million tumblelogs and 40 billion posts across a wide swath of communities and topics. From our headquarters near Union Square, our highly caffeinated engineers work in an open, positive and collaborative environment. We focus on creating intuitive and beautiful ways to share, discover and connect, while engineering new ways to bring Tumblr to more people around the world.

Tumblr is backed by Union Square Ventures, Spark Capital and Sequoia Capital.

Technologies: Ruby, PHP, Scala, Hadoop, Redis, MySQL
 

In this talk, Adam from Tumblr gives an “Introduction to Digital Signal Processing in Hadoop”. He introduces the concepts of digital signals and filters and their interpretation in both the time and frequency domains, then works through a few simple examples of low-pass filter design and application. The talk is much more applied than theoretical and assumes no prior knowledge of signal processing. It was recorded at the NYC Machine Learning Meetup at Pivotal Labs.
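For readers new to the idea, here is a minimal sketch of a low-pass filter: a causal moving average written in plain Python. It is an illustration only, not code from the talk; the function name `moving_average` and the toy signal are made up for this example. The input is a slow ramp with a fast alternating wiggle on top, and a two-sample window cancels the wiggle while keeping the ramp.

```python
def moving_average(signal, window):
    """Causal moving-average (FIR low-pass) filter over a list of numbers.

    Each output is the mean of the current sample and up to
    `window - 1` previous samples.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    out = []
    running = 0.0
    for i, x in enumerate(signal):
        running += x
        if i >= window:
            # Drop the sample that just slid out of the window.
            running -= signal[i - window]
        # During start-up fewer than `window` samples are available.
        out.append(running / min(i + 1, window))
    return out


# Toy signal (hypothetical): a slow ramp plus a fast +1/-1 alternation.
noisy = [i + (1 if i % 2 == 0 else -1) for i in range(8)]
smoothed = moving_average(noisy, window=2)

print(noisy)     # [1, 0, 3, 2, 5, 4, 7, 6]
print(smoothed)  # [1.0, 0.5, 1.5, 2.5, 3.5, 4.5, 5.5, 6.5]
```

In the time domain the filter simply averages neighbouring samples. In the frequency domain the same operation passes slow changes and suppresses fast ones, which is why the alternation disappears and only the ramp (delayed by half a sample) remains.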

Adam also shows how these filters can be used either on a real-time stream or in batch mode in Hadoop (with Scalding), and gives some examples of how to detect trendy, meme-ish blogs on Tumblr.
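To make the stream-versus-batch point concrete, here is a rough Python sketch rather than the Scalding code from the talk. It assumes a single-pole exponential smoother as the low-pass filter, and the trend rule (compare a fast and a slow smoother) is a stand-in chosen for illustration, not the method Adam presents. All names and thresholds (`ExponentialSmoother`, `smooth_batch`, `is_trending`, `ratio=1.5`) are hypothetical.

```python
class ExponentialSmoother:
    """Single-pole IIR low-pass filter: y[n] = alpha*x[n] + (1 - alpha)*y[n-1]."""

    def __init__(self, alpha):
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self.state = None  # no samples seen yet

    def update(self, x):
        """Streaming use: feed one sample, get the current filtered value."""
        if self.state is None:
            self.state = float(x)  # initialise on the first sample
        else:
            self.state = self.alpha * x + (1.0 - self.alpha) * self.state
        return self.state


def smooth_batch(series, alpha):
    """Batch use: run the same filter over a whole stored series."""
    smoother = ExponentialSmoother(alpha)
    return [smoother.update(x) for x in series]


def is_trending(series, fast_alpha=0.5, slow_alpha=0.1, ratio=1.5):
    """Hypothetical rule: flag a series whose recent (fast) level is well
    above its long-run (slow) level."""
    fast = smooth_batch(series, fast_alpha)[-1]
    slow = smooth_batch(series, slow_alpha)[-1]
    return slow > 0 and fast / slow > ratio


# Made-up daily activity counts for two blogs.
steady = [10] * 20
spiking = [10] * 15 + [10, 20, 40, 80, 160]

print(is_trending(steady))   # False
print(is_trending(spiking))  # True
```

Because the filter keeps only one number of state, the same `update` step works whether samples arrive one at a time or are replayed from storage; in a batch job the per-blog series would be grouped and filtered independently.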

