- Differential Dataflow - The code is ugly Rust, but the logic and linked papers are quite interesting.
- Spinning Fast Iterative Dataflows - Flink's execution model. Also, coverage in the Morning Paper.
- Discretized Streams - Spark Streaming's model of operation.
- Google's Dataflow Model - This is now also available as Apache (Incubating) Beam.
- Kafka Streams - Kafka offers "hipster stream processing," and a nice unification between tables and streams.
- General Background and Overview
- Probabilistic Data Structures for Web Analytics and Data Mining : A great overview of the space of probabilistic data structures and how they are used in approximation algorithm implementation.
- Models and Issues in Data Stream Systems
- Philippe Flajolet’s contribution to streaming algorithms : A presentation by Jérémie Lumbroso that visits some of the hostorical perspectives and how it all began with Flajolet
- Approximate Frequency Counts over Data Streams by Gurmeet Singh Manku & Rajeev Motwani : One of the early papers on the subject.
- [Methods for Finding Frequent Items in Data Streams](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.187.9800&rep
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift: http://arxiv.org/pdf/1502.03167v1.pdf | |
TuPAQ: http://arxiv.org/pdf/1502.00068.pdf | |
A Theory of Changes for Higher-Order Languages: http://www.informatik.uni-marburg.de/~pgiarrusso/papers/pldi14-ilc-author-final.pdf | |
Naiad/Timely Dataflow: http://research.microsoft.com/pubs/201100/naiad_sosp2013.pdf | |
Dedalus: http://www.eecs.berkeley.edu/Pubs/TechRpts/2009/EECS-2009-173.pdf |
I hereby claim:
- I am rand on github.
- I am rand (https://keybase.io/rand) on keybase.
- I have the public key with fingerprint 1910 8196 4E35 8470 E8AB C7A7 2C2B 50D5 A3C3 E7CC
To claim this, I am signing this object:
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
require 'formula' | |
class ScalaDocs < Formula | |
homepage 'http://www.scala-lang.org/' | |
url 'http://www.scala-lang.org/downloads/distrib/files/scala-docs-2.9.3.zip' | |
sha1 '633a31ca2eb87ce5b31b4f963bdfd1d4157282ad' | |
end | |
class ScalaCompletion < Formula | |
homepage 'http://www.scala-lang.org/' |
Ideas are cheap. Make a prototype, sketch a CLI session, draw a wireframe. Discussions around concrete examples, not handy-waving abstractions. Don't say you did something, provide a URL that proves it.
Nothing is real until it's being used by a real user. This doesn't mean you make a prototype in the morning and blog about it in the evening. It means you find one person you believe your product will help and try to get them to use it.
I wrote a really simple JavaScript script that uses jQuery to extract all the categories from Facebook's "Create a page" page.
- Navigate to the Create a page page.
- jQuerify the page.
- Open up your JavaScript console of choice and run the following script: