Categories
-
Recent Posts
Tags
- algorithm
- big data
- Brooklyn Bridge
- cache
- campus
- cascading
- cloud
- coherence
- commencement
- complexity
- DistributableStream
- distributed datasets
- food
- fremont
- hadoop
- hardware
- hot pot
- java
- JDBC
- kids
- landscape
- life
- new haven
- Oracle
- parallel algorithms
- parallel computing
- performance
- programming
- redwood shores
- Reservoir sampling
- spark
- storage
- storm
- task assignment
- tez
- thread pool
- travel
- upthere
- usenix fast'16
- yacht
- Yale
Archives
May 2024 M T W T F S S 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Tag Archives: Reservoir sampling
Distributed/Parallel Reservoir Sampling
Reservoir sampling is a family of randomized algorithms for randomly choosing a sample of k items from a list S of n items, where n is either very large or unknown until the list is traversed. In most of the applications, n is … Continue reading
Posted in Algorithm
Tagged big data, distributed datasets, hadoop, parallel algorithms, Reservoir sampling
1 Comment