Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Best storage layer for small files upto 10MB
3 points
QueenRitchie
8 years ago
discuss
2.
Cloudera taken private for $5.3B, acquires Datacoral and Cazena (blog.cloudera.com)
153 points
swyx
5 years ago
109 comments
3.
Algorithms Every Data Scientist Should Know: Reservoir Sampling (blog.cloudera.com)
120 points
Irishsteve
13 years ago
40 comments
4.
A Guide to Python Frameworks for Hadoop (blog.cloudera.com)
93 points
laserson
13 years ago
15 comments
5.
Common Probability Distributions: The Data Scientist’s Crib Sheet (blog.cloudera.com)
90 points
ingve
10 years ago
7 comments
6.
BinaryPig: Malware analysis with Hadoop, Django and Elasticsearch (blog.cloudera.com)
33 points
whalesalad
13 years ago
2 comments
7.
Benchmarking Time Series Workloads with Kudu, InfluxDB and ClickHouse (blog.cloudera.com)
14 points
bankim
6 years ago
1 comment
8.
How to Resample from a Large Data Set in Parallel with R on Hadoop (blog.cloudera.com)
14 points
laserson
13 years ago
discuss
9.
Kudu: New Apache Hadoop Storage Engine (blog.cloudera.com)
14 points
justin_hancock
11 years ago
discuss
10.
How Spark, Scala, and Functional Programming Made Hard Problems Easy at Barclays (blog.cloudera.com)
11 points
ddispaltro
10 years ago
discuss
11.
Feather: A Fast On-Disk Format for Data Frames for R and Python (blog.cloudera.com)
9 points
jkestelyn
10 years ago
discuss
12.
Wargaming.net’s Data-Driven, Real-Time Rules Engine (blog.cloudera.com)
8 points
alexatkeplar
10 years ago
1 comment
13.
Cloudera Impala: Real-Time Queries in Apache Hadoop (blog.cloudera.com)
7 points
albietz
14 years ago
discuss
14.
Inside Cloudera Impala: Runtime Code Generation with LLVM (blog.cloudera.com)
6 points
tlipcon
13 years ago
discuss
15.
Apache Kudu and Apache Impala (Incubating): The Integration Roadmap (blog.cloudera.com)
5 points
samber
10 years ago
discuss
16.
Ibis on Impala: Python at Scale for Data Science (blog.cloudera.com)
4 points
denzil_correa
11 years ago
1 comment
17.
New in Cloudera Labs: Google Cloud Dataflow on Apache Spark (blog.cloudera.com)
4 points
crb
11 years ago
1 comment
18.
Impyla: a new Python client for Impala (blog.cloudera.com)
4 points
laserson
12 years ago
1 comment
19.
How-to: Use IPython Notebook with Apache Spark (blog.cloudera.com)
4 points
laserson
12 years ago
discuss
20.
Finally, A Software Development Kit for Hadoop (blog.cloudera.com)
4 points
justinkestelyn
13 years ago
discuss
21.
Cloudera open sources distributed test running infrastructure (blog.cloudera.com)
4 points
tlipcon
10 years ago
discuss
22.
Jeff Dean's talk at Cloudera (blog.cloudera.com)
3 points
mrry
12 years ago
1 comment
23.
How-to: Use Eclipse with MapReduce in Cloudera’s QuickStart VM (blog.cloudera.com)
3 points
Dekku
13 years ago
discuss
24.
A Ruby Client for Impala (blog.cloudera.com)
3 points
colinmarc
13 years ago
discuss
25.
LLVM Async Codegen in Apache Impala for Low Latency SQL (blog.cloudera.com)
3 points
superdupershant
6 years ago
discuss
26.
Cloudera explains Altus SDX, a managed metastore for Hadoop workloads (blog.cloudera.com)
3 points
tigerBL00D
8 years ago
discuss
27.
How-To: Do Scalable Graph Analytics with Apache Spark (blog.cloudera.com)
3 points
samber
10 years ago
discuss
28.
Multi-Node Clusters with Cloudera QuickStart for Docker (blog.cloudera.com)
3 points
samber
10 years ago
discuss
29.
Cloudera: How-To: Ingest Email into Apache Hadoop in Real Time for Analysis (blog.cloudera.com)
3 points
samber
10 years ago
discuss
30.
Livy, the Open Source REST Service for Apache Spark, Joins Cloudera Labs (blog.cloudera.com)
3 points
samber
10 years ago
discuss
More