Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Best storage layer for small files upto 10MB
3 points
QueenRitchie
8 years ago
discuss
2.
▲
Cloudera taken private for $5.3B, acquires Datacoral and Cazena
(blog.cloudera.com)
153 points
swyx
5 years ago
109 comments
3.
▲
Algorithms Every Data Scientist Should Know: Reservoir Sampling
(blog.cloudera.com)
120 points
Irishsteve
13 years ago
40 comments
4.
▲
A Guide to Python Frameworks for Hadoop
(blog.cloudera.com)
93 points
laserson
13 years ago
15 comments
5.
▲
Common Probability Distributions: The Data Scientist’s Crib Sheet
(blog.cloudera.com)
90 points
ingve
10 years ago
7 comments
6.
▲
BinaryPig: Malware analysis with Hadoop, Django and Elasticsearch
(blog.cloudera.com)
33 points
whalesalad
13 years ago
2 comments
7.
▲
Benchmarking Time Series Workloads with Kudu, InfluxDB and ClickHouse
(blog.cloudera.com)
14 points
bankim
6 years ago
1 comment
8.
▲
How to Resample from a Large Data Set in Parallel with R on Hadoop
(blog.cloudera.com)
14 points
laserson
13 years ago
discuss
9.
▲
Kudu: New Apache Hadoop Storage Engine
(blog.cloudera.com)
14 points
justin_hancock
11 years ago
discuss
10.
▲
How Spark, Scala, and Functional Programming Made Hard Problems Easy at Barclays
(blog.cloudera.com)
11 points
ddispaltro
10 years ago
discuss
11.
▲
Feather: A Fast On-Disk Format for Data Frames for R and Python
(blog.cloudera.com)
9 points
jkestelyn
10 years ago
discuss
12.
▲
Wargaming.net’s Data-Driven, Real-Time Rules Engine
(blog.cloudera.com)
8 points
alexatkeplar
10 years ago
1 comment
13.
▲
Cloudera Impala: Real-Time Queries in Apache Hadoop
(blog.cloudera.com)
7 points
albietz
14 years ago
discuss
14.
▲
Inside Cloudera Impala: Runtime Code Generation with LLVM
(blog.cloudera.com)
6 points
tlipcon
13 years ago
discuss
15.
▲
Apache Kudu and Apache Impala (Incubating): The Integration Roadmap
(blog.cloudera.com)
5 points
samber
10 years ago
discuss
16.
▲
Ibis on Impala: Python at Scale for Data Science
(blog.cloudera.com)
4 points
denzil_correa
11 years ago
1 comment
17.
▲
New in Cloudera Labs: Google Cloud Dataflow on Apache Spark
(blog.cloudera.com)
4 points
crb
11 years ago
1 comment
18.
▲
Impyla: a new Python client for Impala
(blog.cloudera.com)
4 points
laserson
12 years ago
1 comment
19.
▲
How-to: Use IPython Notebook with Apache Spark
(blog.cloudera.com)
4 points
laserson
12 years ago
discuss
20.
▲
Finally, A Software Development Kit for Hadoop
(blog.cloudera.com)
4 points
justinkestelyn
13 years ago
discuss
21.
▲
Cloudera open sources distributed test running infrastructure
(blog.cloudera.com)
4 points
tlipcon
10 years ago
discuss
22.
▲
Jeff Dean's talk at Cloudera
(blog.cloudera.com)
3 points
mrry
12 years ago
1 comment
23.
▲
How-to: Use Eclipse with MapReduce in Cloudera’s QuickStart VM
(blog.cloudera.com)
3 points
Dekku
13 years ago
discuss
24.
▲
A Ruby Client for Impala
(blog.cloudera.com)
3 points
colinmarc
13 years ago
discuss
25.
▲
LLVM Async Codegen in Apache Impala for Low Latency SQL
(blog.cloudera.com)
3 points
superdupershant
6 years ago
discuss
26.
▲
Cloudera explains Altus SDX, a managed metastore for Hadoop workloads
(blog.cloudera.com)
3 points
tigerBL00D
8 years ago
discuss
27.
▲
How-To: Do Scalable Graph Analytics with Apache Spark
(blog.cloudera.com)
3 points
samber
10 years ago
discuss
28.
▲
Multi-Node Clusters with Cloudera QuickStart for Docker
(blog.cloudera.com)
3 points
samber
10 years ago
discuss
29.
▲
Cloudera: How-To: Ingest Email into Apache Hadoop in Real Time for Analysis
(blog.cloudera.com)
3 points
samber
10 years ago
discuss
30.
▲
Livy, the Open Source REST Service for Apache Spark, Joins Cloudera Labs
(blog.cloudera.com)
3 points
samber
10 years ago
discuss
More