Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spar…
#mapreduce
Repositories 636
Redisson - distributed Java objects and services (Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock,…
java, redis, lock, map, list, queue, executor, redis-cluster, redis-client, distributed, set, distributed-locks, tomcat, hibernate, cache, scheduler, mapreduce, spring, session
Java
Updated Mar 22, 2019
Python clone of Spark, a MapReduce-like framework in Python
Python
Updated Jan 23, 2019
C# and F# language binding and extensions to Apache Spark
spark, apache-spark, rdd, dataframe, dstream, dataset, streaming, csharp, mobius, kafka-streaming, spark-streaming, fsharp, bigdata, mapreduce, eventhubs, near-real-time
C#
Updated Dec 24, 2018
Go
Updated Mar 19, 2019
distributed_computing: includes mapreduce, kvstore, etc.
Go
Updated Jun 26, 2017
An open source framework for building data analytic applications.
unified, integration, platform, dataset, mapreduce, spark, spark-streaming, java, java-8, cdap, python, middleware
Java
Updated Mar 22, 2019
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Java
Updated Apr 25, 2018
t-Digest data structure in Python. Useful for percentiles and quantiles, including in distributed environments like PySpark
Python
Updated Nov 20, 2018
Go
Updated May 29, 2018
Asakusa Framework
Java
Updated Jan 16, 2019
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Java
Updated Nov 12, 2015
Python Data Processing library
Python
Updated Jan 6, 2019
A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Java
Updated May 2, 2018
MapReduce by examples
Java
Updated Oct 28, 2016
Scalable RNA-seq analysis
Python
Updated Mar 15, 2018
A light-weight distributed stream computing framework for Golang
Go
Updated May 15, 2018
Big Data for Data Engineers Coursera Specialization from Yandex
Jupyter Notebook
Updated Nov 20, 2018
IPDC (InterPlanetary Distributed Computing) is a distributed computation service, a peer-to-peer hypermedia protocol…
Python
Updated May 7, 2018
Map Reduce Implementation of Connected Component on Apache Spark
Scala
Updated Dec 15, 2018
An ORM library that helps you [1] read/write HBase rows in a clean way [2] write+test MapReduce jobs that read from a…
hbase, orm, mapreduce, hadoop-mapreduce, java-annotations, object-mapping, junit, java-libraries, column-family, hbase-orm
Java
Updated Dec 3, 2018
Handy enumerable operations implementation.
Elixir
Updated Mar 15, 2019
gomrjob - a Go Framework for Hadoop Map Reduce Jobs
Go
Updated Sep 21, 2018
A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
HCL
Updated Feb 26, 2018
Java library for running Serverless MapReduce jobs
Java
Updated Aug 10, 2017
Scalable machine learning and big data analysis with Apache Spark
Jupyter Notebook
Updated Mar 11, 2018
Java
Updated Mar 5, 2019
Java
Updated Apr 8, 2018
Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce framewo…
high-performance-computing, data-analytics, mapreduce, mpi, memory-efficiency, performance-and-scalability
C++
Updated Nov 12, 2018
JSON Store with MQTT Interface 📚 📂 📡
JavaScript
Updated May 30, 2018

