Big Data and Cloud

  1. aws-cli
    Universal Command Line Interface for Amazon Web Services
    Project Source: https://github.com/aws/aws-cli

  2. boto
    Boto is a Python package that provides interfaces to Amazon Web Services.
    Project Source: https://github.com/boto/boto
    Project Documentation: http://docs.pythonboto.org/en/latest/

  3. dpark
    DPark is a Python clone of Spark, a MapReduce-like computing framework that supports iterative computation.
    Project Source: https://github.com/douban/dpark
    Project Documentation (Chinese): https://github.com/jackfengji/test_pro/wiki

  4. dumbo
    Python module that allows one to easily write and run Hadoop programs.
    Project Source: https://github.com/klbostee/dumbo
    Project Homepage: http://klbostee.github.io/dumbo/

  5. Fabulous
    Deploy Django apps to Amazon EC2 with one command.
    Project Source: https://github.com/gcollazo/Fabulous
    Project Homepage: http://elweb.co/

  6. falcon
    Falcon is a high-performance Python framework for building cloud APIs and web app backends.
    Project Source: https://github.com/racker/falcon
    Project Homepage: http://falconframework.org/

  7. gae-boilerplate
    Google App Engine Boilerplate gets your project off the ground quickly using the Google App Engine platform.
    Project Source: https://github.com/coto/gae-boilerplate
    Project Homepage: http://appengine.beecoss.com/

  8. goagent
    A Google App Engine (GAE) proxy.
    Project Source: https://github.com/goagent/goagent
    Project Homepage: https://code.google.com/p/goagent/

  9. Hue
    Hue is an open-source web UI for working with big data on Hadoop.
    Project Source: https://github.com/cloudera/hue
    Project Homepage: http://gethue.com/

  10. libcloud
    Apache Libcloud is a Python library that hides the differences between cloud provider APIs and lets you manage cloud resources through a unified, easy-to-use API.
    Project Source: https://github.com/apache/libcloud
    Project Documentation: http://libcloud.apache.org/

  11. mrjob
    Run MapReduce jobs on Hadoop or Amazon Web Services.
    Project Source: https://github.com/Yelp/mrjob
    Project Documentation: https://pythonhosted.org/mrjob/

  12. nova
    OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more.
    Project Source: https://github.com/openstack/nova
    Project Homepage: http://www.openstack.org/

  13. pyes
    pyes has offered a Pythonic way to use ElasticSearch since 2010.
    Project Source: https://github.com/aparo/pyes
    Project Documentation: https://pypi.python.org/pypi/pyes/

  14. s3cmd
    Command line tool for managing Amazon S3 and CloudFront services.
    Project Source: https://github.com/s3tools/s3cmd
    Project Homepage: http://s3tools.org/s3cmd
