-
under Apache License 2.0 license
-
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
-
under MIT License license
-
:sparkling_heart: High available distributed ip proxy pool, powerd by Scrapy and Redis
-
under Apache License 2.0 license
-
Deep Learning Pipelines for Apache Spark
-
under MIT License license
-
:blush: :musical_note: MusicPlayer 一站式收听多平台音乐(网易云, 虾米, QQ)的跨平台音乐播放器,尽情享受吧~:sparkles:
-
under MIT License license
-
Distributed Deep learning with Keras & Spark
-
under Apache License 2.0 license
-
Koalas: pandas API on Apache Spark
-
under Apache License 2.0 license
-
(Deprecated) Scikit-learn integration package for Apache Spark
-
under Apache License 2.0 license
-
PySpark + Scikit-learn = Sparkit-learn
-
under GNU General Public License v3.0 license
-
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
-
under Apache License 2.0 license
-
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
-
under Apache License 2.0 license
-
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
-
under MIT License license
-
Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt
-
under Apache License 2.0 license
-
-
under Apache License 2.0 license
-
High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features :sparkles:
-
under MIT License license
-
:sparkles: Python library and CLI to upload photo and video on Instagram. W/o a phone!
-
under MIT License license
-
Spark Knn Recommender
-
under Apache License 2.0 license
-
Sparkling Pandas
-
under MIT License license
-
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
-
under BSD 3-Clause "New" or "Revised" License license
-
-
under Apache License 2.0 license
-
Visualize streaming machine learning in Spark
-
under MIT License license
-
LearningApacheSpark
-
under MIT License license
-
Create HTML profiling reports from Apache Spark DataFrames
-
under MIT License license
-
Process Common Crawl data with Python and Spark
-
under MIT License license
-
Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall
-
under GNU General Public License v3.0 license
-
Code meant to be run on an Odroid to allow an ArduCopter Pixhawk based multicopter to find red balloons for Sparkfun's AVC 2014 competition
-
under MIT License license
-
An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parses csv data into SchemaRDD. No installation required, simply include pyspark_csv.py via SparkContext.
-
under Apache License 2.0 license
-
PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.
-
under MIT License license
-
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
-
under Apache License 2.0 license
-
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
-
under Apache License 2.0 license
-
Distributed scikit-learn meta-estimators in PySpark
-
under MIT License license
-
百万英雄/冲顶大会/知识超人 答题助手 瞬间使用Chrome打开百度
-
under MIT License license
-
Easy to use library to bring Tensorflow on Apache Spark
-
under GNU General Public License v3.0 license
-
Kodi addon for finding acestream links
-
under Mozilla Public License 2.0 license
-
Spark bindings for Mozilla Telemetry
-
under The Unlicense license
-
Materials for my Spark tutorial at Pycon 2015
-
under MIT License license
-
MTN MoMo API Client Library for Python
-
under Apache License 2.0 license
-
spark plugin for dbt
-
under Apache License 2.0 license
-
A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator
-
under MIT License license
-
Translator from USB-Rubber-Ducky payloads to a Digispark code.
-
under Apache License 2.0 license
-
SQLflow based on python development, support to Spark, as the underlying distributed computing engine, through a set of unified configuration file to complete the batch, flow calculation, the Rest service development.
-
under Apache License 2.0 license
-
Apache (Py)Spark type annotations (stub files).
-
under MIT License license
-
Real-time Machine Learning with Apache Spark on Twitter Public Stream
-
under Apache License 2.0 license
-
Materials for IBM Spark contest. About the real-world application of big data and spark.
-
under MIT License license
-
Spark and Python (PySpark) Examples
-
under MIT License license
-
Bayesian Personalized Ranking for Spark
-
under Apache License 2.0 license
-
Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
-
under Apache License 2.0 license
-
Spark Modularized View
-
under Apache License 2.0 license
-
Generic Implementation of Consensus ADMM over Spark
-
under MIT License license
-
A Python client for Apache Livy, enabling use of remote Apache Spark clusters.
-
under MIT License license
-
Data and code for "Fast Data Applications with Spark and Python"
-
under Apache License 2.0 license
-
Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
-
under GNU General Public License v3.0 license
-
Tools and conversion scripts for the CHM-T36VA Desktop Pick and Place from SparkFun
-
under GNU Affero General Public License v3.0 license
-
Python library for converting Apache Spark ML pipelines to PMML
-
under MIT License license
-
Quickstart PySpark with Anaconda on AWS/EMR
-
under MIT License license
-
Quickstart PySpark with Anaconda on AWS/EMR
-
under Apache License 2.0 license
-
pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4
-
under MIT License license
-
:sparkles: Runs standard --fix against the javascript in your ST3 window on save or manually.
-
under BSD 2-Clause "Simplified" License license
-
Troposphere-based environment generator
-
under MIT License license
-
-
under GNU General Public License v3.0 license
-
dataShark is a Security & Network Event Analytics Framework built on Apache Spark
-
under MIT License license
-
pytest plugin to run the tests with support of pyspark
-
under Apache License 2.0 license
-
:sparkles: My personal blog :sparkles:
-
under Apache License 2.0 license
-
-
under MIT License license
-
本库托管了协程、SMTP邮件发送协议、 Python连接远程HBase、 异步爬虫代码和快速上手中英文词云图等代码,如果你觉得对你有用,别忘了star我哦。
-
under Apache License 2.0 license
-
GPU Acceleration for Apache Spark
-
under MIT License license
-
Python Library to Interface to Cisco Spark REST API
-
under MIT License license
-
Implementation of a LSTM with TensorFlow and distributed on Apache Spark
-
under MIT License license
-
Used Spark core python, Spark sql, Spark MLlib, Spark Streaming
-
under Apache License 2.0 license
-
Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)
-
under MIT License license
-
Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js
-
under Apache License 2.0 license
-
Spark GCE Script Helps you deploy Spark cluster on Google Cloud.
-
under MIT License license
-
Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag
-
under MIT License license
-
Train and run Pytorch models on Apache Spark.
-
under MIT License license
-
A simple tool for plotting Spark ML's Decision Trees
-
under Apache License 2.0 license
-
Utilities and examples to asssist in working with PySpark and Cassandra.
-
under GNU General Public License v3.0 license
-
pyspark sample scripts
-
under BSD 3-Clause "New" or "Revised" License license
-
A pure python mock of pyspark's RDD
-
under MIT License license
-
Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.
-
under MIT License license
-
Real-time image processing at scale using Kafka and Spark Streaming
-
under Apache License 2.0 license
-
Validator for the stac-spec
-
under GNU Affero General Public License v3.0 license
-
:pencil: Terminal-based crossword puzzle solving interface
-
under Apache License 2.0 license
-
Automates Spark standalone cluster tasks with Puppet and Fabric.
-
under MIT License license
-
Speech-activated LEDs using Intel Edison, SparkFun blocks, Python, and CMU Sphinx
-
under BSD 3-Clause "New" or "Revised" License license
-
Custom field for storing month (YYYY-MM) as a django field.
-
under Apache License 2.0 license
-
Client library to use Darwin services in Python
-
under GNU General Public License v2.0 license
-
The source code involved in complex network analysis.
-
under GNU General Public License v3.0 license
-
Source codes involved in data analysis.
-
under Apache License 2.0 license
-
PySpark for Elastic Search
-
under MIT License license
-
Visualizes the Random Forest debug string from the MLLib in Spark using D3.js
-
under Apache License 2.0 license
-
:star: CLI tool to launch Spark jobs on AWS EMR
-
under Apache License 2.0 license
-
A command line tool for Spark packages
-
under MIT License license
-
This program compiles Ducky Script into something usable for the Digispark ATTiny 85 chip. Get yourself a $1 USB Rubber Ducky.
-
under MIT License license
-
-
under Apache License 2.0 license
-
An Unofficial Pytorch Implementation of Multi-Granularity Hierarchical Attention Fusion Networks for Reading Comprehension and Question Answering
-
under MIT License license
-
Script for Digispark Attiny85, ATMEGA32U4 to steal passwords, cookies and send to your mail
-
under Apache License 2.0 license
-
Joblib Apache Spark Backend
-
under MIT License license
-
Just a boilerplate for PySpark and Flask
-
under Apache License 2.0 license
-
Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)
-
under MIT License license
-
An example Python ALS recommender system
-
under MIT License license
-
Supercharge your analysis of Cassandra data with Apache Spark
-
under GNU General Public License v3.0 license
-
Netmiko-based scripts to assist the Network Administrators and Engineers of the world!
-
under GNU General Public License v2.0 license
-
Poor man's rubber ducky
-
under MIT License license
-
[Maintainer Required] Dialogflow Django is a web client to chat :sparkling_heart:
-
under MIT License license
-
Make awesome reveal.js presentations with revelation :sparkler:
-
under Apache License 2.0 license
-
-
under Apache License 2.0 license
-
Distribution transparent Machine Learning experiments on Apache Spark
-
under Artistic License 2.0 license
-
Spark on ECS
-
under MIT License license
-
Google 日本語入力用DvorakJPローマ字テーブル / DvorakJP Roman Table for Google Japanese Input
-
under MIT License license
-
:sparkles: A Python package for sparse representations and dictionary learning, including matching pursuit, K-SVD and applications.
-
under GNU Lesser General Public License v3.0 license
-
Python module for Spark devices (see spark.io)
-
under MIT License license
-
An Earley-Algorithm Context-free grammar Parser Toolkit
-
under MIT License license
-
使用Python构建共现矩阵,并以三元组形式存储到csv文件。