big-data
apache/spark · 502 days ago · 37.8k
Apache Spark - A unified analytics engine for large-scale data processing
ClickHouse/ClickHouse · 502 days ago · 33.1k
ClickHouse® is a free analytics DBMS for big data
donnemartin/data-science-ipython-notebooks · 503 days ago · 26.1k
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
amark/gun · 502 days ago · 17.7k
An open source cybersecurity protocol for syncing decentralized graph data.
prestodb/presto · 501 days ago · 15.4k
The official home of the Presto distributed SQL query engine for big data
andkret/Cookbook · 502 days ago · 12.7k
The Data Engineering Cookbook
apache/predictionio · 504 days ago · 12.6k
PredictionIO, a machine learning server for developers and ML engineers.
yahoo/CMAK · 502 days ago · 11.6k
CMAK is a tool for managing Apache Kafka clusters
vesoft-inc/nebula · 502 days ago · 9.9k
A distributed, fast open-source graph database featuring horizontal scalability and high availability
catboost/catboost · 502 days ago · 7.6k
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
h2oai/h2o-3 · 502 days ago · 6.6k
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
apache/zeppelin · 502 days ago · 6.2k
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
risingwavelabs/risingwave · 501 days ago · 5.9k
Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.
feast-dev/feast · 501 days ago · 5.1k
Feature Store for Machine Learning
apache/ignite · 502 days ago · 4.6k
Apache Ignite
tangbc/vue-virtual-scroll-list · 507 days ago · 4.2k
⚡️ A Vue component that supports very large data lists with high, efficient rendering performance.
TuiQiao/CBoard · 512 days ago · 3.0k
An easy to use, self-service open BI reporting and BI dashboard platform.
apache/incubator-hugegraph · 503 days ago · 2.5k
A graph database that supports more than 100 billion data entries, with high performance and scalability (includes OLTP engine, REST API, and backends)
kantord/just-dashboard · 502 days ago · 1.6k
📊 📋 Dashboards using YAML or JSON files
traildb/traildb · 511 days ago · 1.1k
TrailDB is an efficient tool for storing and querying series of events