great_expectations
Always know what to expect from your data.
apache/superset508日前56.6k
Apache Superset is a Data Visualization and Data Exploration Platform
nteract/hydrogen507日前3.9k
:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
donnemartin/data-science-ipython-notebooks504日前26.1k
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
recommenders-team/recommenders504日前17.6k
Best Practices on Recommendation Systems
microsoft/computervision-recipes504日前9.2k
Best Practices, code samples, and documentation for Computer Vision.
Netflix/metaflow504日前7.4k
:rocket: Build and manage real-life ML, AI, and data science projects with ease!
gaia-pipeline/gaia505日前5.1k
Build powerful pipelines in any programming language.
jenkins-x/jx505日前4.5k
Jenkins X provides automated CI+CD for Kubernetes with Preview Environments on Pull Requests using Cloud Native pipelines from Tekton
iterative/cml512日前3.9k
♾️ CML - Continuous Machine Learning | CI/CD for ML
streamlit/streamlit504日前30.2k
Streamlit — A faster way to build and share data apps.
gradio-app/gradio504日前26.3k
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
plotly/dash504日前20.1k
Data Apps & Dashboards for Python. No JavaScript Required.
matplotlib/matplotlib504日前18.9k
matplotlib: plotting with Python
ml-tooling/best-of-ml-python504日前15.1k
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
keras-team/keras504日前60.3k
Deep Learning for humans
GokuMohandas/Made-With-ML504日前35.1k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
ray-project/ray504日前29.9k
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
man-group/dtale504日前4.4k
Visualizer for pandas data structures
fastai/fastpages513日前3.5k
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
nteract/papermill504日前5.5k
📚 Parameterize, execute, and analyze notebooks
alan-turing-institute/MLJ.jl505日前1.7k
A Julia machine learning framework
MilesCranmer/PySR503日前1.6k
High-Performance Symbolic Regression in Python and Julia
CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers503日前26.2k
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
ydataai/ydata-profiling504日前11.8k
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
aws/amazon-sagemaker-examples504日前9.3k
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
firmai/industry-machine-learning504日前7.0k
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
scikit-learn/scikit-learn503日前57.4k
scikit-learn: machine learning in Python
pandas-dev/pandas503日前41.2k
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
AMAI-GmbH/AI-Expert-Roadmap503日前27.9k
Roadmap to becoming an Artificial Intelligence Expert in 2022
airbytehq/airbyte503日前13.2k
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
OpenRefine/OpenRefine505日前10.3k
OpenRefine is a free, open source power tool for working with messy data and improving it
Sinaptik-AI/pandas-ai503日前9.8k
Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
statsmodels/statsmodels503日前9.3k
Statsmodels: statistical modeling and econometrics in Python
Yorko/mlcourse.ai504日前9.3k
Open Machine Learning Course
akfamily/akshare503日前7.9k
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
fastai/numerical-linear-algebra504日前9.9k
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
tangyudi/Ai-Learn503日前7.9k
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
stefan-jansen/machine-learning-for-trading503日前11.3k
Code for Machine Learning for Algorithmic Trading, 2nd edition.
polakowo/vectorbt503日前3.6k
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
explosion/spaCy503日前28.3k
💫 Industrial-strength Natural Language Processing (NLP) in Python
piskvorky/gensim504日前15.0k
Topic Modelling for Humans
ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code503日前14.7k
500 AI Machine learning Deep learning Computer vision NLP Projects with code
dair-ai/ML-YouTube-Courses503日前13.5k
📺 Discover the latest machine learning / AI courses on YouTube.
catboost/catboost503日前7.6k
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
h2oai/h2o-3503日前6.6k
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
rasbt/python-machine-learning-book504日前12.1k
The "Python Machine Learning (1st edition)" book code repository and info resource
EpistasisLab/tpot504日前9.4k
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
sktime/sktime503日前7.2k
A unified framework for machine learning with time series
rasbt/python-machine-learning-book-2nd-edition505日前7.0k
The "Python Machine Learning (2nd edition)" book code repository and info resource
alteryx/featuretools504日前7.0k
An open source python library for automated feature engineering
autogluon/autogluon503日前6.8k
AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
d2l-ai/d2l-en503日前20.9k
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
gonglei007/GameDevMind503日前4.0k
最全面的游戏开发技术图谱。帮助游戏开发者们在已知问题上节省时间,省出更多的精力投入到更有创造性的工作中去。
qax-os/excelize503日前16.8k
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
nteract/nteract504日前6.1k
📘 The interactive computing suite for you! ✨
goq/telegram-list504日前4.4k
List of telegram groups, channels & bots // Список интересных групп, каналов и ботов телеграма // Список чатов для программистов
flyteorg/flyte502日前4.5k
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
GoogleCloudPlatform/vertex-ai-samples504日前1.2k
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
apache/airflow502日前33.6k
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Kanaries/graphic-walker503日前2.1k
An open source alternative to Tableau. Easily embedded in any web apps.
evidence-dev/evidence502日前2.9k
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown..
kantord/just-dashboard503日前1.6k
:bar_chart: :clipboard: Dashboards using YAML or JSON files
javascriptdata/danfojs503日前4.6k
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
lancedb/lance502日前3.1k
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
diffgram/diffgram505日前1.8k
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
eugeneyan/applied-ml502日前25.7k
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
PrefectHQ/prefect502日前14.1k
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
dagster-io/dagster502日前9.7k
An orchestration platform for the development, production, and observation of data assets.
mage-ai/mage-ai502日前6.5k
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Avaiga/taipy502日前5.5k
Turns Data and AI algorithms into production-ready web applications in no time.
kestra-io/kestra502日前5.4k
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
feast-dev/feast502日前5.1k
Feature Store for Machine Learning
Lightning-AI/pytorch-lightning502日前26.2k
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
eriklindernoren/ML-From-Scratch502日前22.9k
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
marimo-team/marimo503日前3.3k
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
pydoit/doit503日前1.8k
task management & automation tool
goplus/gop503日前8.7k
The Go+ programming language is designed for engineering, STEM education, and data science
mahmoud/boltons504日前6.4k
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
PRQL/prql502日前9.1k
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
mrdbourke/machine-learning-roadmap502日前7.1k
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
jina-ai/jina502日前19.7k
☁️ Build multimodal AI applications with cloud-native stack