/

great_expectations

Always know what to expect from your data.

最終更新日:502日前
9.3k

nteract/hydrogen
507日前3.9k

:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.

donnemartin/data-science-ipython-notebooks
504日前26.1k

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

jenkins-x/jx
505日前4.5k

Jenkins X provides automated CI+CD for Kubernetes with Preview Environments on Pull Requests using Cloud Native pipelines from Tekton

streamlit/streamlit
504日前30.2k

Streamlit — A faster way to build and share data apps.

gradio-app/gradio
504日前26.3k

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

matplotlib/matplotlib
504日前18.9k

matplotlib: plotting with Python

ml-tooling/best-of-ml-python
504日前15.1k

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

GokuMohandas/Made-With-ML
504日前35.1k

Learn how to design, develop, deploy and iterate on production-grade ML applications.

ray-project/ray
504日前29.9k

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

nteract/papermill
504日前5.5k

📚 Parameterize, execute, and analyze notebooks

CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
503日前26.2k

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

aws/amazon-sagemaker-examples
504日前9.3k

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

firmai/industry-machine-learning
504日前7.0k

A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)

scikit-learn/scikit-learn
503日前57.4k

scikit-learn: machine learning in Python

pandas-dev/pandas
503日前41.2k

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

AMAI-GmbH/AI-Expert-Roadmap
503日前27.9k

Roadmap to becoming an Artificial Intelligence Expert in 2022

airbytehq/airbyte
503日前13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

OpenRefine/OpenRefine
505日前10.3k

OpenRefine is a free, open source power tool for working with messy data and improving it

Sinaptik-AI/pandas-ai
503日前9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

akfamily/akshare
503日前7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

fastai/numerical-linear-algebra
504日前9.9k

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

tangyudi/Ai-Learn
503日前7.9k

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
503日前14.7k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

dair-ai/ML-YouTube-Courses
503日前13.5k

📺 Discover the latest machine learning / AI courses on YouTube.

catboost/catboost
503日前7.6k

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

h2oai/h2o-3
503日前6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

hadley/r4ds
504日前4.3k

R for data science: a book

rasbt/python-machine-learning-book
504日前12.1k

The "Python Machine Learning (1st edition)" book code repository and info resource

rasbt/python-machine-learning-book-2nd-edition
505日前7.0k

The "Python Machine Learning (2nd edition)" book code repository and info resource

d2l-ai/d2l-en
503日前20.9k

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

gonglei007/GameDevMind
503日前4.0k

最全面的游戏开发技术图谱。帮助游戏开发者们在已知问题上节省时间,省出更多的精力投入到更有创造性的工作中去。

qax-os/excelize
503日前16.8k

Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets

goq/telegram-list
504日前4.4k

List of telegram groups, channels & bots // Список интересных групп, каналов и ботов телеграма // Список чатов для программистов

flyteorg/flyte
502日前4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

GoogleCloudPlatform/vertex-ai-samples
504日前1.2k

Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud

Kanaries/graphic-walker
503日前2.1k

An open source alternative to Tableau. Easily embedded in any web apps.

evidence-dev/evidence
502日前2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown..

javascriptdata/danfojs
503日前4.6k

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

lancedb/lance
502日前3.1k

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

diffgram/diffgram
505日前1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

PrefectHQ/prefect
502日前14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

mage-ai/mage-ai
502日前6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

kestra-io/kestra
502日前5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

Lightning-AI/pytorch-lightning
502日前26.2k

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

eriklindernoren/ML-From-Scratch
502日前22.9k

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

marimo-team/marimo
503日前3.3k

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

goplus/gop
503日前8.7k

The Go+ programming language is designed for engineering, STEM education, and data science

mahmoud/boltons
504日前6.4k

🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

PRQL/prql
502日前9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

mrdbourke/machine-learning-roadmap
502日前7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.