/

Cookbook

The Data Engineering Cookbook

Last updated: 502 days ago
12.7k

tangbc/vue-virtual-scroll-list
507 days ago · 4.2k

⚡️ A Vue component that supports lists with large amounts of data, with high rendering performance and efficiency.

donnemartin/data-science-ipython-notebooks
503 days ago · 26.1k

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

google/WebFundamentals
503 days ago · 13.9k

Former git repo for WebFundamentals on developers.google.com

trekhleb/state-of-the-art-shitcode
503 days ago · 5.2k

💩State-of-the-art shitcode principles your project should follow to call it a proper shitcode

h5bp/html5-boilerplate
503 days ago · 55.8k

A professional front-end template for building fast, robust, and adaptable web apps or sites.

GokuMohandas/Made-With-ML
503 days ago · 35.1k

Learn how to design, develop, deploy and iterate on production-grade ML applications.

goldbergyoni/nodebestpractices
503 days ago · 95.3k

✅ The Node.js best practices list (February 2024)

xojs/xo
503 days ago · 7.5k

❤️ JavaScript/TypeScript linter (ESLint wrapper) with great defaults

cloudquery/cloudquery
503 days ago · 5.4k

The open source high performance data integration platform built for developers.

igorwojda/android-showcase
503 days ago · 6.3k

💎 Android application following best practices: Kotlin, Coroutines, JetPack, Clean Architecture, Feature Modules, Tests, MVVM, DI, Static Analysis...

apache/spark
502 days ago · 37.8k

Apache Spark - A unified analytics engine for large-scale data processing

alexeymezenin/laravel-best-practices
503 days ago · 10.5k

Laravel best practices

airbytehq/airbyte
502 days ago · 13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

trimstray/nginx-admins-handbook
502 days ago · 13.3k

How to improve NGINX performance, security, and other important things.

vesoft-inc/nebula
502 days ago · 9.9k

A distributed, fast open-source graph database featuring horizontal scalability and high availability

apache/incubator-hugegraph
503 days ago · 2.5k

A graph database that supports 100+ billion data records, with high performance and scalability (includes OLTP Engine, REST API, and Backends)

apache/zeppelin
502 days ago · 6.2k

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

GoogleChrome/lighthouse
502 days ago · 27.6k

Automated auditing, performance metrics, and best practices for the web.

catboost/catboost
502 days ago · 7.6k

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

h2oai/h2o-3
502 days ago · 6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

apache/flink
502 days ago · 22.8k

Apache Flink

apache/predictionio
504 days ago · 12.6k

PredictionIO, a machine learning server for developers and ML engineers.

yahoo/CMAK
502 days ago · 11.6k

CMAK is a tool for managing Apache Kafka clusters

OWASP/CheatSheetSeries
502 days ago · 25.8k

The OWASP Cheat Sheet Series was created to provide a concise collection of high value information on specific application security topics.

ClickHouse/ClickHouse
502 days ago · 33.1k

ClickHouse® is a free analytics DBMS for big data

vasanthk/react-bits
502 days ago · 16.1k

✨ React patterns, techniques, tips and tricks ✨

inancgumus/learngo
502 days ago · 18.2k

❤️ 1000+ Hand-Crafted Go Examples, Exercises, and Quizzes. 🚀 Learn Go by fixing 1000+ tiny programs.

h5bp/server-configs-apache
502 days ago · 3.2k

Apache HTTP server boilerplate configs

TuiQiao/CBoard
512 days ago · 3.0k

An easy to use, self-service open BI reporting and BI dashboard platform.

evidence-dev/evidence
501 days ago · 2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown.

dremio/dremio-oss
502 days ago · 1.3k

Dremio - the missing link in modern data

traildb/traildb
511 days ago · 1.1k

TrailDB is an efficient tool for storing and querying series of events

DataTalksClub/data-engineering-zoomcamp
501 days ago · 21.1k

Free Data Engineering course!

PrefectHQ/prefect
501 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

datastacktv/data-engineer-roadmap
502 days ago · 11.8k

Roadmap to becoming a data engineer in 2021

mage-ai/mage-ai
501 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

risingwavelabs/risingwave
501 days ago · 5.9k

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

kestra-io/kestra
501 days ago · 5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

braziljs/js-the-right-way
501 days ago · 8.6k

An easy-to-read, quick reference for JS best practices, accepted coding standards, and links around the Web

OWASP/wstg
501 days ago · 6.4k

The Web Security Testing Guide is a comprehensive Open Source guide to testing the security of web applications and web services.

futurice/android-best-practices
502 days ago · 20.3k

Do's and Don'ts for Android development, by Futurice developers

h5bp/server-configs-nginx
502 days ago · 10.9k

Nginx HTTP server boilerplate configs

futurice/ios-good-practices
508 days ago · 10.8k

Good ideas for iOS development, by Futurice developers.

testjavascript/nodejs-integration-tests-best-practices
501 days ago · 3.2k

✅ Beyond the basics of Node.js testing. Including a super-comprehensive best practices list and an example app (July 2023)

gothinkster/golang-gin-realworld-example-app
502 days ago · 2.4k

Exemplary real world application built with Golang + Gin

prestodb/presto
501 days ago · 15.4k

The official home of the Presto distributed SQL query engine for big data

whoiskatrin/sql-translator
502 days ago · 3.9k

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.