data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

Last updated: 625 days ago · 6.6k

TanStack/query
630 days ago · 38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

keajs/kea
630 days ago · 1.9k

Batteries Included State Management for React

heroku/react-refetch
633 days ago · 3.4k

A simple, declarative, and composable way to fetch data for React components

FredericHeem/starhackit
634 days ago · 1.3k

StarHackIt: React/Native/Node fullstack starter kit with authentication and authorisation, data backed by SQL, the infrastructure deployed with GruCloud

bitwarden/server
628 days ago · 14.0k

The core infrastructure backend (API, database, Docker, etc).

microsoft/azuredatastudio
630 days ago · 7.4k

Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases. Azure Data Studio supports Windows, macOS, and Linux, with immediate capability to connect to Azure SQL and SQL Server. Browse the extension library for more database support options including MySQL, PostgreSQL, and MongoDB.

serhii-londar/open-source-mac-os-apps
628 days ago · 38.7k

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

metabase/metabase
628 days ago · 35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

lk-geimfari/awesomo
629 days ago · 9.2k

Cool open source projects. Choose your project and get involved in Open Source development now.

pingcap/tidb
628 days ago · 35.7k

TiDB is an open-source, cloud-native, distributed, MySQL-compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at: https://tidbcloud.com/free-trial

SheetJS/sheetjs
629 days ago · 34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

cockroachdb/cockroach
628 days ago · 28.7k

CockroachDB - the open source, cloud-native distributed SQL database.

electric-sql/electric
629 days ago · 4.0k

Local-first sync layer for web and mobile apps. Build reactive, realtime, local-first apps directly on Postgres.

amitshekhariitbhu/Android-Debug-Database
628 days ago · 8.3k

A library for debugging android databases and shared preferences - Make Debugging Great Again

OffcierCia/DeFi-Developer-Road-Map
628 days ago · 9.3k

DeFi Developer roadmap is a curated Developer handbook which includes a list of the best tools for DApps development, resources and references!

dpgaspar/Flask-AppBuilder
628 days ago · 4.5k

Simple and rapid application development framework, built on top of Flask. Includes detailed security, auto CRUD generation for your models, Google Charts, and much more. Demo (log in with guest/welcome): http://flaskappbuilder.pythonanywhere.com/

dypsilon/frontend-dev-bookmarks
628 days ago · 40.4k

Manually curated collection of resources for frontend web developers.

ascoders/weekly
628 days ago · 26.7k

Frontend Close-Reading Weekly. Helps you understand the most cutting-edge, practical technologies.

ellisonleao/magictools
628 days ago · 12.6k

:video_game: :pencil: A list of Game Development resources to make magic happen.

cloudquery/cloudquery
628 days ago · 5.4k

The open source high performance data integration platform built for developers.

dkhamsing/open-source-ios-apps
627 days ago · 38.3k

:iphone: Collaborative List of Open-Source iOS Apps

apache/spark
627 days ago · 37.8k

Apache Spark - A unified analytics engine for large-scale data processing

multiprocessio/datastation
629 days ago · 2.8k

App to easily query, script, and visualize data from every database, file, and API.

JuliaData/DataFrames.jl
629 days ago · 1.7k

In-memory tabular data in Julia

airbytehq/airbyte
627 days ago · 13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Sinaptik-AI/pandas-ai
627 days ago · 9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

borisdj/EFCore.BulkExtensions
628 days ago · 3.4k

Entity Framework EF Core efcore Bulk Batch Extensions with BulkCopy in .Net for Insert Update Delete Read (CRUD), Truncate and SaveChanges operations on SQL Server, PostgreSQL, MySQL, SQLite

akfamily/akshare
627 days ago · 7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

OdyseeTeam/chainquery
628 days ago · 2.3k

Chainquery parses and syncs the LBRY blockchain data into structured SQL

xlucn/oh-my-foss-android
627 days ago · 1.6k

A personal collection of practical, trustworthy open-source Android apps

SpaceVim/SpaceVim
628 days ago · 20.1k

A community-driven modular vim/neovim distribution - The ultimate vimrc

kaxap/arl
628 days ago · 1.9k

Lists of the most popular repositories for the most favoured programming languages (according to StackOverflow)

taosdata/TDengine
627 days ago · 22.6k

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.

osquery/osquery
628 days ago · 20.9k

SQL powered operating system instrumentation, monitoring, and analytics.

sequelize/sequelize
627 days ago · 28.9k

Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
627 days ago · 14.7k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

orientechnologies/orientdb
629 days ago · 4.7k

OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.

Awesome-HarmonyOS/HarmonyOS
627 days ago · 19.0k

A curated list of awesome things related to HarmonyOS, Huawei's operating system.

apache/avro
631 days ago · 2.7k

Apache Avro is a data serialization system.

pocoproject/poco
627 days ago · 7.7k

The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems.

timescale/timescaledb
628 days ago · 16.2k

An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

apache/shardingsphere
627 days ago · 19.2k

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

knex/knex
627 days ago · 18.5k

A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.

Leantime/leantime
628 days ago · 3.9k

Leantime is a goals-focused project management system for non-project managers. Built with ADHD, autism, and dyslexia in mind.

apache/flink
627 days ago · 22.8k

Apache Flink

cube-js/cube
627 days ago · 16.9k

📊 Cube — The Semantic Layer for Building Data Applications

dataliterate/data-populator
643 days ago · 1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

ClickHouse/ClickHouse
627 days ago · 33.1k

ClickHouse® is a free analytics DBMS for big data

mybatis/mybatis-3
627 days ago · 19.2k

MyBatis SQL mapper framework for Java

forthespada/CS-Books
627 days ago · 17.8k

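🔥🔥 More than 1,000 classic computer science books, personal notes, and the resources referenced in the author's articles published across various platforms. The books cover C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networking, design patterns, frontend, assembly, and interview write-ups for both campus and experienced-hire recruiting.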

drizzle-team/drizzle-orm
627 days ago · 17.3k

Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅

DapperLib/Dapper
627 days ago · 16.9k

Dapper - a simple object mapper for .Net

turbot/steampipe
627 days ago · 6.2k

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

mendel5/alternative-front-ends
627 days ago · 5.9k

Overview of alternative open source front-ends for popular internet platforms (e.g. YouTube, Twitter, etc.)

mhinz/vim-galore
627 days ago · 16.2k

:mortar_board: All things Vim!

dexteryy/spellbook-of-modern-webdev
627 days ago · 16.6k

A Big Picture, Thesaurus, and Taxonomy of Modern JavaScript Web Development

dr5hn/countries-states-cities-database
628 days ago · 6.1k

🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities

glideapps/glide-data-grid
628 days ago · 3.3k

🚀 Glide Data Grid is a no-compromise, outrageously fast React data grid with rich rendering, first-class accessibility, and full TypeScript support.

bolshchikov/js-must-watch
627 days ago · 13.1k

Must-watch videos about JavaScript

flyteorg/flyte
626 days ago · 4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

mahmoud/glom
637 days ago · 1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

ygit/swiftui
631 days ago · 1.2k

A collaborative list of awesome SwiftUI resources. Feel free to contribute!

shekhargulati/52-technologies-in-2016
630 days ago · 7.2k

Let's learn a new technology every week. A new technology blog every Sunday in 2016.

dariubs/GoBooks
626 days ago · 15.6k

List of Golang books

mhadidg/software-architecture-books
626 days ago · 9.0k

A comprehensive list of books on Software Architecture.

duckdb/duckdb
626 days ago · 14.6k

DuckDB is an in-process SQL OLAP Database Management System

phanan/htaccess
627 days ago · 12.5k

✂ A collection of useful .htaccess snippets.

Kanaries/graphic-walker
627 days ago · 2.1k

An open source alternative to Tableau. Easily embedded in any web apps.

rilldata/rill
626 days ago · 1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

datalens-tech/datalens
628 days ago · 1.2k

A modern, scalable analytics system

ankane/blazer
627 days ago · 3.8k

Business intelligence made simple

evidence-dev/evidence
626 days ago · 2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown.

shzlw/poli
633 days ago · 1.9k

An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.

diffgram/diffgram
629 days ago · 1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
629 days ago · 1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

PrefectHQ/prefect
626 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

mage-ai/mage-ai
626 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

risingwavelabs/risingwave
626 days ago · 5.9k

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

kestra-io/kestra
626 days ago · 5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

chartshq/muze
650 days ago · 1.2k

Composable data visualisation library for web with a data-first approach now powered by WebAssembly

slashbaseide/slashbase
627 days ago · 1.3k

Modern database IDE for your dev & data workflows. Supports MySQL, PostgreSQL & MongoDB.

cloudera/hue
627 days ago · 1.0k

Open source SQL Query Assistant service for Databases/Warehouses

jonschlinkert/gray-matter
627 days ago · 3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

fnc12/sqlite_orm
627 days ago · 2.1k

❤️ SQLite ORM light header only library for modern C++

rdbende/Sun-Valley-ttk-theme
626 days ago · 1.6k

A gorgeous theme for Tkinter/ttk, based on the Sun Valley visual style ✨

launchbadge/sqlx
626 days ago · 11.2k

🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.

run-llama/llama_index
626 days ago · 28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
627 days ago · 16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
626 days ago · 15.4k

The official home of the Presto distributed SQL query engine for big data

faker-js/faker
626 days ago · 11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
626 days ago · 10.5k

:orange_book: Chinese Xinhua Dictionary database. Includes xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

PRQL/prql
626 days ago · 9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
626 days ago · 8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
626 days ago · 8.1k

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

mrdbourke/machine-learning-roadmap
626 days ago · 7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
627 days ago · 6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

cube2222/octosql
627 days ago · 4.7k

OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

whoiskatrin/sql-translator
627 days ago · 3.9k

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

roapi/roapi
627 days ago · 3.0k

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

uber/aresdb
638 days ago · 3.0k

A GPU-powered real-time analytics storage and query engine.

adelsz/pgtyped
627 days ago · 2.7k

pgTyped - Typesafe SQL in TypeScript

kayak/pypika
626 days ago · 2.3k

PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.