data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

Last updated: 353 days ago
6.6k

TanStack/query
357 days ago · 38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

keajs/kea
357 days ago · 1.9k

Batteries Included State Management for React

heroku/react-refetch
360 days ago · 3.4k

A simple, declarative, and composable way to fetch data for React components

FredericHeem/starhackit
361 days ago · 1.3k

StarHackIt: React/Native/Node fullstack starter kit with authentication and authorisation, data backed by SQL, the infrastructure deployed with GruCloud

bitwarden/server
355 days ago · 14.0k

The core infrastructure backend (API, database, Docker, etc).

microsoft/azuredatastudio
357 days ago · 7.4k

Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases. Azure Data Studio supports Windows, macOS, and Linux, with immediate capability to connect to Azure SQL and SQL Server. Browse the extension library for more database support options including MySQL, PostgreSQL, and MongoDB.

serhii-londar/open-source-mac-os-apps
355 days ago · 38.7k

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

metabase/metabase
355 days ago · 35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

lk-geimfari/awesomo
356 days ago · 9.2k

Cool open source projects. Choose your project and get involved in Open Source development now.

pingcap/tidb
355 days ago · 35.7k

TiDB is an open-source, cloud-native, distributed, MySQL-compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at: https://tidbcloud.com/free-trial

SheetJS/sheetjs
356 days ago · 34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

cockroachdb/cockroach
355 days ago · 28.7k

CockroachDB - the open source, cloud-native distributed SQL database.

electric-sql/electric
356 days ago · 4.0k

Local-first sync layer for web and mobile apps. Build reactive, realtime, local-first apps directly on Postgres.

amitshekhariitbhu/Android-Debug-Database
355 days ago · 8.3k

A library for debugging android databases and shared preferences - Make Debugging Great Again

OffcierCia/DeFi-Developer-Road-Map
355 days ago · 9.3k

DeFi Developer roadmap is a curated Developer handbook which includes a list of the best tools for DApps development, resources and references!

dpgaspar/Flask-AppBuilder
355 days ago · 4.5k

Simple and rapid application development framework, built on top of Flask. Includes detailed security, auto CRUD generation for your models, Google Charts and much more. Demo (login with guest/welcome) - http://flaskappbuilder.pythonanywhere.com/

dypsilon/frontend-dev-bookmarks
355 days ago · 40.4k

Manually curated collection of resources for frontend web developers.

ascoders/weekly
355 days ago · 26.7k

Front-end close-reading weekly. Helps you understand the most cutting-edge, practical technologies.

ellisonleao/magictools
355 days ago · 12.6k

:video_game: :pencil: A list of Game Development resources to make magic happen.

cloudquery/cloudquery
355 days ago · 5.4k

The open source high performance data integration platform built for developers.

dkhamsing/open-source-ios-apps
354 days ago · 38.3k

:iphone: Collaborative List of Open-Source iOS Apps

apache/spark
354 days ago · 37.8k

Apache Spark - A unified analytics engine for large-scale data processing

multiprocessio/datastation
356 days ago · 2.8k

App to easily query, script, and visualize data from every database, file, and API.

JuliaData/DataFrames.jl
356 days ago · 1.7k

In-memory tabular data in Julia

airbytehq/airbyte
354 days ago · 13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Sinaptik-AI/pandas-ai
354 days ago · 9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

borisdj/EFCore.BulkExtensions
355 days ago · 3.4k

Entity Framework EF Core efcore Bulk Batch Extensions with BulkCopy in .Net for Insert Update Delete Read (CRUD), Truncate and SaveChanges operations on SQL Server, PostgreSQL, MySQL, SQLite

akfamily/akshare
354 days ago · 7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

OdyseeTeam/chainquery
355 days ago · 2.3k

Chainquery parses and syncs the LBRY blockchain data into structured SQL

xlucn/oh-my-foss-android
354 days ago · 1.6k

A personal collection of practical, trustworthy open-source Android apps

SpaceVim/SpaceVim
355 days ago · 20.1k

A community-driven modular vim/neovim distribution - The ultimate vimrc

kaxap/arl
355 days ago · 1.9k

Lists of the most popular repositories for the most favoured programming languages (according to StackOverflow)

taosdata/TDengine
354 days ago · 22.6k

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.

osquery/osquery
355 days ago · 20.9k

SQL powered operating system instrumentation, monitoring, and analytics.

sequelize/sequelize
354 days ago · 28.9k

Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
354 days ago · 14.7k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

orientechnologies/orientdb
356 days ago · 4.7k

OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.

Awesome-HarmonyOS/HarmonyOS
354 days ago · 19.0k

A curated list of awesome things related to HarmonyOS (Huawei's HarmonyOS operating system).

apache/avro
358 days ago · 2.7k

Apache Avro is a data serialization system.

pocoproject/poco
354 days ago · 7.7k

The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems.

timescale/timescaledb
355 days ago · 16.2k

An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

apache/shardingsphere
354 days ago · 19.2k

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

knex/knex
354 days ago · 18.5k

A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.

Leantime/leantime
355 days ago · 3.9k

Leantime is a goals-focused project management system for non-project managers. Built with ADHD, autism, and dyslexia in mind.

apache/flink
354 days ago · 22.8k

Apache Flink

cube-js/cube
354 days ago · 16.9k

📊 Cube — The Semantic Layer for Building Data Applications

dataliterate/data-populator
370 days ago · 1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

ClickHouse/ClickHouse
354 days ago · 33.1k

ClickHouse® is a free analytics DBMS for big data

mybatis/mybatis-3
354 days ago · 19.2k

MyBatis SQL mapper framework for Java

forthespada/CS-Books
354 days ago · 17.8k

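🔥🔥 More than 1,000 classic computer science books, personal notes, and resources referenced in the author's articles published on various platforms. The books cover C/C++, Java, Python, Go, data structures and algorithms, operating systems, back-end architecture, computer systems, databases, computer networking, design patterns, front-end development, assembly, and interview write-ups for campus and experienced-hire recruiting.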

drizzle-team/drizzle-orm
354 days ago · 17.3k

Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅

DapperLib/Dapper
354 days ago · 16.9k

Dapper - a simple object mapper for .Net

turbot/steampipe
354 days ago · 6.2k

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

mendel5/alternative-front-ends
354 days ago · 5.9k

Overview of alternative open source front-ends for popular internet platforms (e.g. YouTube, Twitter, etc.)

mhinz/vim-galore
354 days ago · 16.2k

:mortar_board: All things Vim!

dexteryy/spellbook-of-modern-webdev
354 days ago · 16.6k

A Big Picture, Thesaurus, and Taxonomy of Modern JavaScript Web Development

dr5hn/countries-states-cities-database
355 days ago · 6.1k

🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities

glideapps/glide-data-grid
355 days ago · 3.3k

🚀 Glide Data Grid is a no-compromise, outrageously fast React data grid with rich rendering, first-class accessibility, and full TypeScript support.

bolshchikov/js-must-watch
354 days ago · 13.1k

Must-watch videos about JavaScript

flyteorg/flyte
353 days ago · 4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

mahmoud/glom
364 days ago · 1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

ygit/swiftui
358 days ago · 1.2k

A collaborative list of awesome SwiftUI resources. Feel free to contribute!

shekhargulati/52-technologies-in-2016
357 days ago · 7.2k

Let's learn a new technology every week. A new technology blog every Sunday in 2016.

dariubs/GoBooks
353 days ago · 15.6k

List of Golang books

mhadidg/software-architecture-books
353 days ago · 9.0k

A comprehensive list of books on Software Architecture.

duckdb/duckdb
353 days ago · 14.6k

DuckDB is an in-process SQL OLAP Database Management System

phanan/htaccess
354 days ago · 12.5k

✂ A collection of useful .htaccess snippets.

Kanaries/graphic-walker
354 days ago · 2.1k

An open source alternative to Tableau. Easily embedded in any web apps.

rilldata/rill
353 days ago · 1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

datalens-tech/datalens
355 days ago · 1.2k

A modern, scalable analytics system

ankane/blazer
354 days ago · 3.8k

Business intelligence made simple

evidence-dev/evidence
353 days ago · 2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown.

shzlw/poli
360 days ago · 1.9k

An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.

diffgram/diffgram
356 days ago · 1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
356 days ago · 1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

PrefectHQ/prefect
353 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

mage-ai/mage-ai
353 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

risingwavelabs/risingwave
353 days ago · 5.9k

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

kestra-io/kestra
353 days ago · 5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

chartshq/muze
377 days ago · 1.2k

Composable data visualisation library for web with a data-first approach now powered by WebAssembly

slashbaseide/slashbase
354 days ago · 1.3k

Modern database IDE for your dev & data workflows. Supports MySQL, PostgreSQL & MongoDB.

cloudera/hue
354 days ago · 1.0k

Open source SQL Query Assistant service for Databases/Warehouses

jonschlinkert/gray-matter
354 days ago · 3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

fnc12/sqlite_orm
354 days ago · 2.1k

❤️ SQLite ORM light header only library for modern C++

rdbende/Sun-Valley-ttk-theme
353 days ago · 1.6k

A gorgeous theme for Tkinter/ttk, based on the Sun Valley visual style ✨

launchbadge/sqlx
353 days ago · 11.2k

🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.

run-llama/llama_index
353 days ago · 28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
354 days ago · 16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
353 days ago · 15.4k

The official home of the Presto distributed SQL query engine for big data

faker-js/faker
353 days ago · 11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
353 days ago · 10.5k

:orange_book: Chinese Xinhua Dictionary database. Includes xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

PRQL/prql
353 days ago · 9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
353 days ago · 8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
353 days ago · 8.1k

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

mrdbourke/machine-learning-roadmap
353 days ago · 7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
354 days ago · 6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

cube2222/octosql
354 days ago · 4.7k

OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

whoiskatrin/sql-translator
354 days ago · 3.9k

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

roapi/roapi
354 days ago · 3.0k

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

uber/aresdb
365 days ago · 3.0k

A GPU-powered real-time analytics storage and query engine.

adelsz/pgtyped
354 days ago · 2.7k

pgTyped - Typesafe SQL in TypeScript

kayak/pypika
353 days ago · 2.3k

PyPika is a Python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.