data

sql
data-science
react
data-analysis
python
data-engineering
javascript
csv
pipeline
data-visualization
query
database
visualization
data-integration
elt
etl
machine-learning
fetch
analytics
bi
json
workflow
orchestration
data-pipeline
data-collection
datasets
kubernetes
golang
hooks
cache
stale-while-revalidate
rest

TanStack/query
687 days ago · 38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

metabase/metabase
685 days ago · 35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋

SheetJS/sheetjs
686 days ago · 34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

run-llama/llama_index
683 days ago · 28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
684 days ago · 16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
683 days ago · 15.4k

The official home of the Presto distributed SQL query engine for big data

PrefectHQ/prefect
683 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

airbytehq/airbyte
684 days ago · 13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

faker-js/faker
683 days ago · 11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
683 days ago · 10.5k

📙 Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

Sinaptik-AI/pandas-ai
684 days ago · 9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

PRQL/prql
683 days ago · 9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
683 days ago · 8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
683 days ago · 8.1k

📇 A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

akfamily/akshare
684 days ago · 7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

mrdbourke/machine-learning-roadmap
683 days ago · 7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
684 days ago · 6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

DataEngineer-io/data-engineer-handbook
683 days ago · 6.6k

This is a repo with links to everything you'd ever want to learn about data engineering

mage-ai/mage-ai
683 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

cloudquery/cloudquery
685 days ago · 5.4k

The open-source, high-performance data integration platform built for developers.

kestra-io/kestra
683 days ago · 5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

flyteorg/flyte
683 days ago · 4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

jonschlinkert/gray-matter
684 days ago · 3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

heroku/react-refetch
690 days ago · 3.4k

A simple, declarative, and composable way to fetch data for React components

glideapps/glide-data-grid
685 days ago · 3.3k

🚀 Glide Data Grid is a no-compromise, outrageously fast React data grid with rich rendering, first-class accessibility, and full TypeScript support.

uber/aresdb
695 days ago · 3.0k

A GPU-powered real-time analytics storage and query engine.

kayak/pypika
683 days ago · 2.3k

PyPika is a Python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.

Kanaries/graphic-walker
684 days ago · 2.1k

An open-source alternative to Tableau. Easily embedded in any web app.

keajs/kea
687 days ago · 1.9k

Batteries Included State Management for React

mahmoud/glom
694 days ago · 1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

diffgram/diffgram
686 days ago · 1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
686 days ago · 1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

JuliaData/DataFrames.jl
686 days ago · 1.7k

In-memory tabular data in Julia

dataliterate/data-populator
700 days ago · 1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

chartshq/muze
707 days ago · 1.2k

Composable data visualisation library for the web with a data-first approach, now powered by WebAssembly

rilldata/rill
683 days ago · 1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.