/

data

sql
data-science
react
data-analysis
python
data-engineering
javascript
csv
pipeline
data-visualization
query
database
visualization
data-integration
elt
etl
machine-learning
fetch
analytics
bi
json
workflow
orchestration
data-pipeline
data-collection
datasets
kubernetes
golang
hooks
cache
stale-while-revalidate
rest

TanStack/query
505日前38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

metabase/metabase
503日前35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

SheetJS/sheetjs
504日前34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

run-llama/llama_index
501日前28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
502日前16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
501日前15.4k

The official home of the Presto distributed SQL query engine for big data

PrefectHQ/prefect
501日前14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

airbytehq/airbyte
502日前13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

faker-js/faker
501日前11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
501日前10.5k

:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。

Sinaptik-AI/pandas-ai
502日前9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

PRQL/prql
501日前9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
501日前8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
501日前8.1k

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

akfamily/akshare
502日前7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

mrdbourke/machine-learning-roadmap
501日前7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
502日前6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

DataEngineer-io/data-engineer-handbook
501日前6.6k

This is a repo with links to everything you'd ever want to learn about data engineering

mage-ai/mage-ai
501日前6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

cloudquery/cloudquery
503日前5.4k

The open source high performance data integration platform built for developers.

kestra-io/kestra
501日前5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

flyteorg/flyte
501日前4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

jonschlinkert/gray-matter
502日前3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

heroku/react-refetch
508日前3.4k

A simple, declarative, and composable way to fetch data for React components

glideapps/glide-data-grid
503日前3.3k

🚀 Glide Data Grid is a no compromise, outrageously react fast data grid with rich rendering, first class accessibility, and full TypeScript support.

uber/aresdb
513日前3.0k

A GPU-powered real-time analytics storage and query engine.

kayak/pypika
501日前2.3k

PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.

Kanaries/graphic-walker
502日前2.1k

An open source alternative to Tableau. Easily embedded in any web apps.

keajs/kea
505日前1.9k

Batteries Included State Management for React

mahmoud/glom
512日前1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

diffgram/diffgram
504日前1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
504日前1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

JuliaData/DataFrames.jl
504日前1.7k

In-memory tabular data in Julia

dataliterate/data-populator
518日前1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

chartshq/muze
525日前1.2k

Composable data visualisation library for web with a data-first approach now powered by WebAssembly

rilldata/rill
501日前1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.