The data engineering community in 2026 splits across discipline lines that did not exist in 2020. Analytics engineers cluster around dbt and the modern data stack. ML engineers cluster around MLOps and the model-serving ecosystem. AI engineers cluster around LLM-applied tools and the Latent Space orbit. Data engineers proper cluster around the data infrastructure layer that underlies all three.
The map below covers the major communities in each segment, with approximate member counts where public, the discussion vibe, and what each community values in vendor participation. The communities are not mutually exclusive: most senior data engineers participate in 2 to 5 of these at once.
Slacks and Discords
dbt Community Slack (~50,000 members)
The largest analytics-engineering community by far. Run by dbt Labs, but the discussion is genuinely cross-vendor. Strong for analytics-engineer and DE-adjacent hiring and for tool discussion. Channels include #dbt-best-practices, #analytics-engineering, #tools-and-utilities, plus city-specific and language-specific channels. Marketing rules: paid sponsorships in #i-made-this and tool-launch channels are accepted; cold DMs are discouraged.
MLOps Community Slack (~30,000 members)
The largest production-ML community. Strong for MLE and DE-MLE overlap roles. Channels include #mlops, #feature-stores, #model-serving, and #llm-ops. Run by Demetrios Brinkmann. Active podcast and conference (MLOps World) attached. Marketing rules: sponsorships and featured placements are paid and structured; organic participation precedes successful product placement.
Locally Optimistic
Analytics-DS community, smaller than dbt Slack but very high signal. Centered on the Locally Optimistic blog and Slack. Strong for analytics engineer and analytics DS hiring. Distinct vibe: skeptical of vendor marketing, values practitioner-to-practitioner content.
Latent Space Discord
The center of LLM-applied AI engineering culture in 2026. Active channels for prompt engineering, agents, evaluation, and RAG patterns. Podcast and AI Engineer Summit conferences attached. Marketing rules: paid sponsorships in newsletter and podcast; community channels are organic-only.
Eleuther AI Discord
Research-leaning ML and LLM community. Stronger on training, finetuning, and open-source model work than on LLM-applied product work. Active for AI research-leaning hiring; less useful for LLM application hiring.
Nous Research Discord
Open-source LLM research community. Smaller than Eleuther but active. Useful for sourcing applied scientists and research-leaning AI engineers.
Subreddits
r/dataengineering (240K+ members)
The largest data engineering subreddit. Monthly "Who's Hiring?" threads (free posting for companies). Discussion vibe is practical and tool-focused. Strong for remote roles and engineering-culture sells. Vendor marketing rules: paid promoted posts go through Reddit Ads; organic posts must follow subreddit rules (no salesy language, no copy-paste JDs).
r/MachineLearning (~3M members)
Largest ML subreddit. Mix of research and applied content. Less useful for production-MLE recruiting (research-skewed audience) but strong for brand-building and content distribution.
r/LocalLLaMA (~250K members)
Local-LLM community, strong on open-source model deployment, hardware, and fine-tuning. Strong audience for AI infrastructure tools.
Newsletters
Data Engineering Weekly (~25,000 subscribers)
The largest data-engineering-specific newsletter. Curated weekly. Strong for vendor sponsorship reaching the DE-IC audience directly.
The Pragmatic Engineer (~700,000 subscribers)
Broader than data engineering but with strong data and infrastructure coverage. Reaches engineering leaders and senior ICs across disciplines. Higher sponsorship cost matches the audience size.
MLOps Community Newsletter
Attached to the Slack and podcast. Strong for MLE-specific vendor sponsorship.
Latent Space Newsletter
The largest AI engineering-flavored newsletter. Attached to the podcast and conference. Strong for AI infrastructure vendor sponsorship.
Benn Stancil's Substack
Analytics and modern data stack commentary. Smaller audience than DE Weekly but very high signal for analytics-engineering vendor sponsorship.
Joe Reis on Data
Written by Joe Reis, co-author of "Fundamentals of Data Engineering." The newsletter and related content reach senior DE practitioners directly.
Tristan Handy's Substack
Written by the dbt Labs founder. Reaches the analytics-engineering leadership audience. Smaller than The Pragmatic Engineer but extremely high signal.
Podcasts
Data Engineering Podcast
Longest-running data engineering podcast (started 2017). Senior IC and engineering leadership audience. Strong for vendor sponsorship.
Analytics Engineering Podcast
Attached to dbt Labs. Analytics engineering audience.
MLOps Community Podcast
Production-MLE and MLOps audience. Attached to the Slack and conference.
Latent Space Podcast
AI engineering audience. Co-hosts swyx and Alessio. Strong vendor sponsorship traction with AI infrastructure tools.
The Data Stack Show
Tooling-focused data engineering podcast. Vendor-friendly format.
Conferences
Data Council
The largest independent data engineering conference (2,000-3,000 attendees). Strong speaking-slot opportunity for DE-focused vendors.
dbt Coalesce
dbt Labs' annual conference. Analytics-engineering focused. Strong for vendor sponsorship in the analytics-DE space.
Snowflake Summit / Databricks Data + AI Summit
Vendor-led conferences with 10,000+ attendees each. Strong for ecosystem-adjacent vendor sponsorship.
Subsurface
Dremio's lakehouse-focused conference. Strong for data lake and table format vendors.
MLOps World
The MLOps Community's annual conference. Production-MLE focused.
AI Engineer Summit
Attached to Latent Space. The largest LLM-applied engineering conference. Strong for AI infrastructure vendor sponsorship.
NeurIPS / ICML / ICLR
Academic ML conferences. Research-leaning audience. Useful for research-leaning AI engineer recruiting and academic-flavored vendor brand-building.
Other (cross-cutting)
Hacker News
Not a data-specific community but functions as the cross-cutting layer where data engineers consume technical content and discuss tool launches. Show HN is the primary launch channel for data infrastructure tools.
GitHub
OSS contributions are a primary discovery mechanism for data infrastructure tools. Star counts on relevant projects (Airflow, dbt-core, DuckDB, Polars) signal both adoption and credibility.
LinkedIn (engineering leadership audience)
The data engineering leadership audience is active on LinkedIn (managers, directors, VPs, CTOs). The IC audience is less so. Use LinkedIn for leadership-targeted content; use other channels for IC-targeted content.