Open Research Initiative

Mapping the DNA of humor

An open taxonomy, dataset, and set of benchmarks to help AI understand why something is funny — across cultures, formats, and contexts. A project incubated by Midtown Show.

Join the early contributors Read the overview
Dataset

Cross-cultural comedy corpus

Curated clips, transcripts, and annotations labeled by humor type, setup→punch structure, and audience reaction timing.

Taxonomy

Structured humor ontology

A research-backed schema unifying comedic devices (irony, misdirection, call-back), formats, tones, and social context.

Benchmarks

Evaluation & leaderboards

Standard tasks for detection, generation, and alignment — with reproducible protocols and baselines.

Community

Open collaboration

Researchers, comedians, and curators co-designing annotation guidelines and culturally aware test sets.

humor-types: misdirection, contrast, wordplay, character, meta, deadpan… formats: stand-up, sketch, improv, memes, sitcoms… signals: laugh-timing, applause, groans, silence

Why now

AI can code and compose — but still struggles with comedy.

Humor is context, culture, and timing. We’re building shared infrastructure — a transparent taxonomy, open datasets, and strong benchmarks — so models can learn structure, not just style. Our north star: safer, more respectful, culturally aware comedic understanding that supports human creators.

Tracks

HG-Taxonomy

Consensus-driven schema for humor categories and devices with clear annotation rules and edge-case guidance.

HG-Corpus

Multilingual, multi-format dataset with clips, transcripts, and crowd+expert labels. Opt-in ethics and creator credit.

HG-Bench

Detection, classification, and generation tasks; leaderboards and baselines for fair comparison.

HG-Tools

Open-source scripts for segmentation, laugh-detection, and timing alignment; lightweight SDK for experiments.

Get involved

Researchers, comedians, and curators — we’re building this together.

Sign up for early contributor updates, annotation pilots, and kickoff calls. We’ll also announce the initial taxonomy draft and submission guidelines here.

No spam. 1–2 updates per month.
Backed by Midtown Show, a New York Benefit Corporation.