Open Research Initiative

Mapping the DNA of humor

An open taxonomy, dataset, and set of benchmarks to help AI understand why something is funny — across cultures, formats, and contexts. A project incubated by Midtown Show.

Join the early contributors Read the overview

Dataset

Cross-cultural comedy corpus

Curated clips, transcripts, and annotations labeled by humor type, setup→punch structure, and audience reaction timing.

Taxonomy

Structured humor ontology

A research-backed schema unifying comedic devices (irony, misdirection, call-back), formats, tones, and social context.

Benchmarks

Evaluation & leaderboards

Standard tasks for detection, generation, and alignment — with reproducible protocols and baselines.

Community

Open collaboration

Researchers, comedians, and curators co-designing annotation guidelines and culturally aware test sets.

humor-types: misdirection, contrast, wordplay, character, meta, deadpan… formats: stand-up, sketch, improv, memes, sitcoms… signals: laugh-timing, applause, groans, silence

Why now

AI can code and compose — but still struggles with comedy.

Humor is context, culture, and timing. We’re building shared infrastructure — a transparent taxonomy, open datasets, and strong benchmarks — so models can learn structure, not just style. Our north star: safer, more respectful, culturally aware comedic understanding that supports human creators.

Tracks

HG-Taxonomy

Consensus-driven schema for humor categories and devices with clear annotation rules and edge-case guidance.

HG-Corpus

Multilingual, multi-format dataset with clips, transcripts, and crowd+expert labels. Opt-in ethics and creator credit.

HG-Bench

Detection, classification, and generation tasks; leaderboards and baselines for fair comparison.

HG-Tools

Open-source scripts for segmentation, laugh-detection, and timing alignment; lightweight SDK for experiments.

Get involved

Researchers, comedians, and curators — we’re building this together.

Sign up for early contributor updates, annotation pilots, and kickoff calls. We’ll also announce the initial taxonomy draft and submission guidelines here.

Backed by Midtown Show, a New York Benefit Corporation.