r/nyc — downvoted but talking

methodology & data source

Data source: Reddit's public JSON API.
Endpoint: https://www.reddit.com/r/nyc/new.json?limit=100
Paginated via the after cursor. Reddit caps listing pagination at roughly 1,000 posts, which covers about 50–60 days at r/nyc's posting volume. Each scrape merges into the existing dataset, so the date range grows over time.
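A minimal sketch of the fetch-and-merge loop described above, not the actual contents of scrape.py. The function names (fetch_page, fetch_all, merge), the User-Agent string, and the choice to key the merge on each post's id are assumptions made for illustration; the listing layout (data.children[].data and data.after) follows Reddit's public JSON format.

```python
import json
import urllib.request

LISTING_URL = "https://www.reddit.com/r/nyc/new.json?limit=100"


def fetch_page(after=None):
    """Fetch one listing page and return (posts, next_after_cursor)."""
    url = LISTING_URL + (f"&after={after}" if after else "")
    # Hypothetical User-Agent; Reddit asks clients to send a descriptive one.
    req = urllib.request.Request(url, headers={"User-Agent": "rnyc-sketch/0.1"})
    with urllib.request.urlopen(req, timeout=30) as resp:
        listing = json.load(resp)["data"]
    posts = [child["data"] for child in listing["children"]]
    return posts, listing["after"]


def fetch_all(fetch=fetch_page, max_pages=10):
    """Follow the after cursor until it runs out or max_pages is reached.

    10 pages of 100 posts matches the ~1,000-post pagination cap.
    """
    posts, after = [], None
    for _ in range(max_pages):
        page, after = fetch(after)
        posts.extend(page)
        if not after:  # no cursor means there are no further pages
            break
    return posts


def merge(existing, fresh):
    """Merge by post id (assumed key); fresh data overwrites stale data."""
    by_id = {post["id"]: post for post in existing}
    for post in fresh:
        by_id[post["id"]] = post
    return list(by_id.values())
```

Overwriting on merge means a post's score and comment count reflect the most recent scrape, while posts that have aged out of the listing stay in the dataset.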

What we're looking for: posts that were heavily downvoted but still generated real conversation. Reddit floors displayed post scores at 0, so a score of 0 alongside dozens of comments means downvotes at least cancelled out the upvotes, yet people still showed up to argue.

Inclusion criteria (10+ comments required for all):

Sorting: c/s = comments / max(score, 1). A score of 0 is treated as 1, which avoids division by zero and lets zero-score posts with more comments rank higher. Click column headers to re-sort.
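As a rough illustration of the default ordering, the ratio can be computed and sorted like this. The field names score and num_comments are the ones Reddit uses in its JSON; whether data.json keeps those names is an assumption, and the function names are hypothetical.

```python
def comment_ratio(post):
    """c/s = comments / max(score, 1); a score of 0 counts as 1."""
    return post["num_comments"] / max(post["score"], 1)


def sort_by_ratio(posts):
    """Highest comments-per-score first."""
    return sorted(posts, key=comment_ratio, reverse=True)
```

With this formula a zero-score post with 40 comments gets c/s = 40, while a post with score 10 and 60 comments gets c/s = 6, so the buried post ranks first.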

Limitations:

Source code: scrape.py fetches and merges posts. index.html is this page. No server, no database — just a JSON file and a static page.

score = 0 (mass downvoted)
terrible ratio (comments ≥ 5× score)
dimmed = long tail (everything else with 10+ comments)
# title score comments c/s
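The three legend categories can be read as a simple classification rule. The sketch below is one plausible reading, not the page's actual logic: the tier names, the order in which the rules are checked, and the score and num_comments field names are all assumptions.

```python
MIN_COMMENTS = 10      # 10+ comments required for every tier
RATIO_THRESHOLD = 5    # "terrible ratio" means comments >= 5x score


def tier(post):
    """Return a hypothetical tier label, or None if the post is excluded."""
    score, comments = post["score"], post["num_comments"]
    if comments < MIN_COMMENTS:
        return None                # fails the inclusion criterion
    if score == 0:
        return "zero_score"        # mass downvoted
    if comments >= RATIO_THRESHOLD * score:
        return "terrible_ratio"
    return "long_tail"             # everything else, shown dimmed
```

For example, a post with score 4 and 20 comments sits exactly on the 5× boundary and lands in the terrible-ratio tier.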