Stack Overflow / Stack Exchange Scraper

Search Stack Overflow and Stack Exchange by keyword or tags. Get structured questions with score, answers, views, tags, author, and link as JSON or CSV.

Run this in the cloudRun on Apify →

Developer & Research Tools

How it works

1
Open it on Apify
Hit Run on Apify — it opens the tool in the cloud, no install.
2
Set the inputs
Adjust query, site, tags (sensible defaults are pre-filled).
3
Click Run
The tool runs on Apify’s cloud and collects the data for you.
4
Export the results
Download as JSON, CSV or Excel, or pipe straight into your app, Google Sheets, or an AI agent.

Inputs

Field	What it does	Type
`query`	Keywords to search for in question titles and bodies (e.g. "async await", "git rebase conflict"). Can be left empty if you provide one or more Tags instead.	string
`site`	Which Stack Exchange network site to search.	string
`tags`	Comma-separated tags to filter by, e.g. "javascript,promise" or "python,pandas". Optional. A question must carry ALL listed tags. You can search by tags alone with an empty query.	string
`sortBy`	Ordering of results. "votes" = highest score first, "relevance" = best keyword match, "creation" = newest, "activity" = most recently active.	string
`maxItems`	Maximum number of questions to return. The actor paginates the API (100 per page) until this many are collected or there are no more results.	integer
`notionConnector`	Optional. Write each question as a page into your Notion when the run finishes. Authorize a Notion connector once in Settings → API & Integrations → MCP connectors, then pick it here. Leave empty to skip (default) — results are always saved to the dataset regardless.	string
`notionParentId`	Optional. The Notion data source ID of the database to write into (only used if a Notion connector is set). Leave empty to create the pages privately in your workspace instead.	string

What you get

A structured dataset — each result includes fields like:

answerCountbodycreatedAtisAnsweredownerNameownerReputationquestionIdscoretagstitleurlviewCount

Export every run as JSON, CSV or Excel, or send it to your app, a database, Google Sheets, or an AI agent.

2 ready-to-run use cases

Stack Overflow Keyword Search: async/await Questions

Searching Stack Overflow for async/await? Get matching questions ranked by relevance, with score, view counts and answers, exported to JSON or CSV.

Stack Overflow / Stack Exchange Scraper

Search Stack Overflow — or any of the big Stack Exchange sites (Server Fault, Super User, Ask Ubuntu, MathOverflow, Software Engineering) — by keyword and/or tags, and get back clean, structured questions. No API key, no login, no anti-bot.

It talks to the public api.stackexchange.com API directly, so results are fast and reliable. It paginates for you, dedupes by question id, and returns the question body as plain text (HTML stripped).

What you get per question

questionId, title, url, score, answerCount, viewCount, isAnswered, tags, ownerName, ownerReputation, createdAt (ISO), and body (the question text, HTML stripped).

Input

Field	Notes
`query`	Keywords (title + body). Optional if you give `tags`.
`site`	`stackoverflow`, `serverfault`, `superuser`, `askubuntu`, `mathoverflow`, `softwareengineering`.
`tags`	Comma-separated, e.g. `javascript,promise`. A question must carry all of them.
`sortBy`	`votes` (default), `relevance`, `creation`, `activity`.
`maxItems`	How many questions to collect (paginates the API in pages of 100).

You must provide a query or at least one tag.

Output

One dataset row per question. Empty searches return a single diagnostic row and are not charged.

Example

{ "query": "async await", "site": "stackoverflow", "tags": "javascript", "sortBy": "votes", "maxItems": 50 }

Notes

The unauthenticated Stack Exchange API allows ~300 requests/day per IP. The actor respects the API's backoff directive (sleeps when asked) and stops cleanly when the daily quota is exhausted, returning a clear diagnostic instead of failing silently. For very large jobs, split by tag, keyword, or site.

Stack Overflow / Stack Exchange Scraper

How it works

Inputs

What you get

2 ready-to-run use cases

Top Pandas Questions on Stack Overflow by Votes

Stack Overflow Keyword Search: async/await Questions

Stack Overflow / Stack Exchange Scraper

What you get per question

Input

Output

Example

Notes