Request a tool
All toolsMCP serverRequest a toolPlatformsCategories
Stack Overflow / Stack Exchange Scraper icon

Stack Overflow / Stack Exchange Scraper

Search Stack Overflow and Stack Exchange by keyword or tags. Get structured questions with score, answers, views, tags, author, and link as JSON or CSV.

Run this in the cloudRun on Apify →

Developer & Research Tools

How it works

  1. 1
    Open it on Apify

    Hit Run on Apify — it opens the tool in the cloud, no install.

  2. 2
    Set the inputs

    Adjust query, site, tags (sensible defaults are pre-filled).

  3. 3
    Click Run

    The tool runs on Apify’s cloud and collects the data for you.

  4. 4
    Export the results

    Download as JSON, CSV or Excel, or pipe straight into your app, Google Sheets, or an AI agent.

Inputs

FieldWhat it doesType
queryKeywords to search for in question titles and bodies (e.g. "async await", "git rebase conflict"). Can be left empty if you provide one or more Tags instead.string
siteWhich Stack Exchange network site to search.string
tagsComma-separated tags to filter by, e.g. "javascript,promise" or "python,pandas". Optional. A question must carry ALL listed tags. You can search by tags alone with an empty query.string
sortByOrdering of results. "votes" = highest score first, "relevance" = best keyword match, "creation" = newest, "activity" = most recently active.string
maxItemsMaximum number of questions to return. The actor paginates the API (100 per page) until this many are collected or there are no more results.integer
notionConnectorOptional. Write each question as a page into your Notion when the run finishes. Authorize a Notion connector once in Settings → API & Integrations → MCP connectors, then pick it here. Leave empty to skip (default) — results are always saved to the dataset regardless.string
notionParentIdOptional. The Notion data source ID of the database to write into (only used if a Notion connector is set). Leave empty to create the pages privately in your workspace instead.string

What you get

A structured dataset — each result includes fields like:

answerCountbodycreatedAtisAnsweredownerNameownerReputationquestionIdscoretagstitleurlviewCount

Export every run as JSON, CSV or Excel, or send it to your app, a database, Google Sheets, or an AI agent.

2 ready-to-run use cases

Top Pandas Questions on Stack Overflow by Votes

The highest-voted Stack Overflow questions for the pandas tag, ranked by score, with title, votes, views, answers and links. Handy for Python docs and FAQs.

Stack Overflow Keyword Search: async/await Questions

Searching Stack Overflow for async/await? Get matching questions ranked by relevance, with score, view counts and answers, exported to JSON or CSV.

Stack Overflow / Stack Exchange Scraper

Search Stack Overflow — or any of the big Stack Exchange sites (Server Fault, Super User, Ask Ubuntu, MathOverflow, Software Engineering) — by keyword and/or tags, and get back clean, structured questions. No API key, no login, no anti-bot.

It talks to the public api.stackexchange.com API directly, so results are fast and reliable. It paginates for you, dedupes by question id, and returns the question body as plain text (HTML stripped).

What you get per question

questionId, title, url, score, answerCount, viewCount, isAnswered, tags, ownerName, ownerReputation, createdAt (ISO), and body (the question text, HTML stripped).

Input

FieldNotes
queryKeywords (title + body). Optional if you give tags.
sitestackoverflow, serverfault, superuser, askubuntu, mathoverflow, softwareengineering.
tagsComma-separated, e.g. javascript,promise. A question must carry all of them.
sortByvotes (default), relevance, creation, activity.
maxItemsHow many questions to collect (paginates the API in pages of 100).

You must provide a query or at least one tag.

Output

One dataset row per question. Empty searches return a single diagnostic row and are not charged.

Example

{ "query": "async await", "site": "stackoverflow", "tags": "javascript", "sortBy": "votes", "maxItems": 50 }

Notes

The unauthenticated Stack Exchange API allows ~300 requests/day per IP. The actor respects the API's backoff directive (sleeps when asked) and stops cleanly when the daily quota is exhausted, returning a clear diagnostic instead of failing silently. For very large jobs, split by tag, keyword, or site.