§ The Bluesky hub

Bluesky
scraper.

No proxy needed. 3 pre-built endpoints, parsed JSON in seconds.

3 endpoints · From $0.003 per call · Median under 2s
§ One call

Send a handle.
Get back JSON.

The profile scraper takes one input: handle. The response carries every public field Bluesky exposes.

Request
cURL
curl https://api.hproxy.com/v1/scrape/bluesky/profile/by/handle \
  -H "Authorization: Bearer $HPROXY_KEY" \
  -H "Content-Type: application/json" \
  -d '{"handle":"espn.com"}'
POST // 1 call · $0.0030 · <2s
Response · 200 OK
{
  "success": true,
  "did": "did:plc:x7d6j54pm22ufehkes6jo4jf",
  "handle": "espn.com",
  "displayName": "ESPN",
  "avatar": "https://cdn.bsky.app/img/avatar/plain/….jpeg",
  "description": "Serving sports fans. Anytime. Anywhere.",
  "followersCount": 198477,
  "followsCount": 32,
  "postsCount": 659,
  "createdAt": "2024-11-25T21:49:49.345Z"
}
What comes back: parsed fields
Bluesky Profile (bluesky_profile)
did              string
handle           string
displayName      string
avatar           url
description      string
followersCount   number
followsCount     number
One real call · one structured record
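
For callers who prefer Python to cURL, the same request can be sketched with the standard library alone. The URL, headers, and body mirror the cURL example above; the function names and the 30-second timeout are illustrative assumptions, not part of any official client.

```python
import json
import urllib.request

# Endpoint URL copied from the cURL example on this page.
PROFILE_URL = "https://api.hproxy.com/v1/scrape/bluesky/profile/by/handle"


def build_profile_request(handle: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request the cURL example sends."""
    body = json.dumps({"handle": handle}).encode("utf-8")
    return urllib.request.Request(
        PROFILE_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def fetch_profile(handle: str, api_key: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON body.

    The timeout is an assumed default, chosen to leave headroom above
    the multi-second responses mentioned for heavier endpoints.
    """
    request = build_profile_request(handle, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_profile("espn.com", os.environ["HPROXY_KEY"])` should return a dictionary shaped like the response shown above. Building the request separately from sending it keeps the request easy to inspect or log before any credit is spent.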
§ Coverage

What you can
actually pull.

§ 01

Profiles

Bio, follower and following counts, post totals, verified flag, links, and the profile image — every public field the platform exposes about an account.

§ 02

Posts

Full post objects: post text, like and reply counts, media URLs, and timestamps, pulled by handle or by direct URL.

§ Every endpoint

The full list.
Prices included.

Every line is a live endpoint. Click through for the input schema, response shape, and the curl recipe.

§ What people build

One Bluesky scraper.
Many shapes of work.

§ 01

Audience & creator discovery

Build lead and creator-discovery pipelines from Bluesky public profiles. Filter by follower count, engagement, and region. The JSON is flat enough to drop into a Postgres column without normalisation.
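
A follower-count filter over already-fetched profile records might look like the sketch below. It relies only on fields shown in the profile example (`did`, `handle`, `followersCount`); the helper names and the `(did, handle, followers, raw_json)` row layout are assumptions for illustration, as is the second sample account.

```python
import json
from typing import Iterable


def filter_by_followers(profiles: Iterable[dict], min_followers: int) -> list[dict]:
    """Keep profiles at or above the threshold, largest audience first."""
    kept = [p for p in profiles if p.get("followersCount", 0) >= min_followers]
    return sorted(kept, key=lambda p: p.get("followersCount", 0), reverse=True)


def to_row(profile: dict) -> tuple[str, str, int, str]:
    """Shape one record as (did, handle, followers, raw_json).

    The raw JSON string is what would go into a single JSON column;
    the column layout itself is a hypothetical choice.
    """
    return (
        profile["did"],
        profile["handle"],
        profile["followersCount"],
        json.dumps(profile, sort_keys=True),
    )


profiles = [
    # Values from the profile example on this page.
    {"did": "did:plc:x7d6j54pm22ufehkes6jo4jf", "handle": "espn.com", "followersCount": 198477},
    # Hypothetical small account, included only to exercise the filter.
    {"did": "did:plc:example", "handle": "small.example", "followersCount": 12},
]

rows = [to_row(p) for p in filter_by_followers(profiles, min_followers=1000)]
```

Engagement and region filters would follow the same pattern once the relevant fields are confirmed against a real response.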

§ 02

Brand & trend monitoring

Track every public post mentioning a brand or keyword across Bluesky. Pipe post text into sentiment models and chart engagement over time without writing a single parser.
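
As a rough sketch of the "engagement over time" step, the snippet below filters post records by keyword and totals likes per UTC day. This page does not show the post schema, so the field names `text`, `likeCount`, and `createdAt` are assumptions; adjust them to match the actual response.

```python
from collections import defaultdict


def mentioning(posts: list[dict], keyword: str) -> list[dict]:
    """Return posts whose text contains the keyword, case-insensitively."""
    needle = keyword.lower()
    # "text" is an assumed field name for the post body.
    return [p for p in posts if needle in p.get("text", "").lower()]


def likes_per_day(posts: list[dict]) -> dict[str, int]:
    """Sum likes per calendar day, keyed by YYYY-MM-DD.

    Assumes "createdAt" is an ISO 8601 UTC timestamp (as in the profile
    example), so the first ten characters are the UTC date.
    """
    totals: dict[str, int] = defaultdict(int)
    for post in posts:
        day = post["createdAt"][:10]
        totals[day] += post.get("likeCount", 0)  # "likeCount" is assumed
    return dict(sorted(totals.items()))
```

The per-day totals can go straight into a charting library or a time-series table.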

§ 03

AI training data

Feed clean, structured Bluesky JSON straight into your model fine-tuning pipeline. Same schema every call — no scraping infrastructure to maintain.

§ 04

Dashboards & reporting

Wire Bluesky signals into internal dashboards. One API key, predictable JSON, dollar-denominated pricing — finance can reason about the bill.

§ Questions

The honest
answers.

The questions that come up before the first call. If yours isn’t here, the founder reads support email himself — just write in.

Do I need proxies to use the Bluesky scraper?

No. Every endpoint runs on our residential pool — fifty million IPs sourced through opt-in partner SDKs. You send an HTTP request with your API key; rotation, retries, and anti-bot handling are ours to worry about.

How does HProxy get past Bluesky’s rate limits and bot detection?

Each call is routed through a residential session that matches typical organic traffic patterns. We don’t advertise specifics; what matters is the result — a 99%+ success rate on public-data endpoints in real customer traffic.

What does the JSON response look like?

Every endpoint returns a flat top-level wrapper (platform, scraper, data, creditsUsed, elapsedMs, requestId) with the parsed Bluesky entity inside data. See the live example near the top of this page for the exact shape.
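
A small helper can pull the parsed entity out of a response defensively. The wrapper keys named above (`platform`, `scraper`, `data`, `creditsUsed`, `elapsedMs`, `requestId`) come from this answer, while the profile example on this page shows fields at the top level next to `success`. Rather than guess which shape a given endpoint returns, this sketch accepts either. The function name is hypothetical.

```python
def extract_entity(payload: dict) -> dict:
    """Return the parsed entity whether or not it is nested under "data"."""
    data = payload.get("data")
    if isinstance(data, dict):
        # Wrapper shape: the entity sits inside "data".
        return data
    # Top-level shape, as in the profile example: drop the status flag only.
    return {key: value for key, value in payload.items() if key != "success"}
```

Checking a real response once and then simplifying the helper to the observed shape is likely the cleaner long-term choice.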

Is scraping public Bluesky data legal?

Public data scraping (no login, no private content, no PII beyond what bsky.app itself publishes) sits within the boundaries set by hiQ Labs v. LinkedIn and similar US/EU precedents. We don’t serve endpoints that touch authenticated content. You’re responsible for your own use of the data under your local data-protection law (GDPR, CCPA, etc.).

What does each call cost?

From $0.003 per call on the standard endpoints. Endpoints that do heavier upstream work (AI transcripts, posts with media archived to R2) cost more. The exact per-call price shows on each endpoint row in the full list. One call = one charge, never multiplied by record count.
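
Because each call is a single charge, budgeting reduces to multiplication. The sketch below uses the $0.003 base price quoted above; heavier endpoints cost more, so pass their listed price instead. The function names are illustrative.

```python
from decimal import Decimal

# "From $0.003 per call" on the standard endpoints.
BASE_PRICE = Decimal("0.003")


def estimate_cost(calls: int, price_per_call: Decimal = BASE_PRICE) -> Decimal:
    """Total charge in dollars: one call, one charge."""
    return calls * price_per_call


def calls_for_budget(budget: Decimal, price_per_call: Decimal = BASE_PRICE) -> int:
    """Number of whole calls a dollar budget covers at the given price."""
    return int(budget // price_per_call)
```

At the base price, `estimate_cost(10_000)` comes to 30 dollars and `calls_for_budget(Decimal("2"))` comes to 666 calls. `Decimal` is used instead of `float` so that fractions of a cent do not pick up rounding noise.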

Can I test before signing up?

Yes. The wallet starts with a $2 deposit that’s refundable for 24 hours. At the $0.003 base price that covers up to 666 standard calls, plenty to verify the response shape matches what you’re building across all 3 Bluesky endpoints.

How fast are the responses?

Median response time across the Bluesky endpoints is under 2 seconds. Heavier endpoints (deep comment threads, cursor-paginated lists) can run 4-8 seconds depending on the page depth requested.

§ Try it

Two dollars,
first run.

Sign in with Google, drop $2 in the wallet, fire your first Bluesky call against bsky.app. If it doesn’t do what you need, the balance is refundable for 24 hours.