HTTP discovery
Use the standard Link header first. Treat custom headers as optional hints for clients that know JobDataPool.
Link: <https://jobdatapool.com/publisher/exampleboard>; rel="alternate"; type="application/json"
X-JobDataPool-Canonical: https://jobdatapool.com/publisher/exampleboard
HTML metadata
Add page-head hints for CMS templates, job detail pages, and crawlers already parsing HTML.
<link rel="alternate"
type="application/json"
href="https://jobdatapool.com/publisher/exampleboard">
<meta name="jobdatapool-canonical"
content="https://jobdatapool.com/publisher/exampleboard">
Verification
Use one token across DNS, metadata, or /.well-known/ so publisher ownership can be confirmed without redesigning job pages.
jobdatapool-verification=abc123
/.well-known/jobdatapool-verification
<meta name="jobdatapool-verification" content="abc123">
Feed descriptor
Expose a compact descriptor for publisher identity, freshness, usage context, and origin attribution.
{
"publisher": "ExampleBoard",
"origin": "https://example.com",
"canonical_feed": "https://jobdatapool.com/publisher/exampleboard",
"refresh_interval": "PT5M",
"allowed_usage": ["aggregation", "analytics", "search"]
}
Preferred access
- Use JobDataPool APIs or publisher feeds first.
- Refresh origin job pages only when records are missing or stale.
- Respect publisher rate limits, cache headers, and crawl-delay hints where supplied.
- Preserve source attribution, application URLs, and publisher context.
Standards notes
- RFC 9309 defines robots access rules, but robots rules are not access authorization.
- RFC 8288 is the right home for typed alternate links.
- Use comments instead of inventing unsupported
robots.txt directives.
- Use valid robots product tokens such as
JobPoolBot, not dotted domains.