Data sources

A data source is the unit of attribution inside a sandbox. Anything feeding content in — an OAuth’d integration, a folder of files, a custom firehose — is registered as a data source first, and every byte ingested through it is tagged with its data_source_id. Sources are the things you list, pause, resume, and disconnect when you want to control what’s flowing into your sandbox.
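To make the attribution idea concrete, here is a toy sketch (the record shape and helper are illustrative assumptions, not real SDK types): every ingested record carries the data_source_id of the source it arrived through, so you can always trace content back to a source.

```typescript
// Hypothetical shape: each ingested record is tagged with its source's ID.
interface IngestedRecord {
  data_source_id: string;
  content: string;
}

// Count records per source, e.g. to audit what each integration contributed.
function attributeBySource(records: IngestedRecord[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const r of records) {
    counts.set(r.data_source_id, (counts.get(r.data_source_id) ?? 0) + 1);
  }
  return counts;
}
```

Because the tag travels with every byte, pausing or disconnecting a source never orphans data — it stays attributed to the source it came through.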

What you can do

  • Register a new data source (Pipedream-backed integration, custom firehose, or custom MCP).
  • Pause a source temporarily (stops ingestion; data stays).
  • Resume a paused source.
  • Disconnect a source (revokes provider credentials; keeps the ingested data in your sandbox).
  • Permanently delete a source (removes the source row and all its ingested data).
  • Inspect a source’s status, last sync, and adapter config.

Via the Concierge

“What data sources do I have?” “Pause my GitHub source.” “Disconnect the Slack source — keep the data.”

For destructive operations (destroy, delete), the Concierge surfaces the equivalent CLI command rather than running it. See Concierge → What it won’t do.

Via the CLI

# Browse and register
copass source list
copass source show <source_id>
copass source register my-firehose --provider custom

# Provision an OAuth-backed integration interactively
copass source provision

# Lifecycle
copass source rename <source_id> "new name"
copass source pause <source_id>
copass source resume <source_id>
copass source disconnect <source_id>
copass source delete <source_id>      # permanent

Via the SDK

// Register a custom source
const source = await client.sources.register(sandboxId, {
  provider: 'custom',
  name: 'my-firehose',
  ingestion_mode: 'manual',
});

// Pause / resume
await client.sources.pause(sandboxId, source.data_source_id);
await client.sources.resume(sandboxId, source.data_source_id);

// Disconnect (keeps data) or delete (permanent)
await client.sources.disconnect(sandboxId, source.data_source_id);
await client.sources.del(sandboxId, source.data_source_id);

Ingestion modes

A data source declares how content flows in:
| Mode | Driver | When to use |
| --- | --- | --- |
| manual | Your code calls ingest when you have new content | Scripted ingestion, batch jobs, ad-hoc pushes |
| polling | Your workers call ingest on a schedule | Scheduled refresh of an external system |
| realtime | Webhook handlers call ingest on provider events | OAuth’d integrations with native webhooks (default for Pipedream) |
| batch | One-shot bulk backfill | Initial loads |
All modes use the same wire endpoint — the mode is metadata that tells Copass (and your operators) how the source is expected to be driven.
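Since the mode is plain metadata rather than distinct wire behavior, a client can validate it before registering a source. A minimal sketch, assuming only the four mode names from the table above (the helper itself is an assumption, not part of the SDK):

```typescript
// The four declared ingestion modes; all hit the same wire endpoint.
type IngestionMode = "manual" | "polling" | "realtime" | "batch";

const INGESTION_MODES: readonly string[] = ["manual", "polling", "realtime", "batch"];

// Narrowing guard: reject typos before they reach the register call.
function isIngestionMode(value: string): value is IngestionMode {
  return INGESTION_MODES.includes(value);
}
```

A guard like this is useful in scripts that build register payloads from config files, where a misspelled mode would otherwise only mislead operators later.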

Common patterns

OAuth integration

Slack, GitHub, Notion, etc. — provisioned with realtime mode out of the box. See Integrations.

Folder mirror

Mirror a local directory into the sandbox with the filesystem driver. See Filesystem driver.

Custom firehose

Register a custom source and push to it from your own code or CI pipeline. Use ingestion_mode: manual.

Custom MCP server

Plug your own MCP server in as a tool source. See Custom MCP.

Next steps

  • Ingestion — push content through a data source into the knowledge graph.
  • Integrations — OAuth’d third-party apps land as sources.
  • Custom MCP — register your own MCP server.
  • Sandboxes — the tenancy data sources live inside.