Rapidata Python SDK#

Get humans to label your data in minutes. Create labeling jobs, compare model outputs, and collect human feedback at scale.

pip install -U rapidata

The SDK has three building blocks: audiences (who labels), job definitions (what to label), and jobs (running it). Below are the common patterns.

Quick example#

ImageVideoAudioText

from rapidata import RapidataClient

client = RapidataClient()

audience = client.audience.get_audience_by_id("aud_MU1GZYoESyO")

job_definition = client.job.create_compare_job_definition(
    name="Example Image Comparison",
    instruction="Which image matches the description better?",
    contexts=["A small blue book sitting on a large red book."],
    datapoints=[["https://assets.rapidata.ai/midjourney-5.2_37_3.jpg",
                "https://assets.rapidata.ai/flux-1-pro_37_0.jpg"]],
)

job = audience.assign_job(job_definition)
job.view()
job.display_progress_bar()
results = job.get_results()
print(results)

from rapidata import RapidataClient

client = RapidataClient()

audience = client.audience.get_audience_by_id("aud_MU1GZYoESyO")

job_definition = client.job.create_compare_job_definition(
    name="Example Video Comparison",
    instruction="Which video fits the description better?",
    contexts=["A group of elephants painting vibrant murals on a city wall."],
    datapoints=[["https://assets.rapidata.ai/0074_sora_1.mp4",
                "https://assets.rapidata.ai/0074_hunyuan_1724.mp4"]],
)

job = audience.assign_job(job_definition)
job.view()
job.display_progress_bar()
results = job.get_results()
print(results)

from rapidata import RapidataClient, LanguageFilter

client = RapidataClient()

audience = client.audience.get_audience_by_id("global")

job_definition = client.job.create_compare_job_definition(
    name="Example Audio Comparison",
    instruction="Which audio clip sounds more natural?",
    datapoints=[["https://assets.rapidata.ai/Chat_gpt.mp3",
                "https://assets.rapidata.ai/ElevenLabs.mp3"]],
)

job = audience.assign_job(job_definition)
job.view()
job.display_progress_bar()
results = job.get_results()
print(results)

from rapidata import RapidataClient, LanguageFilter

client = RapidataClient()

audience = client.audience.get_audience_by_id("global")

job_definition = client.job.create_compare_job_definition(
    name="Example Text Comparison",
    instruction="Which sentence is grammatically more correct?",
    datapoints=[["The children were amazed by the magician's tricks",
                "The children were amusing by the magician's tricks."]],
    data_type="text",
)

job = audience.assign_job(job_definition)
job.view()
job.display_progress_bar()
results = job.get_results()
print(results)

Note

The curated/global audiences get you started quickly. For higher quality results, use a custom audience with qualification examples.

Core workflow#

The SDK is built around three concepts:

Audience --- A group of labelers filtered through qualification examples. Use curated audiences for quick starts or create custom ones for higher quality. Custom Audiences

Job Definition --- Configures what you want labeled: the data, instruction, response format, and quality settings. Parameter Reference

Job --- A running labeling task. Assign a job definition to an audience, monitor progress, and retrieve results. Quick Start

What you can do#

Use case	Description	Guide
Compare	Side-by-side comparison of images, video, audio, or text	Comparison example
Classify	Categorize data with custom labels or Likert scales	Classification example
Locate	Point out objects, artifacts, or regions within an image	Locate example
Draw	Color in objects or regions within an image	Draw example
Select words	Mark the words of a sentence that match an instruction	Select Words example
Free text	Collect free-form text answers from real people	Free Text example
Rank	Order a set of images, videos, or texts via pairwise matchups	Ranking example
Rank models	Benchmark AI models on leaderboards with human evaluation	Model Ranking
Continuous ranking	Lightweight ongoing ranking without full job setup	Ranking Flows