Rapidata order manager
RapidataOrderManager #
Handels everything regarding the orders from creation to retrieval.
Attributes:
Name | Type | Description |
---|---|---|
filters |
RapidataFilters
|
The RapidataFilters instance. |
settings |
RapidataSettings
|
The RapidataSettings instance. |
selections |
RapidataSelections
|
The RapidataSelections instance. |
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_classification_order #
create_classification_order(
name: str,
instruction: str,
answer_options: list[str],
datapoints: list[str],
data_type: str = MEDIA,
responses_per_datapoint: int = 10,
contexts: list[str] | None = None,
media_contexts: list[str] | None = None,
validation_set_id: str | None = None,
confidence_threshold: float | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a classification order.
With this order you can have a datapoint (image, text, video, audio) be classified into one of the answer options. Each response will be exactly one of the answer options.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. (Will not be shown to the labeler) |
required |
instruction
|
str
|
The instruction for how the data should be classified. |
required |
answer_options
|
list[str]
|
The list of options for the classification. |
required |
datapoints
|
list[str]
|
The list of datapoints for the classification - each datapoint will be labeled. |
required |
data_type
|
str
|
The data type of the datapoints. Defaults to RapidataDataTypes.MEDIA. Other option: RapidataDataTypes.TEXT ("text"). |
MEDIA
|
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
contexts
|
list[str]
|
The list of contexts for the classification. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction and options. (Therefore will be different for each datapoint) Will be match up with the datapoints using the list index. |
None
|
media_contexts
|
list[str]
|
The list of media contexts for the classification i.e links to the images / videos. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction and options. (Therefore will be different for each datapoint) |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
confidence_threshold
|
float
|
The probability threshold for the classification. Defaults to None. If provided, the classification datapoint will stop after the threshold is reached or at the number of responses, whatever happens first. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the classification. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the classification. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the classification. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the classification. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_compare_order #
create_compare_order(
name: str,
instruction: str,
datapoints: list[list[str]],
data_type: str = MEDIA,
responses_per_datapoint: int = 10,
contexts: list[str] | None = None,
media_contexts: list[str] | None = None,
validation_set_id: str | None = None,
confidence_threshold: float | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a compare order.
With this order you compare two datapoints (image, text, video, audio) and the annotators will choose one of the two based on the instruction.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. (Will not be shown to the labeler) |
required |
instruction
|
str
|
The instruction for the comparison. Will be shown along side each datapoint. |
required |
datapoints
|
list[list[str]]
|
Outher list is the datapoints, inner list is the options for the comparison - each datapoint will be labeled. |
required |
data_type
|
str
|
The data type of the datapoints. Defaults to RapidataDataTypes.MEDIA. Other option: RapidataDataTypes.TEXT ("text"). |
MEDIA
|
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
contexts
|
list[str]
|
The list of contexts for the comparison. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) Will be matched up with the datapoints using the list index. |
None
|
media_contexts
|
list[str]
|
The list of media contexts i.e. links to the images / videos for the comparison. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) Will be matched up with the datapoints using the list index. |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
confidence_threshold
|
float
|
The probability threshold for the comparison. Defaults to None. If provided, the comparison datapoint will stop after the threshold is reached or at the number of responses, whatever happens first. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the comparison. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the comparison. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the comparison. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the comparison. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_ranking_order #
create_ranking_order(
name: str,
instruction: str,
datapoints: list[str],
total_comparison_budget: int,
responses_per_comparison: int = 1,
data_type: str = MEDIA,
random_comparisons_ratio: float = 0.5,
context: Optional[str] = None,
validation_set_id: Optional[str] = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
) -> RapidataOrder
Create a ranking order.
With this order you can rank a list of datapoints (image, text, video, audio) based on the instruction. The annotators will be shown two datapoints at a time. The ranking happens in terms of an elo system based on the matchup results.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The question asked from People when They see two datapoints. |
required |
datapoints
|
list[str]
|
A list of datapoints that will participate in the ranking. |
required |
total_comparison_budget
|
int
|
The total number of (pairwise-)comparisons that can be made. |
required |
responses_per_comparison
|
int
|
The number of responses collected per comparison. Defaults to 1. |
1
|
data_type
|
str
|
The data type of the datapoints. Defaults to RapidataDataTypes.MEDIA. Other option: RapidataDataTypes.TEXT ("text"). |
MEDIA
|
random_comparisons_ratio
|
float
|
The fraction of random comparisons in the ranking process. The rest will focus on pairing similarly ranked datapoints. Defaults to 0.5 and can be left untouched. |
0.5
|
context
|
str
|
The context for all the comparison. Defaults to None. If provided will be shown in addition to the instruction for all the matchups. |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the order. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the order. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the order. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_free_text_order #
create_free_text_order(
name: str,
instruction: str,
datapoints: list[str],
data_type: str = MEDIA,
responses_per_datapoint: int = 10,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a free text order.
With this order you can have a datapoint (image, text, video, audio) be labeled with free text. The annotators will be shown a datapoint and will be asked to answer a question with free text.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The instruction to answer with free text. Will be shown along side each datapoint. |
required |
datapoints
|
list[str]
|
The list of datapoints for the free text - each datapoint will be labeled. |
required |
data_type
|
str
|
The data type of the datapoints. Defaults to RapidataDataTypes.MEDIA. Other option: RapidataDataTypes.TEXT ("text"). |
MEDIA
|
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the free text. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the free text. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the free text. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the free text. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_select_words_order #
create_select_words_order(
name: str,
instruction: str,
datapoints: list[str],
sentences: list[str],
responses_per_datapoint: int = 10,
validation_set_id: str | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a select words order.
With this order you can have a datapoint (image, text, video, audio) be labeled with a list of words. The annotators will be shown a datapoint as well as a list of sentences split up by spaces. They will then select specific words based on the instruction.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The instruction for how the words should be selected. Will be shown along side each datapoint. |
required |
datapoints
|
list[str]
|
The list of datapoints for the select words - each datapoint will be labeled. |
required |
sentences
|
list[str]
|
The list of sentences for the select words - Will be split up by spaces and shown along side each datapoint. Must be the same length as datapoints. |
required |
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the select words. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the select words. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the select words. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the select words. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_locate_order #
create_locate_order(
name: str,
instruction: str,
datapoints: list[str],
responses_per_datapoint: int = 10,
contexts: list[str] | None = None,
media_contexts: list[str] | None = None,
validation_set_id: str | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a locate order.
With this order you can have people locate specific objects in a datapoint (image, text, video, audio). The annotators will be shown a datapoint and will be asked to select locations based on the instruction.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The instruction what should be located. Will be shown along side each datapoint. |
required |
datapoints
|
list[str]
|
The list of datapoints for the locate - each datapoint will be labeled. |
required |
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
contexts
|
list[str]
|
The list of contexts for the locate. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) Will be match up with the datapoints using the list index. |
None
|
media_contexts
|
list[str]
|
The list of media contexts for the locate i.e links to the images / videos. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the locate. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the locate. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the locate. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the locate. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_draw_order #
create_draw_order(
name: str,
instruction: str,
datapoints: list[str],
responses_per_datapoint: int = 10,
contexts: list[str] | None = None,
media_contexts: list[str] | None = None,
validation_set_id: str | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a draw order.
With this order you can have people draw lines on a datapoint (image, text, video, audio). The annotators will be shown a datapoint and will be asked to draw lines based on the instruction.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The instruction for how the lines should be drawn. Will be shown along side each datapoint. |
required |
datapoints
|
list[str]
|
The list of datapoints for the draw lines - each datapoint will be labeled. |
required |
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
contexts
|
list[str]
|
The list of contexts for the comparison. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) Will be match up with the datapoints using the list index. |
None
|
media_contexts
|
list[str]
|
The list of media contexts for the draw lines i.e links to the images / videos. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the draw lines. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the draw lines. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the draw lines. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the draw lines. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
create_timestamp_order #
create_timestamp_order(
name: str,
instruction: str,
datapoints: list[str],
responses_per_datapoint: int = 10,
contexts: list[str] | None = None,
media_contexts: list[str] | None = None,
validation_set_id: str | None = None,
filters: Sequence[RapidataFilter] = [],
settings: Sequence[RapidataSetting] = [],
selections: Sequence[RapidataSelection] = [],
private_notes: list[str] | None = None,
) -> RapidataOrder
Create a timestamp order.
Warning
This order is currently not fully supported and may give unexpected results.
With this order you can have people mark specific timestamps in a datapoint (video, audio). The annotators will be shown a datapoint and will be asked to select a timestamp based on the instruction.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order. |
required |
instruction
|
str
|
The instruction for the timestamp task. Will be shown along side each datapoint. |
required |
datapoints
|
list[str]
|
The list of datapoints for the timestamp - each datapoint will be labeled. |
required |
responses_per_datapoint
|
int
|
The number of responses that will be collected per datapoint. Defaults to 10. |
10
|
contexts
|
list[str]
|
The list of contexts for the comparison. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) Will be match up with the datapoints using the list index. |
None
|
media_contexts
|
list[str]
|
The list of media contexts for the timestamp i.e links to the images / videos. Defaults to None. If provided has to be the same length as datapoints and will be shown in addition to the instruction. (Therefore will be different for each datapoint) |
None
|
validation_set_id
|
str
|
The ID of the validation set. Defaults to None. If provided, one validation task will be shown infront of the datapoints that will be labeled. |
None
|
filters
|
Sequence[RapidataFilter]
|
The list of filters for the timestamp. Defaults to []. Decides who the tasks should be shown to. |
[]
|
settings
|
Sequence[RapidataSetting]
|
The list of settings for the timestamp. Defaults to []. Decides how the tasks should be shown. |
[]
|
selections
|
Sequence[RapidataSelection]
|
The list of selections for the timestamp. Defaults to []. Decides in what order the tasks should be shown. |
[]
|
private_notes
|
list[str]
|
The list of private notes for the timestamp. Defaults to None. If provided has to be the same length as datapoints. This will NOT be shown to the labelers but will be included in the result purely for your own reference. |
None
|
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
get_order_by_id #
get_order_by_id(order_id: str) -> RapidataOrder
Get an order by ID.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
order_id
|
str
|
The ID of the order. |
required |
Returns:
Name | Type | Description |
---|---|---|
RapidataOrder |
RapidataOrder
|
The Order instance. |
Source code in src/rapidata/rapidata_client/order/rapidata_order_manager.py
find_orders #
find_orders(
name: str = "", amount: int = 10
) -> list[RapidataOrder]
Find your recent orders given criteria. If nothing is provided, it will return the most recent order.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
The name of the order - matching order will contain the name. Defaults to "" for any order. |
''
|
amount
|
int
|
The amount of orders to return. Defaults to 10. |
10
|
Returns:
Type | Description |
---|---|
list[RapidataOrder]
|
list[RapidataOrder]: A list of RapidataOrder instances. |