Dictionary
Transcript
The data dictionary and definitions for Transcripts
Fields
| Parameter | Type | Description |
|---|---|---|
| transcript_item_id | int | The Aiera-unique identifier for the transcript |
| event_id | int | The Aiera-unique identifier for the parent event |
| event_type * | str | The type of transcript |
| transcript | str | The unadjusted transcription for this segment |
| original | str | The original untranslated text (if transcript was translated) |
| is_translated | bool | Whether this transcript has been translated from another language |
| timestamp | datetime | The start timestamp for this transcription |
| start_ms | int | The millisecond offset for the start of this transcription relative to audio |
| duration_ms | int | The millisecond duration for this transcription segment |
| speaker_id | int | The Aiera-unique identifier of the speaker |
| person_id | int | The Aiera-unique identifier of the linked person entity |
| speaker_type | str | Machine-generated ("dia" or "id") or published ("final") speaker type |
| speaker_name | str | Name of the speaker |
| speaker_title | str | Title of the speaker |
| transcript_section | str | Name of the section to which the transcript segment belongs ("presentation" or "q_and_a") |
| annotated | str | The tag-annotated transcription (deprecated) |
| audio_url | str | The audio excerpt URL for this transcription |
| is_improved | bool | Whether this transcript has been improved by larger secondary ASR |
| is_edited | bool | Whether this transcript has been human-edited/reviewed |
| status | str | The status of this transcript item |
| tags * | obj | JSON object with section and classification tags |
| created | datetime | When this transcript item was created |
| modified | datetime | The last modified date for this transcription |
| linguistics * | obj | JSON object composed of linguistic-extractions, like summaries, etc. |
Possibles
event_type
| Option | Description |
|---|---|
| transcript | A live/raw transcription |
| official_transcript | A human-corrected transcription |
tags
{
"tags": {
"section": "presentation",
"is_presentation": true,
"is_question": false,
"is_answer": false,
"is_disclosure": false
}
}
| Parameter | Type | Description |
|---|---|---|
| section | str | The section name ("presentation" or "q_and_a") |
| is_presentation | bool | Whether this segment is part of the presentation |
| is_question | bool | Whether this segment is a question in Q&A |
| is_answer | bool | Whether this segment is an answer in Q&A |
| is_disclosure | bool | Whether this segment contains disclosure language |
linguistics
{
"summaries": [
{
"title": SUMMARY TITLE,
"summary": [
SUMMARY PARAGRAPHS
],
"model": (zeroshot|bullet),
"type": SUMMARY TYPE (presentation|q_and_a|everything),
"audio_clip": AUDIO CLIP URL,
"video_clip": VIDEO CLIP URL,
"priority": SUMMARY PRIORITY ORDERING,
"created": SUMMARY CREATION DATE,
"modified": LAST SUMMARY MODIFICATION DATE
}, ...
],
"topics": [
{
"topic_id": AIERA-UNIQUE TOPIC ID,
"topic": TOPIC,
"priority": PRIORITY ORDERING
}, ...
],
"auto_tags": [
TAG, ...
],
"sentiment": {
"average": AVERAGE SENTIMENT SCORE (-1 to 1),
"median": MEDIAN SENTIMENT SCORE (-1 to 1),
"minimum": MINIMUM DETECTED SEGMENT SCORE,
"maximum": MAXIMUM DETECTED SEGMENT SCORE
}
}