建立音訊藍圖 - Amazon Bedrock

本文為英文版的機器翻譯版本,如內容有任何歧義或不一致之處,概以英文版為準。

建立音訊藍圖

與影像和文件藍圖相比,音訊檔案的藍圖具有幾個獨特的品質。首先,如同影像藍圖,每個 BDA 專案只能有一個音訊藍圖。其次,您無法使用藍圖助理來建立音訊藍圖,而且必須使用其他手動建立選項之一。

以下是音訊處理的一些範例欄位。

音訊檔案的藍圖欄位範例

欄位 指示 擷取類型 Type
transcript_summary Generate a concise abstractive summary of the conversation, focusing on the main topics and key themes. Ensure accuracy by summarizing only what is explicitly discussed, without adding specific details not present in the conversation. Keeping the response within 100 words. inferred string
sentiment_summary A less than 10-word summary of the speakers' sentiments over the course of the audio transcript. Make sure to include changes in sentiment, if they occur. inferred string
topics The main topics of the audio transcript, listed as single words. inferred [string] (Array of strings)
category The category of the audio (not the topic). Choose from General conversation, Media, Hospitality, Speeches, Meetings, Education, Financial, Public sector, Healthcare, Sales, Audiobooks, Podcasts, 911 calls, Other. inferred string
spoken_named_entities Any named entities (typically proper nouns) explicitly mentioned in the audio transcript including locations, brand names, company names, product names, services, events, organizations, etc. Do not include names of people, email addresses or common nouns. extractive [string] (Array of strings)

對話分析的藍圖欄位範例

欄位 指示 擷取類型 Type
call_summary Summarize the caller-agent conversation in under 100 words. Start with the caller's request, then the agent's response and actions, ending with outcomes or follow-ups. Include key details like emails, links, or callbacks. For multiple issues, summarize each with its outcome and next steps. inferred string
call_categories The category (or categories) of the call. Choose one or more from Billing, Tech support, Customer service, Account support, Sales, Complaints, Product issues, Service issues, General inquiries, Other. inferred [string] (Array of strings)
caller_sentiment_summary A 1-3 sentence summary of the caller's sentiment over the course of the call. You must include changes in sentiment. inferred string
spoken_locations Locations explicitly mentioned in the conversation, including cities, states, and countries. extractive [string]
call_opening Did the agent greet the caller and introduce themselves at the beginning of the call? extractive boolean