You will be charged % cancellation fee
|
Please Choose |
Full Order Select Items |
Build flows and apps that convert audio files to text in Microsoft Power Platform and other automation platforms.
The “Speech to Text” action retrieves a transcript from an audio file by recognising the words used in the source based on a user-chosen language.
This action could provide a fast and efficient way for users to automate the process of transcribing or subtitling the content they upload to sites like YouTube.
Parameters
Title | Name | Type | Description |
---|---|---|---|
Language | language | string | Language of speech in file |
File | file | file | Source file |
File (file name) | filename | string | Name of file |
Response
Status | Title | Name | Type | Description |
---|---|---|---|---|
Success | Result | result | file | Transcript of audio file |
Failure | Result | result | string | Error description |
How to convert audio files to text with Microsoft Power Automate
Instructions
NOTE: The Power Platform connector framework imposes a maximum time limit restriction of 2 minutes on all action responses. This severely limits the amount of data that can be processed by the speech to text engine within that timeframe. Our testing has shown that the largest input file that can be processed in this time limit is around 1MB compressed (MP3) or 10MB uncompressed (WAV). Until Microsoft increases the response time restriction, it is advisable to break up larger files into smaller chunks under these limits and split them across multiple actions.
Example
Video
How to convert audio files to text with Microsoft Power Apps
Instructions
NOTE: The Power Platform connector framework imposes a maximum time limit restriction of 2 minutes on all action responses. This severely limits the amount of data that can be processed by the speech to text engine within that timeframe. Our testing has shown that the largest input file that can be processed in this time limit is around 1MB compressed (MP3) or 10MB uncompressed (WAV). Until Microsoft increases the response time restriction, it is advisable to break up larger files into smaller chunks under these limits and split them across multiple actions.
Example
Video
How to convert audio files to text with Nintex
Instructions
Example
Video
Instructions
If your platform is not listed and it supports Open API (Swagger) extensions, import the API Definition document from the Developer Edition product on our Customer Portal at https://portal.apptigent.com/product (look for the Open API link at the top of the PowerTools Developer API definition page). Invoke the desired actions in your app or workflow design tool, supplying values for the listed parameters. Refer to the developer documentation on the Customer Portal for details on input and output formats.
If you are developing a custom app, execute a RESTful POST operation to the /CountCollection endpoint in your application code or use the pre-generated client scaffolding from our Github repo at https://github.com/apptigent/powertools. Be sure to include your API Key (Client ID) in the header using the “X-IBM-Client-Id” key/value pair. The body should be a well-formed JSON object with the parameter label(s) and value(s) in the specified format. Refer to the API documentation at https://portal.apptigent.com for more information.
Example
const request = require('request');
const options = {
method: 'POST',
url: 'https://connect.apptigent.com/api/utilities/SpeechToText',
headers: {
'X-IBM-Client-Id': 'REPLACE_THIS_KEY',
'content-type': 'multipart/form-data; boundary=---011000010111000001101001',
accept: 'application/json'
},
formData: {
language: 'Finnish (Finland)',
file: ''
}
};
request(options, function (error, response, body) {
if (error) throw new Error(error);
console.log(body);
});