Tech blog
-
2023.08.30Differences and features between hybrid speech recognition and end-to-end speech recognition
We will explain the differences and features between hybrid speech recognition and end-to-end speech recognition, and also explain the method used by Advanced Media, taking into account the features of each.
-
2023.07.31Accuracy verification included! Introducing a speech recognition engine specialized for specific applications
We will introduce the voice recognition engine provided by AmiVoice API Private, which is specialized for recognizing names, addresses, etc. We also conducted a verification test comparing its accuracy with a general-purpose engine.
-
2023.07.10What is AmiVoice API Private SDK's "Rule Grammar" Recognition?
We will explain the differences between the "Rule Grammar" speech recognition engine provided by AmiVoice API Private and regular dictation recognition, the appropriate usage scenarios, and the advantages and disadvantages of using it.
-
2023.06.26Comparing the speech recognition rates of OpenAI's Whisper and AmiVoice for "conference" audio
We compared the speech recognition accuracy of OpenAI's Whisper and AmiVoice for meeting audio. The results showed that AmiVoice had significantly fewer misrecognitions and was more accurate. We will explain the factors behind this difference, including examples of Whisper's misrecognitions.
-
2023.05.29I implemented microphone recording in a Windows application. The first step in developing a voice recognition application!
This article explains how to implement microphone recording in a Windows application using C#. It also explains how to use the AmiVoice API to perform speech recognition on the recorded audio and display the recognized results via streaming processing.
-
2023.05.15How to choose whether to display or remove unnecessary words (fillers) with AmiVoice API
AmiVoiceAPI has a function to automatically remove unnecessary words (fillers). However, depending on the situation when using speech recognition, it may be better to display the fillers without removing them. This time, we will explain how to control filler removal.
-
2023.04.17How to convert stereo audio files into two mono audio files
When using AmiVoice API to recognize a stereo audio file, only one channel is recognized. Assuming that the right and left channels of a stereo audio file contain different sounds, this article explains how to convert a stereo audio file into two mono audio files using a tool called SoX.
-
2023.04.03Where does the voice data go after it is processed?
We will answer security-related questions frequently asked by customers considering using the AmiVoice API, such as how voice recognition processed audio is managed, and which plan to choose if the audio data contains personal information.
-
2023.03.13Measuring the speech recognition rate of OpenAI's Whisper (AmiVoice VS Whisper)
In September 2022, OpenAI released a speech recognition engine called Whisper. We compared the speech recognition accuracy of Whisper and AmiVoice.
-
2023.03.06A quick explanation of how speech recognition works!
An engineer involved in research into speech recognition will provide a rough and easy-to-understand explanation of how speech is converted into text, the mechanisms and types of speech recognition, the features of each, and how to choose the appropriate engine.
-
2023.02.06Choosing the right microphone is crucial for utilizing voice recognition. Key points for microphone use and speech
To prevent misrecognition and increase the recognition rate, it is important to choose a microphone that is suitable for voice recognition. We will introduce how to choose a microphone, how to use it, and key points for speaking.
-
2023.01.16A system for transcribing contact center calls using voice recognition (SIP edition)
We will introduce a system that uses voice recognition in contact centers to automatically transcribe calls. This is a method of capturing call audio using the SIP protocol.
Most viewed articles
- A quick explanation of how speech recognition works!
- Comparing the speech recognition rates of OpenAI's Whisper and AmiVoice for "conference" audio
- How to use the AmiVoice API free coupon
New articles
- How to use Zenn Coupon & Trial
- How to use coupons for Zenn Spring 2026
- "Speech segment ratio" as seen in operational data
Category list
- Introduction to Speech Recognition (15)
- How to improve voice recognition accuracy (12)
- I tried developing it (27)
- How to use AmiVoiceAPI(27)
- Comparison and Verification (6)
- Others(10)
