Tech blog
-
2025.12.01What is speech segment detection?
Are you familiar with "voice activity detection", a system that detects only "where a person is speaking"? Not only does it filter out noise and hold music, improving recognition accuracy, but it also has the benefit of "paying only for the time you're speaking". We'll explain in an easy-to-understand way a new perspective on choosing a speech recognition service.
-
2025.03.11Prerequisites for developing systems using speech recognition <Part 2> - Development Know-how Series 2 -
From the perspective of a speech recognition system developer, we will explain useful knowledge to know before introducing speech recognition. This time, we will cover the prerequisites for introducing speech recognition (Part XNUMX).
-
2025.03.11Prerequisites for developing systems using speech recognition <Part 1> - Development Know-how Series 1 -
When developing a system that uses speech recognition, do you have concerns about recognition accuracy or how to control it so that users can use it effectively? Here are some useful prerequisites to know before developing a speech recognition system.
-
2025.01.30How far can dialect speech recognition go? The case of AmiVoice
Even though they are dialects, there are differences in words and phrases, and in intonation and accent. We will explain what and how much AmiVoice recognizes.
-
2024.12.18Learn from scratch! How to measure speech recognition accuracy (speech recognition rate)
The developer of AmiVoice will explain in detail how to measure the speech recognition rate, which is used to check how accurately the speech is being recognized. This is useful as an indicator of the performance of a speech recognition engine, and is useful during development and when comparing speech recognition engines.
-
2024.01.24How to measure speech recognition accuracy (speech recognition rate) using jiwer
This article explains how to measure speech recognition accuracy using a Python library called jiwer.
-
2023.08.30Differences and features between hybrid speech recognition and end-to-end speech recognition
We will explain the differences and features between hybrid speech recognition and end-to-end speech recognition, and also explain the method used by Advanced Media, taking into account the features of each.
-
2023.03.06A quick explanation of how speech recognition works!
An engineer involved in research into speech recognition will provide a rough and easy-to-understand explanation of how speech is converted into text, the mechanisms and types of speech recognition, the features of each, and how to choose the appropriate engine.
-
2023.01.16A system for transcribing contact center calls using voice recognition (SIP edition)
We will introduce a system that uses voice recognition in contact centers to automatically transcribe calls. This is a method of capturing call audio using the SIP protocol.
-
2022.11.29What sampling rate is necessary for speech recognition?
This article explains the optimal sampling rate for speech recognition. It also provides a detailed explanation of how to handle sampling rates with the AmiVoice API.
-
2022.05.30What is "orthographic variation" in speech recognition?
In speech recognition, "orthographic variation" can occur between the recognition result and the correct sentence. This article explains what "orthographic variation" is and to what extent it occurs in actual speech recognition.
-
2022.03.14When can the speech recognition rate be negative?
If there are many errors, the accuracy of speech recognition will decrease, but what is the minimum value? In fact, the recognition accuracy may not only be 0%, but may even be negative. We will introduce such an example and the related perspective of "Whose voice do you want to recognize?"
Most viewed articles
- A quick explanation of how speech recognition works!
- Comparing the speech recognition rates of OpenAI's Whisper and AmiVoice for "conference" audio
- How to use the AmiVoice API free coupon
New articles
- How to use Zenn Coupon & Trial
- How to use coupons for Zenn Spring 2026
- "Speech segment ratio" as seen in operational data
Category list
- Introduction to Speech Recognition (15)
- How to improve voice recognition accuracy (12)
- I tried developing it (27)
- How to use AmiVoiceAPI(27)
- Comparison and Verification (6)
- Others(10)
