Tech blog

  • 2026.05.11

    How to use coupons for Zenn Spring 2026

    Hello! In this post, we explain how to use the free usage coupon for "AmiVoice API," a speech recognition API service. The coupon is currently available through a campaign that Advanced Media is running on Zenn.

  • "Speech segment ratio" as seen in operational data

    "How much does speech recognition actually cost?" -- To accurately understand the cost of the AmiVoice API, which charges only for the spoken portion, it's crucial to know the actual "speech segment ratio." This article analyzes the distribution of "speech segment ratios" based on real-world operational data. We'll provide concrete figures to give you an idea of ​​the costs for each usage scenario, such as call centers, smartphone apps, and conference recordings. Please use this information to help you estimate costs before implementation.

  • AmiVoice API Update Explanation: New Parameters for Voicebots Reduce Response Wait Times

    Are you familiar with the new AmiVoice API parameters, "recognitionTimeout" and "noInputTimeout," developed based on customer feedback? By utilizing these parameters, you can terminate the processing of unnecessary voice data and control the waiting time during periods of silence, significantly improving the response quality of your voicebot. This article explains how each parameter works and provides specific usage examples.

  • AmiVoice API Update: End-to-End "Keyword Biasing" Feature

    This article explains the mechanism and practical use of "keyword biasing" (word emphasis), which is now available in end-to-end speech recognition. It covers how its behavior differs from hybrid speech recognition, the key points of setting parameters, and tips for improving accuracy. It's a must-read for anyone who wants to get the most out of E2E.

  • Easily add subtitles to videos! A subtitle workflow built with a speech recognition API

    A must-read for anyone who wants to add subtitles to videos easily! We'll show you how to extract the audio, add the subtitles, and play back the video in just four steps. Using an actual press conference by the prime minister as the example, we'll carefully walk through generating subtitles with a speech recognition API.

  • AmiVoice's word registration API gives you more freedom in speech recognition!

    One of the features of the AmiVoice API is the "word registration function." Did you know there is an API for this function? The word registration API allows for flexible word registration, including integration into apps and management of multiple profiles (dictionaries). This article gives an easy-to-understand explanation of how to use the word registration API, along with some scenarios where it is useful.

  • Easily develop speech recognition apps with Dify x AmiVoice API

    Why not use the low-code environment Dify to implement a speech recognition app efficiently, without complex code? We'll show you how to intuitively build long-form speech recognition, Slack notifications, and LLM integration using the AmiVoice API. This content is also useful for non-developers.

  • What is speech segment detection?

    Are you familiar with "voice activity detection," a mechanism that detects only where a person is speaking? It filters out noise and hold music, which improves recognition accuracy, and it also means you pay only for the time someone is actually speaking. We'll explain, in an easy-to-understand way, how this offers a new perspective on choosing a speech recognition service.

  • Building a serverless web app with AWS

    An AWS infrastructure engineer, inspired by serverless technology, built a web app with a login function using AWS services, Vue3, and in-house GPT. Running costs were as low as 3 yen per month. This practical account provides a realistic look at the appeal and challenges of serverless, including comparisons with legacy configurations and actual costs.

  • 2025.09.08

    We exhibited at "AWS Summit Japan 2025"

    AmiVoice API was exhibited at the "AWS Summit Japan 2025" held in June. It introduced the latest solution that combines speech recognition and generative AI, attracting the attention of many visitors. The exhibit in the "Generative AI Course" tour was particularly successful. Here's a summary of the event.

  • [I made it!] A convenient tool with speech recognition and generative AI - No-code creation with the AmiVoice library and Power Automate

    I wanted to turn conversations with customers at exhibitions into text notes and share them! With that in mind, I created a free, no-code system using iPhone voice memos. This is a practical report on how I used Power Automate Desktop to automate the process, from converting speech to text, to creating titles using generative AI, and posting them to Teams.
