Tech blog
-
2026.01.30Easily synthesize subtitles into videos! Subtitle workflow created with speech recognition API
A must-see for anyone who wants to easily add subtitles to videos! We'll show you how to efficiently extract audio, synthesize subtitles, and watch videos in just four steps. Using an actual prime ministerial press conference as a subject, we'll carefully explain the process of generating subtitles using a speech recognition API.
-
2025.12.22Easily develop speech recognition apps with Dify x AmiVoice API
Why not try using the low-code environment Dify to efficiently implement a speech recognition app without complex code? We'll show you how to intuitively build long-term speech recognition, Slack notifications, and LLM integration using the "AmiVoice API". This content is also useful for non-developers.
-
2025.10.28Building a serverless web app with AWS
An AWS infrastructure engineer, inspired by serverless technology, built a web app with a login function using AWS services, Vue3, and in-house GPT. Running costs were as low as 3 yen per month. This practical account provides a realistic look at the appeal and challenges of serverless, including comparisons with legacy configurations and actual costs.
-
2025.08.27[I made it!] A convenient tool with speech recognition and generation AI - No-code creation with the AmiVoice library and Power Automate
I wanted to turn conversations with customers at exhibitions into text notes and share them! With that in mind, I created a free, no-code system using iPhone voice memos. This is a practical report on how I used Power Automate Desktop to automate the process, from converting speech to text, to creating titles using generative AI, and posting them to Teams.
-
2025.07.02[AmiVoice API Private SDK] How to write specific rule grammar from an application developer's perspective (Numerical Input Edition)
AmiVoice's "Rule Grammar" recognizes only expressions that follow predetermined grammar rules. We will use numerical input as an example to show how it can be used in actual application development.
-
2025.05.20[AmiVoice API Private SDK] Creating a "Rule Grammar" for Advanced Users [Advanced Edition]
We will explain advanced techniques for "rule grammar" that can be used with AmiVoice API Private SDK, such as "repetition," "private rules," and "tags."
-
2024.03.07[AmiVoice API Private SDK] Creating a Practical "Rule Grammar" [Practical Edition]
This article explains how to recognize multiple words and other aspects of the "rule grammar" that can be used with AmiVoice API Private SDK, with the aim of enabling you to write more practical rule grammars.
-
2023.12.20[AmiVoice API Private SDK] Creating a simple "Rule Grammar" [Basic Edition]
This article explains the basics of how to write "rule grammar" that can be used with AmiVoice API Private SDK, with the goal of helping you write simple rule grammar.
-
2023.11.30[For Beginners] Running AmiVoice API from Edge and Chrome - Chrome Extension Edition
We will introduce a sample Chrome extension that runs the WebSocket speech recognition API and how to create it.
-
2023.10.23[For Beginners] Running the AmiVoice API from Edge and Chrome Web Page Edition
We will introduce a sample web page that runs the AmiVoice API from Microsoft Edge and Google Chrome, and how to create it.
-
2023.05.29I implemented microphone recording in a Windows application. The first step in developing a voice recognition application!
This article explains how to implement microphone recording in a Windows application using C#. It also explains how to use the AmiVoice API to perform speech recognition on the recorded audio and display the recognized results via streaming processing.
-
2022.10.18[RPA] Convert PDF invoices to text using PAD. Avoid the pitfalls of using JavaScript and regular expressions.
We'll show you how to convert invoice PDFs into text using PAD, as well as some tricky points to avoid when running JavaScript!
Most viewed articles
- A quick explanation of how speech recognition works!
- Comparing the speech recognition rates of OpenAI's Whisper and AmiVoice for "conference" audio
- How to use the AmiVoice API free coupon
New articles
- How to use Zenn Coupon & Trial
- How to use coupons for Zenn Spring 2026
- "Speech segment ratio" as seen in operational data
Category list
- Introduction to Speech Recognition (15)
- How to improve voice recognition accuracy (12)
- I tried developing it (27)
- How to use AmiVoiceAPI(27)
- Comparison and Verification (6)
- Others(10)
