Start Typing and Press Enter To Search

From 0.99€ / minute

High quality audio and video transcription

On request 1 day delivery time

100% manual method

Prepared for further translation

Timestamps and text smoothing

Human instead of server

Admittedly, the quality of AI and deep learning based transcription tools is getting better and better. We do not shut ourselves off from these new and exciting technologies either. However, there are limitations. As long as you only want to transcribe English media files on various platforms, the result can be OK, since most of these server-based transcriptions are from the USA and the systems are trained accordingly in English, i.e. they are constantly improving. At first glance, these solutions are comparatively inexpensive, but you should not forget that you usually still have to invest your own time to check and correct the transcript. Because: at the moment, you cannot create an error-free transcript automatically. Especially not if several voices are to be distinguished, which are also likely to fall into each other words.

That is why quality takes precedence over speed: our transcripts are created in such a way that they can then be translated and recorded. We transcribe manually and in the highest quality. You do not have to invest extra time, but you can be sure that the result is correct. If you only want a simple transcription, you can also order it from us - from 0,99 € / minute - 100% human.

No hidden costs

Transcriptions at fixed prices without hidden costs, with price and satisfaction guarantee! We do not charge extra for recordings with many speakers (e.g. group discussions) or with limited sound quality.

Any language welcome

Our worldwide professional typists understand every language. Even if you need an exotic South Pacific dialect, we can do it for you! Maybe there are still languages we do not know?

Any file formats and file sizes

As long as one or more people can be heard in your audio and video document, we can transcribe it. You can upload any audio or video file, no matter how large it is.


Simple transcription

from 0.99€/min

  • 80 source languages
  • Approx 7 days delivery time
  • Simple transcription rules
  • Time stamp + 0.15€/min
  • Text smoothing + 0.49€/min
  • 3 days warranty delivery + 0.49€/min
  • or 24h express delivery + 0.99€/min


Our recommendation

from 2.12€/min

  • 80 source languages
  • 3 days guaranteed delivery time
  • Simple transcription rules
  • Timestamps included
  • Text smoothing included
  • Ready for translation
  • 24h express delivery + 0.99€/min


For high demands

from 2.77€/min

  • 80 source languages
  • 3 days guaranteed delivery time
  • Advanced transcription rules
  • Timestamps included
  • Text smoothing included
  • Ready for translation
  • With proofreading (4-eyes-principle)
  • 24h express delivery + 0.99€/min
Experience and quality

AIVOX guarantees an accuracy of at least 99% for clearly audible audio and video files. We guarantee this by using the best specialists and applying a strict system of checks and quality controls.

As a highly qualified team of professionals, we offer the transcription of audio or video recordings at a very reasonable price per minute. In addition to years of experience in listening to and transcribing the original recordings, our employees share their know-how in various disciplines and working techniques. We take on transcriptions in 80 languages, the particular characteristics of each language being taken into account in the typeface design.

Our team works quickly, reliably and with absolute confidentiality. We treat the contents of the files sent to us with the strictest confidentiality, so that internal company information, unique names, patient information, statistical evaluations and similar information are used exclusively for the requested transcription.

Smoothing the transcript

Our transcriptions reproduce the spoken word as closely as possible to the original. In order for this text to be translated and recorded in other languages, for example, a so-called smoothing process must be carried out.

This involves correcting grammatical errors and sentence breaks, for example.

80 languages

We currently carry out transcriptions in 80 languages. In all world languages (e.g. English, French, Spanish, German) we offer this at the same price. We also regularly transcribe larger orders in the following languages without advance notice: German (with dialects), Swiss German, English (with dialects), Spanish, Portuguese, French, Italian, Danish, Swedish, Polish, Turkish, Arabic, Russian.

For the other 60 or so languages in our portfolio, we need some lead time. The reason is that orders in these languages are rather rare and we have to activate our local staff for each project. However, experience shows that we can still deliver your transcript in these languages within a maximum of one week!

Time stamp

Normally the time stamps are inserted each time a speaker is changed. The usual format for this is #hhh:mm:ss#.

Alternatively, a timestamp can be set after certain time intervals (e.g. every minute) or in special formats (e.g. #hh:mm:ss-ms# for MAXQDA).

The time can be either the recording time or a time inserted into the video recording (e.g. for film recordings). Simply tell us your requirements in the comment field when placing your request.

What does the term "transcription" mean?

The term taken from Latin is composed of the words "trans" and "scribere". It means as much as "written over" and describes the translation of a spoken text into writing. This process is called "transcribing" by the person in charge of a transcription.

In practical terms, "transcribing" today means the transfer of words spoken on a recording medium into written text. These can be audio or video recordings that are converted into text.

The activity is called "transcribing", the result "transcript".