Archive for the ‘Technology and software’ Category

Making e-learning effective worldwide: Achieve real learning success with professional localisation

Posted on: November 27th, 2025 by Frank Wöhrle No Comments

E-learning is considered a central pillar of continuing professional development in many companies – from global onboarding courses to complex product training. At the same time, projects repeatedly face similar challenges: Content that works extremely well in the country of origin loses its impact in other markets, is misunderstood or simply not used. The reason for this rarely lies in the didactic concept itself, but rather in the type and quality of localisation.

Why companies rely on e-learning

From a business perspective, numerous factors speak in favour of digital learning formats. Employees can learn flexibly – regardless of location, time zone and device, which suits geographically dispersed teams.

E-learning supports independent learning at any time: Content is available on demand, without having to rely on specific training dates. Thanks to their modular setup, learning units can be clearly structured, specifically combined and, if necessary, updated gradually.

Another advantage is that the learning pace is down to each individual: Employees can pause, repeat or delve deeper into complex content without disrupting anyone else’s flow. In addition, digital training makes it possible to create customised learning content – tailored to specific roles, regions or target groups within the company.

Multimedia elements such as videos, animations, interactive exercises and quizzes create a rich learning experience and increase engagement. Offering content in multiple languages contributes significantly to accessibility and, for international workforces, makes real headway in terms of removing barriers to learning.

Ultimately, when these factors are successfully implemented, they lead to increased learning success – measurable in terms of knowledge transfer, application in daily work and reduced error rates.

Complexity of modern e-learning formats

In practice, it quickly becomes apparent that e-learning courses are significantly more complex than traditional training materials. A typical module includes slides or screen recordings, embedded videos, spoken commentary, subtitles and interactive elements such as quizzes, conversation simulations and the like.

When it comes to localisation, this means that content to be translated is not contained in a single file or format, but is distributed across a variety of authoring tools such as Adobe Captivate, Articulate Storyline, Articulate Rise, iSpring, Elucidat, Lectora etc., SCORM packages, video and audio scripts, and, where necessary, external sources (e.g. course descriptions, content in files linked to the course, where applicable) and, in some cases (video and audio scripts), still need to be transcribed before localisation. In addition, there are technical requirements – such as support for character sets, space restrictions in buttons, and synchronising subtitles and voiceovers.

Underestimating the complexity of this process often leads to problems during the project: Missing text exports, texts with similar content in different formats, untranslated UI elements or videos that have to be edited retrospectively at great expense. For a localisation process to run smoothly, therefore, a structured approach that takes all components into account from the outset is crucial.

Learning in your native language: an efficiency factor

From a didactic perspective, it is well documented that learning content is best internalised when delivered in one’s own mother tongue. Learners then need to expend less cognitive effort in understanding the language and can concentrate more on content, context and application.

This is particularly important when dealing with complex, security-related or legal issues in order to avoid misunderstandings and misinterpretations. Emotional access also plays a role: Language can influence how credible, esteemed and motivating a training course is perceived to be.

For companies, this means that even employees with good foreign language skills benefit from training in their native language – namely, by making faster and more stable progress in their learning. Those who systematically utilise these positive consequences are able to significantly increase the effectiveness of global learning programmes and, at the same time, justify the investment in localisation.

 

Lektorin sitzt lächelnd mit Headset an Schreibtisch in modernem Büro.

The role of professional specialist translations in e-learning localisation

In order for e-learning courses in other languages to achieve the same learning objectives as the original, a basic word-for-word translation simply cannot do the job.
Native-speaking specialist translators combine linguistic competence with industry knowledge and are familiar with the terminology and common phrases used in their respective fields of expertise.

They ensure that technical terms are used consistently, instructions are clear and action-oriented, and didactic subtleties are preserved.
At the same time, they adapt examples, metaphors or references if these are not readily translatable with regard to either culture or context.

Professional translation therefore makes a significant contribution to learning objectives being achieved quickly: Content is easier to understand, easier to remember and more likely to be put into practice.
A clearly defined terminology and review process also supports company-wide consistency, both in terms of corporate documentation and individual learning outcomes. This is especially the case when dealing with a vast number of courses and a multitude of languages.

The importance of professionally localised audio and video

Audio and video are key carriers of information and sources of motivation in modern e-learning courses – and pose particular challenges for localisation.
Voiceover texts must be translated in such a way that they match the visual material in terms of tone, length and rhythm, while at the same time being technically accurate.

For voiceovers, you also need to select suitable narrators or satisfactory AI software to suit your corporate image and target audience.
In addition to the voice, elements such as the narrator’s gender, age, pronunciation quality, accent and/or dialect, and any background sound such as music, etc., are crucial in order to avoid misunderstandings and convey an entirely professional feel.

The client’s specifications regarding desired pronunciation, use of abbreviations, accessible (i.e. gender-neutral) language, and so on, are essential, as the client’s satisfaction will be strongly tied to how well these requests are implemented.

Subtitles, on the other hand, must be precise, easy to read and synchronised with the spoken word. The wording must be concise as well as complete, with some rephrasing required.
Last but not least, visual elements – such as text overlays or UI screens – may need to be adapted to ensure that they remain understandable in the target language and fit in with the design in terms of form.

Licence for e-learning

Last but not least, when using human voices for voiceovers, the customer must also clarify the intended use and reach. Are the e-learning courses with voiceover to be used exclusively internally or also publicly? Are there any plans to sell the courses commercially to third parties?
Depending on the type of transmission and the type of media/rights usage, the recording studio involved in the project may charge a licence fee per voice artist used. In most cases, these are flat fees with indefinite validity.

Summary: Localisation as an integral part of the e-learning strategy

E-learning can only reach its full potential when content is tailored to the specific language and culture.
Companies wishing to roll out e-learning courses internationally would do well to consider localisation as an integral part of the conceptualisation process from the outset – rather than as a downstream translation step.

The combination of didactically excellent courses, native-language specialist translation and professionally localised audio and video elements forms the basis for true learning success in multiple languages.
This makes global training programmes consistent, efficient and effective – and fulfils the requirement to make knowledge accessible across the globe without ever compromising on quality.

Get in touch to find out how we can help in ensuring your learning content achieves exactly the international impact you want – we speak your language!

Terminology in the translation process: Why it’s more important than ever in the age of AI

Posted on: October 30th, 2025 by Frank Wöhrle No Comments

Terminology work has always been the key to consistent, high-quality translations. The advent of AI and large language models (LLMs) is fundamentally changing translation processes. But this is precisely what makes terminology even more important.

“AI now translates everything perfectly – so why do we still need terminology management?”
As a language service provider, we are increasingly hearing this question being posed. But anyone who has ever seen how a single mistranslated technical term can distort a product description, manual or marketing message knows that terminology is not a side issue – it is the foundation of high-quality translation.

In the age of LLMs and generative AI, the how of handling terminology is changing. The why, however, remains unchanged.

Why terminology is the backbone of every translation

Terminology is much more than just a dictionary. It defines how a company talks about its products, services and values.

Whether we’re talking about “controllers”, “control modules” or “control units”, the right term ensures recognition, trust and legal certainty.

Without consistent terminology, inconsistencies are bound to arise. In practice, this leads to:

  • different translations for the same term,
  • increased correction effort,
  • unnecessary additional costs, multiplied by the number of target languages,
  • inconsistent brand communication,
  • misunderstandings among customers or users.


Consistent terminology management is crucial for ensuring consistency and precision in the translation process, especially when dealing with high-volume, multilingual content from the fields of technical documentation or marketing.

From glossaries to integrated solutions: Successful translation processes thanks to terminology

In the past, terminology work was often handled outside of the actual translation process – in the form of Excel spreadsheets or static lists. Today, it can be seamlessly integrated into translation management systems (TMS).
This enables:

  • automatic terminology suggestions directly in the CAT tool,
  • terminology checks during translation,
  • centralised maintenance and approval processes.


This makes terminology a living part of the workflow, not just an afterthought during quality control.

How AI and LLMs are changing terminology work

AI systems and LLMs open up new possibilities for maintaining terminology in a more dynamic and intelligent way. Some specific applications may include:

  • AI terminology extraction:
    AI can quickly analyse multilingual texts to automatically recognise relevant technical terms and suggest them as term candidates. This saves time during the creation phase and helps to identify terminology that has not been taken into account previously. However, final validation remains the task of human experts.
  • Building a terminology database:
    If translations or a defined structure have yet to be established, generative AI can support the creation of a terminology database. This allows variants and synonyms to be clustered efficiently, while metadata such as context, grammatical information or suggested definitions are generated automatically. However, the final review and validation stages are still handled by humans.
  • Terminology checks by AI:
    Terminology errors identified by a rule-based check are sent to the AI, where they are evaluated and corrected in their overall context, taking into account additional terminological information.


These new approaches make terminology work faster, more scalable and more data-driven. At the same time, it remains dependent on human validation – because AI does not automatically understand corporate language or brand values.

Limitations and risks: When AI ‘invents’ terms

As powerful as LLMs are, they also pose a real risk. This is because an AI model can:

  • ’hallucinate’ terms – i.e. it can create plausible but incorrect terms,
  • overlook customer-specific requirements if these are not clearly specified in the prompt or system,
  • confidential terminology data is at risk if it is fed into publicly accessible systems.


The conclusion? AI can support, but not decide. Human expertise remains indispensable when deciding whether a term is terminologically correct, brand-compliant and contextually appropriate.

Best practices: How we combine human expertise with the power of AI

As a language service provider, we see the added value in using technology sensibly – not automating everything blindly. Successful terminology work in the age of AI is based on five principles:

  • Centralisation:
    All terminology data belongs in a central database – not in miscellaneous lists scattered far and wide.
  • Integration:
    Terminology must be directly linked to CAT tools so that translators can access it in real time.
  • AI as a support, not a substitute:
    AI tools can assist with research, extraction and checks – but final validation remains in human hands.
  • Security-conscious:
    Sensitive terminology data should only be processed in data protection-compliant, controlled systems.

 

Terminology remains strategic corporate knowledge

Artificial intelligence and large language models are fundamentally changing how we work with language – but they are no substitute for terminology management. When used correctly, they actually make it more efficient and intelligent. Terminology is a company’s linguistic memory.
Particularly in the age of generative AI, clear and well-defined terms are crucial to ensure that man and machine truly speak the same language.

Contact us if you want to build your terminology efficiently, maintain it consistently and optimise it with AI support – we will support you every step of the way.

Learn more about our services in combination with AI for efficient terminology management

Transit NXT Service Pack 18: Smart functions for translation workflows – now available!

Posted on: September 9th, 2025 by Frank Wöhrle No Comments

With Transit NXT Service Pack 18, the STAR Group is introducing a powerful extension to its translation memory system – a real game changer for professional translators and project managers!

What’s new? – Highlights at a glance

  • Various improvements: The update enhances existing functionalities to make translation even more efficient. A variety of user requests have been implemented
  • Translation of variables in InDesign documents: A particularly exciting addition is that variables in InDesign documents can now be translated directly. This gives translators additional flexibility when dealing with complex layout files.

Why is it worth upgrading to Service Pack 18?

  • Machine translation: DeepL Pro now also supports language variants in glossaries (e.g. for English, Portuguese and Chinese). For Textshuttle, you can now control whether terminology from project dictionaries is transferred to Textshuttle or not.
  • Project exchange: Transit NXT now supports Phrase projects. Users can unpack MXLIFF files directly, translate them and import them back into Phrase.
  • Optimised web search: In the integrated web search, the prioritisation of services has been optimised in order to obtain initial results even faster.

Which stakeholders benefit most from SP 18?

  • Professional translators who frequently work with DTP tools such as InDesign and want to edit the content of variables efficiently.
  • Project managers who want to equip their teams with an even more powerful CAT environment.
  • Companies that strive for high quality, speed and flexibility in localisation.

Follow this link to download the Service Pack

You can find more information at: https://www.star-deutschland.net/en/technology-and-software/software-products/

Transit NXT: The underestimated CAT tool that has the professionals convinced

Posted on: July 2nd, 2025 by Frank Wöhrle No Comments

Anyone who regularly works with CAT tools (computer-aided translation software) probably thinks of Trados Studio, memoQ or Across first. One name is often overlooked – and unfairly so: Transit NXT: the underestimated CAT tool from the STAR Group. It is a genuine powerhouse for anyone who wants their work to be structured, consistent and terminology-focused.

What actually is Transit NXT?

Transit is a professional CAT tool that has been on the market since the 1990s. It combines classic segmentation with a project-orientated working method – incorporating translation memory, terminology management, preview options, quality checks and various functions designed specifically for technical documentation.
The extensive and growing portfolio of AI features, which are demonstrated in a series of short videos on our YouTube channel, are not to be missed.

5 reasons why so many professionals have put their trust in Transit NXT for years

1. Up-to-date and contextualised terminology

Transit works seamlessly with TermStar. Live terminology entries are displayed to translators within the editor itself – including the definition, context and source of the term. This extensive integration is a clear benefit over those tools where the terminology often features only in the sidelines.

2. Project structure, not file chaos

Unlike other CAT tools, Transit thinks in terms of projects with a clear-cut file structure. This takes the hard work out of managing big or lengthy translation projects – especially when it comes to regular updates or complex workflows.

3. Need technical formats? No problem with Transit.

Whether DITA, XML, FrameMaker, InDesign or XLIFF – Transit leads the way when it comes to the variety of natively supported file formats. Many other tools need extra modules or conversions to handle these files.

4. Local installation – full data sovereignty

Transit NXT works entirely locally – without any cloud obligations. For companies that have high data protection requirements, this is a crucial advantage over cloud-based solutions.

5. Quality assurance at the highest level

With automated checks, an in-context preview function and variant check, Transit NXT offers precise quality management for an impressive level of efficiency that is especially beneficial for those handling technical content.

 

Transit Software Bedienoberfläche

Who is Transit most suited to?

  • Technical translators working with complex formats.
  • Public authorities, industrial companies and service providers who need to keep sensitive data locally.
  • Freelancers who attach great importance to a reliably maintained terminology.
  • Translation agencies that want an efficient tool for managing large structured projects.

Sound good?

Transit NXT is no entry-level tool – but that is precisely what makes it a great option for anyone who values structure, terminology and format variety.

If you want to see for yourself how Transit works, simply request a non-binding trial version now.

STAR is a certified SCHEMA ST4 translation service provider

Posted on: May 30th, 2025 by Frank Wöhrle No Comments

We have successfully completed the training to become a certified translation service provider for the SCHEMA ST4 content management system. As such, STAR Deutschland is now an official certified translation service provider for SCHEMA ST4.

What is SCHEMA ST4?

SCHEMA ST4 is a professional content management system that more and more companies are turning to when producing technical documentation. It assists users in the creation, management and publication of multilingual product documentation (manuals, instructions, catalogues, online guides, etc.).

SCHEMA ST4 is an XML-based editing system that separates the layout from the textual content. In technical documentation, this is very beneficial when reusing text fragments and when managing multiple languages and versions.

SCHEMA ST4 finds application in a broad spectrum of industries, e.g. in the automotive sector, in mechanical and plant engineering or in pharmaceuticals. One major benefit of this system lies in the extensive optimisation of the translation process, which in turn reduces costs.

Training content and key training topics

The “Translation Management” training programme covers the various steps of the translation process, namely:

  1. Selecting the right text fragments
  2. Exporting the text content for translation, if necessary using COTI
  3. The subsequent import of the translated content into SCHEMA ST4

The training also offers insights into potential challenges that may be encountered, both in terms of the editing and the translation.

Translation process for SCHEMA ST4 content

The SCHEMA ST4 content management system is one of the most frequently implemented solutions in technical editing among STAR’s customers.

Let us assist you with our in-depth knowledge of the SCHEMA ST4 translation interface and the related processes.

Get in touch now with no obligation!

Translate more efficiently with AI – but how?

Posted on: February 26th, 2025 by Frank Wöhrle No Comments

AI: What started as a buzzword, and then became an established term in everyday language, is now a basic requirement for many applications and processes. And this technology is not stopping at the language industry either. Since the launch of ChatGPT we know that translating can now also be completely interactive. Large Language Models, also know as LLMs, in chatbot form are now flooding the market. It feels as though a new model is popping every week, announcing its intention to outdo its competitors in terms of efficiency, quality and reliability. Neural machine translation (NMT) doesn’t seem that old – and yet we are already discussing when this technology will disappear from the market and be replaced by generative AI.

The key question is: I want to translate more efficiently with AI – but how?

AI for targeted optimisation of translation quality

Even though the technology has made significant progress over the last five years, the results of the commonly used and established NMT systems are not always good enough. This can have a variety of causes:

  • The desired language combination has not been trained with sufficient material or goes via a pivot language (often English). This can lead to structural problems or errors in meaning.
  • The MT system does not recognise specialist or customer-specific terminology.
  • The MT system was used for content in which style is extremely important or the translation needs to be targeted towards a specific target group.


Manuals, marketing texts or content with high customer visibility therefore often do not achieve the desired levels of quality through machine translation alone. Language professionals then optimise the machine-generated texts as part of a post-editing process. Machine translations are carefully checked, compared with the source text and corrected if necessary.

As a central translation platform, the CAT tool enables users to work efficiently and offers targeted support for quality assurance thanks to a range of automated features. But where exactly is AI being used here? LLMs such as ChatGPT from OpenAI are perfectly capable of producing translations that, like DeepL or Google Translate, provide a good starting point for further processing, depending on how it is to be used.

However, a significant leap in quality can be achieved by improving the translation requests through the targeted use of prompts and the addition of reference files. To achieve this, however, in addition to a well thought-out prompt engineering design, validated translation resources in the form of translation memory and terminology databases are a fundamental prerequisite.

 

AI for better translation resources

As with any new technology, a question often arises: What can AI do for me?

However, if you want to integrate AI into your language processes in the long term, you should first ask yourself: What can I do for AI?

Well-maintained translation resources make a significant contribution to improving the results of your AI solution. Take the topic of terminology, for example. If you use a generic system such as DeepL for your translation processes, you will receive translations that do not match your company terminology – unless you integrate a glossary.

Are you only at the stage of establishing your terminology but don’t want to miss out on the benefits of MT? Use language models to extract potential terminology from your monolingual or multilingual documents. You can also use AI to check your translation memory databases, for example to find inconsistent translations or to automate clean-up or correction across large data sets. Use these resources consistently to increase the translation quality of your language model or improve the output of NMT systems.

AI as co-pilot? Reach your destination safely with the new STAR webinar series

As you can see, we are extremely enthusiastic about the topic of AI – and we don’t claim to be reinventing the wheel. However, the technology offers a lot of potential for optimisation if it is used efficiently and sustainably.

Of course, we would like to share our enthusiasm with you and invite you to our webinar series “AI as co-pilot: Forging new paths to smart language processes” starting in March. All the webinars will be held in German only.


How exactly does generative AI actually work? What advantage does it offer for the translation? How can I use language models to create product texts? Can I train my own AI? And what actually happens to my data?

Our Language Technology Consultant Julian Hamm answers these and many other questions and discusses the many different uses of generative AI, including translation, terminology management, content creation and content delivery. You can expect the following content in the first group of topics:

  • Was ist generative KI, und wofür kann ich sie einsetzen? (What is generative AI and what can I use it for?)
  • Wie kann KI bei der Übersetzung unterstützen? (How can language experts benefit from AI?)
  • Wie kann ich KI für die Terminologiearbeit einsetzen? (How can I use AI for terminology work?)
  • Welche Vorteile bietet KI für die Content-Erstellung? (What advantages does AI offer for content creation?)


Further information on the events and the registration form can be found here.

We look forward to you joining us!

Transit NXT Service Pack 17: New AI functions and more

Posted on: January 31st, 2025 by Frank Wöhrle No Comments

With the latest Transit NXT Service Pack, you can benefit from a host of new features to speed up your processes.

New file formats

Transit NXT has been once again expanded to include the very latest file formats with Service Pack 17. Documents from InDesign 2025 and drawings up to AutoCAD 2025 can now also be translated as optional file versions. Documents from the Google Docs Editors Suite are now also officially supported as another file format.

Machine translation

Integration of Amazon Translate.

With the addition of another MT system, translations can now also be requested via Amazon Translate – interactively directly in the translation editor, or as an option automatically when importing the project. New functions are also available for DeepL, Systran and Textshuttle.

Project management

Professional support in the editor thanks to integrated MS Word grammar check, AutoCorrect and AutoComplete functions.

TermStar

For TermStar, we are focusing on terminology export for this Service Pack: For one, TBX version 3 is now officially supported. What’s more, this Service Pack also makes it possible to export multimedia files (e.g. graphics or videos) from dictionaries to certain formats.

New editor functions

Translators can look forward to additional helpful editor functions:

The AutoComplete function makes it quicker to enter words and phrases with project-specific suggestions from dictionaries and translation memories.

AutoCorrect corrects typical typos, typographically converts quotation marks and makes it possible to use shortcuts to enter special characters and frequently used phrases. Date and number formats as well as alphanumeric strings can now be adapted to the target language format with a simple mouse click.

For quality assurance, the translation can now also be checked for correct grammar and corrected interactively. What’s more, Russian for Kazakhstan is now available as an additional working language.

Follow this link to download the Service Pack

You can find more information at: https://www.star-deutschland.net/en/technology-and-software/software-products/

2024 – the year of AI: Important developments and lessons learned

Posted on: December 16th, 2024 by Frank Wöhrle No Comments

Another year is drawing to a close, and we can hardly believe how fast the time has flown by. Now is a good opportunity to take a look back at all of the important developments that 2024 – the year of AI – has brought us, and give you an insight into what next year has in store for us.

AI has been a hot topic ever since OpenAI stunned the whole world with ChatGPT. Companies are increasingly insisting on using AI wherever this seems possible. From many discussions and exciting customer projects over the course of the year, we have identified key lessons learned and trends in this field.

Five key trends relating to the use of AI in the context of translation

  • Expectations for generative AI remain very high.
    However, the purposes for which people want to use it differ greatly, especially in language processes: From the fanciful idea of a wonder machine which produces, translates and optimises texts so they are perfect, through to a clever tool that provides targeted assistance with specific tasks that are usually performed manually at present. The increasing integration of large language models into translation processes makes exactly this possible by providing these with targeted and modular support. This ranges from the bilingual extraction of terminology and the post-editing of machine-translated content, through to assessing the quality of multilingual documents.
  • If you want to use the terminology efficiently and sustainably, you also need high-quality, well-structured language resources to be able to supply the language models with relevant information.
    This means that years of working with translation memory and terminology management systems now offers double the benefits. If this data is prepared in a structured and sustainable manner, language models can use it to optimise machine-translated content, for instance in the form of retrieval-augmented generation (RAG).
  • The topic of data protection continues to generate extreme uncertainty despite the adoption of the EU AI Act in May 2024.
    Many companies are looking for ways to use AI in the most secure possible way in order to protect their precious data against misuse.
  • A lot of businesses are experiencing issues with the scalability of AI solutions, whether this concerns the IT infrastructure, financial resources or further training of staff.
  • Human in the cockpit. People will increasingly return to the centre of the AI-based translation workflow.
    While translators were previously responsible for the post-editing of predefined machine-translated content, among other tasks, as part of human in the loop concept, the new human in the cockpit principle aims for translators to use modern language technologies – even interactively – in order to exert their own influence on the output and to create efficient design processes.
    The technological transformation is also resulting in changing requirements for current and future language experts. The relevant universities have also recognised these developments and are revising the degrees and courses they offer accordingly. For instance, prompt engineering, language technologies and information management are important focal topics that will feature more often on the curriculum in future.

Are you interested in this subject? Then don’t miss our STAR webinar, which is scheduled for early 2025. There, we will be sharing information about current trends and our latest technological developments.

Certified processes: The foundations for AI integration you can trust

Posted on: November 25th, 2024 by Frank Wöhrle No Comments

This year, STAR Deutschland GmbH once again welcomed its independent certification partner LinquaCert to its Sindelfingen office for the ISO 18587:2017 surveillance audit (“Post-editing of machine translation output”) shortly before the annual tekom conference. We are pleased to confirm the successful recertification in line with this standard that relates explicitly to quality assurance in the production of machine translations.

Spotlight on terminology integration and automation in quality assurance when incorporating AI into translation processes

As well as active discussions on qualifications, training measures and quality measures, there was once again a real need to discuss the integration of generative AI into translation processes.
The spotlight was shone primarily on the topics of terminology integration and automation in quality assurance that provide more tailored support to the linguists delivering MT post-editing projects and are designed to reduce the processing effort.
As a longstanding technology partner and language service provider, we embrace current trends and give our translators the expertise they need to be able to work efficiently and in a future-oriented way.

Missed tekom, but want to know more about AI? Our STAR webinar exploring “Augmented Translation” offers a quick insight into the latest developments in language technologies. Register here to be sent the webinar recording: https://www.star-deutschland.net/en/language-management-consulting/machine-translation-and-post-editing/star-webinar-augmented-translation/ 

Want to get the most out of modern language technologies and are committed to delivering high-quality translations? We have the right service for you: https://www.star-deutschland.net/en/language-management-consulting/machine-translation-and-post-editing/

AI for voices, voice recordings and voice-over translations

Posted on: October 28th, 2024 by Frank Wöhrle No Comments

Can AI help to create high-quality content in any language while adhering to corporate language and specific rules?

Today we’re interviewing David Heider, the owner of a STAR partner sound studio in the Czech Republic, to shed light on this fascinating question – can artificial intelligence be effectively used in the area of video and audio productions?

STAR: David, when did you start offering professional audio productions?

Our recording studio has been providing its services since 1999 and we’ve specialised in the spoken word. We cover two different areas. Firstly, the “corporate world”, with recordings of material for internal purposes, such as e-learning. This also includes localisation of internal company systems and software. This can be either training material or various web-based platforms with voice output or automatic operators on your phone, sat nav, etc.– in short, various applications where we often have to cut the sound word by word or even syllable by syllable and where everything is then put together by a system into sentences and whole messages.

The second area is more artistic in nature and covers advertising and promotional videos, among other content. This area differs from the “corporate world” previously mentioned in that it’s not just about conveying content, but rather about a form that appeals to listeners and attracts them. So we need professionals who can express themselves artistically and use their voice skilfully. To summarise, you might say that our first area of action is to provide information. This is about content where users, to put it more clearly, don’t have much choice, as they generally have to listen. In contrast, artistic productions aim to seduce the “audience” in some way, not only in terms of content but also their form.

Tonstudio

STAR: This inevitably leads me on to the next question – can AI be used in your work?

AI is an amazing tool and offers numerous advantages. For example, we don’t need to contact a voice-over artist and make an appointment; the AI is always available.

STAR: Are you already using AI?

Yes. We use AI in some cases for preparing and producing audio material. But there’s also a downside. In most languages, the AI voice seems artificial or boring, especially after listening to it for a long time.

STAR: Can’t AI intonate?

Intonation in itself isn’t usually a problem, but the AI does it in unnatural inflections, which is really inconvenient. Often it doesn’t emphasise the core message, which a person would normally express through a particular emphasis. And when you listen to an AI recording, you get this unnatural inflection on repeat that starts to get annoying after a while, because you can’t shake the feeling that it’s actually just “copy-paste”. In comparison, I find it much better in English than in other languages, where the AI can work with variable intonation and make the voice sound very natural and lively. But in all the other languages, we still have a long way to go before we reach that point. At the moment, the other languages still sound very “plastic”.

STAR: Are there any other disadvantages to AI voices?

There’s a second point that I think is more serious, especially with e-learning. As with any AI, the quality of the output depends on the quality of the input. You also always have to prepare the content correctly for AI voices. Perhaps the AI doesn’t read all the abbreviations correctly, e.g. in the same way as you would read them in a specific corporate culture. Every company has its own corporate jargon and the AI won’t take this into account. This also applies to different product names, place names and foreign words. For example, if French names appear in English text, should it be read in French or English?

STAR: How can this be explained?

Only the employees at a company are really familiar with the corporate language and know why a certain linguistic rule can sometimes be ignored for internal company content or marketing reasons. And the listeners are insiders, i.e. they usually know what the content’s about. Companies also have to be consistent, otherwise it will sound strange to their ears. Sometimes, of course, a term or abbreviation can be misunderstood, either phonetically or for names, but that’s just the way it’s done at the company and we should respect it.

STAR: What other challenges are there?

Abbreviations and other specific features are a major challenge for AI. They usually need a lot of adjustments and corrections, which can result in the final price being similar to that of a traditional voice-over. We need to create pronunciation tips or edit the text so that it’s easy for the AI to read. This is very time-consuming – so AI makes little sense for a one-off project. In addition, we also “proof-listen”, i.e. do a listen-through to check, after the AI.

STAR: Don’t you “proof-listen” for human speakers too?

If there are two of us in addition to the speaker during the recording, we don’t do this any more because we can hear and check everything during the recording. The exceptions are languages that we don’t understand, such as Asian languages. But, in the case of AI, we don’t know beforehand what it knows and what it can read. I’ll give you an example. Let’s take the unit of a “megapascal”. This term has the abbreviation “MPa”, and the AI can read it as “em-pee-ay”, which is complete nonsense to a technical expert. So we’ve got to figure out how to get the AI to read it correctly as “megapascal”.

Sometimes we go through the recording and it seems right to us, but then the customer finds something that doesn’t fit their corporate culture. That’s why, while I think AI is a useful tool in certain informational texts that can make work faster and cheaper, and I’m happy to recommend it, in the hands of an inexperienced user, AI can behave unpredictably, and the end product will cause more disappointment than enthusiasm about the resources saved.

STAR: Is there a financial difference?

Yes, using AI reduces the budget to around half or two-thirds, as the work is mainly done by a machine and no voice professionals are involved in the process.

STAR: What do you do if a recording isn’t suitable for AI?

We are the guarantor of quality, and if we have serious and justified doubts about whether AI will lead to the right result, we’ll inform the customer. But customers also want to have personal experiences of this. I then try to point this out first by saying, “don’t be disappointed, but I don’t think AI is suitable for this particular project.” When I feel that I’ve outlined everything, I leave the decision up to them. But in some cases, customers themselves are unsure and are grateful for our support.

STAR: Thank you, David, for this very interesting discussion about AI in audio recordings.

Bild von David

AI voices aren’t yet perfect, and human voices are still winning the race. They’re able to convey emotions and leave a strong impression. However, AI voices are an inexpensive alternative. Please feel free to contact us for our advice.

David Heider,
owner of a STAR partner sound studio in the Czech Republic