How to translate content programmatically using AI and TransformersPHP

WBOY
Release: 2024-09-01 06:32:07
Original
838 people have browsed it

In this article, I'll show you how to translate content programmatically with PHP using the TransformersPHP library.
Translating text is essential for reaching a global audience and ensuring your content is accessible to speakers of different languages.

Step 1: Set up the project

To get started, please make sure you have the TransformersPHP library installed. You can install it via Composer by running:

composer require codewithkyrian/transformers
Copy after login

During the installation, you have to answer a question:

Do you trust "codewithkyrian/transformers-libsloader" to execute code and wish to enable it now? (writes "allow-plugins" to composer.json) [y,n,d,?]
Copy after login

You'll need to answer yes to enable the Composer plugin to download all shared libraries necessary for TransformersPHP.

Once installed, require the autoload file to load all necessary classes and dependencies:


        
Copy after login

Step 2: Import the necessary classes

Next, you’ll need to import the relevant classes and functions that handle translation:

use Codewithkyrian\Transformers\Transformers; use function Codewithkyrian\Transformers\Pipelines\pipeline;
Copy after login
  • Transformers: This class manages the setup and configuration for translation models.
  • pipeline: This function initializes your specific translation pipeline.

Step 3: Initialize the Transformers class

Before translating content, you must configure the Transformers class:

Transformers::setup()->setCacheDir("./models")->apply();
Copy after login
  • setCacheDir(): This method defines the directory for caching models, which speeds up the process by avoiding repeated downloads.
  • apply(): Finalizes the setup and applies the configuration.

Step 4: Set Up the Translation Pipeline

The next step is to create a pipeline for translation using a pre-trained model:

$translationPipeline = pipeline("translation", 'Xenova/nllb-200-distilled-600M');
Copy after login
  • pipeline("translation", 'Xenova/nllb-200-distilled-600M'): This function sets up a translation pipeline using the specified model, Xenova/nllb-200-distilled-600M, which is capable of handling multiple languages efficiently.

The model used for translations in this example is https://huggingface.co/Xenova/nllb-200-distilled-600M

Step 5: Provide content for translation

Define the sentences you want to translate:

$inputs = [ "The quality of tools in the PHP ecosystem has greatly improved in recent years", "Some developers don't like PHP as a programming language", "I appreciate Laravel as a development tool", "Laravel is a framework that improves my productivity", "Using an outdated version of Laravel is not a good practice", "I love Laravel", ];
Copy after login

This array contains English sentences that will be translated into Italian.

Step 6: Translate the content

Loop through each sentence and translate it:

foreach ($inputs as $input) { $output = $translationPipeline( $input, maxNewTokens: 256, tgtLang: 'ita_Latn' ); echo "?? " . $input . PHP_EOL; echo "?? " . trim($output[0]["translation_text"]) . PHP_EOL; echo PHP_EOL; }
Copy after login
  • $translationPipeline($input, maxNewTokens: 256, tgtLang: 'ita_Latn'): This function call translates each English sentence into Italian, with maxNewTokens limiting the length of the translation and tgtLang specifying the target language as Italian (ita_Latn).
  • trim($output[0]["translation_text"]): Cleans up the translated text by removing any leading or trailing whitespace.

The model supports a lot of languages. To define the target language with the tgtLang parameter, you must use the language code FLORES-200. Here there is a list: https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200

In the first execution of the script, the pipeline() function will download all the model files into the directory: models/Xenova/nllb-200-distilled-600M. Be patient, the model is huge, more than 800 MB.

How to translate content programmatically using AI and TransformersPHP

Conclusion

With TransformersPHP, translating content programmatically is a streamlined process. By setting up the environment, initializing the necessary classes, and defining a translation pipeline, you can easily convert text from one language to another. This is particularly useful for creating multilingual websites, applications, or content, allowing you to reach a broader audience effectively.

References

  • TransformersPHP website: https://codewithkyrian.github.io/transformers-php/
  • TransformersPHP source code: https://github.com/CodeWithKyrian/transformers-php
  • Intro article about TransformersPHP: https://dev.to/robertobutti/machine-learning-with-php-5gb
  • How to generate Alt Text with TransformersPHP https://dev.to/robertobutti/how-to-auto-generate-the-image-alt-text-using-ai-and-transformers-php-3onc
  • TransformersPHP official documentation: https://codewithkyrian.github.io/transformers-php/introduction
  • The author, the amazing Kyrian https://x.com/CodeWithKyrian, thank you for all your effort in building this open source PHP project ✨

The above is the detailed content of How to translate content programmatically using AI and TransformersPHP. For more information, please follow other related articles on the PHP Chinese website!

source:dev.to
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template
About us Disclaimer Sitemap
php.cn:Public welfare online PHP training,Help PHP learners grow quickly!