Science & Technology

OpenAI Previews New Audio Tool

OpenAI has released early test results for its new artificial intelligence feature, which can read text aloud in a highly realistic human voice.

The feature significantly expands what machine intelligence can do, but it also raises the risk of deepfakes. As artificial intelligence spreads, cybersecurity has become a more pressing concern, and users' technical literacy matters more as a result. For example, a search engine query such as "How to know if my camera is hacked" lets anyone learn the signs of unauthorized access to a device. It is better to spend time learning than to lose money or have a device compromised.

The new OpenAI feature is called the Voice Engine, and the company is positioning it as a text-to-speech model. About 10 developers have tested it so far. For now, the company has decided against a large-scale release of the Voice Engine.

A spokesperson for OpenAI said that the firm began scaling back the release of the new product after receiving feedback from stakeholders, including creatives, policymakers, educators, and industry experts.

On Friday, March 29, the company published a blog post noting that realistic AI imitation of a human voice carries serious risks, which are especially relevant in a U.S. presidential election year.

The Voice Engine can generate speech in a specific person's voice, including their characteristic intonation. The AI needs only a 15-second audio recording of a person speaking to reproduce their voice.

The custom speech model can also translate the audio it generates into different languages.

Jeff Harris, a product lead at OpenAI, said that preliminary testing of the Voice Engine showed impressive technical quality. At the same time, he noted that the ability to accurately imitate human speech poses serious security challenges.

OpenAI said it is soliciting feedback from outside experts before deciding on a wider release of the Voice Engine. The company added that it is important for people to understand where this technology is headed.

Serhii Mikhailov

Serhii studied for six years at the Faculty of Philology and has worked in the media for eight years, developing a deep understanding of the industry and honing his writing skills. His areas of expertise include fintech, payments, cryptocurrency, and financial services. He closely follows developments and innovations in these fields, which he believes will significantly shape the future direction of the economy.