AI to be Used to Create 3D Motion Sculptures

The system developed by the MIT and Berkeley scientists is called MoSculp and is based on artificial inteligence
21 September 2018   304

MoSculp, the joint work of MIT scientists and the University of California at Berkeley, is built on the basis of a neural network. The development analyzes the video recording of a moving person and generates what the creators called "interactive visualization of form and time." According to the lead specialist of the project Xiuming Zhang, software will be useful for athletes for detailed analysis of movements.

At the first stage, the system scans the video frame-by-frame and determines the position of key points of the object's body, such as elbows, knees, ankles. For this, scientists decided to resort to the OpenPose library, developed by the Carnegie Mellon University. Based on the received data, the neural network compiles a 3D model of the person in each frame, and calculates the trajectory of the motion, obtaining a "motion sculpture".

At this stage, the image, according to the developers, suffers from a lack of textures and details, so the application integrates the "sculpture" in the original video. To avoid overlapping, MoSculp calculates a depth map for the original object and the 3D model.

MoSculp 3D Model
MoSculp 3D Model

The operator can adjust the image during the processing, select the "sculpture" material, color, lighting, and also what parts of the body will be tracked. The system is able to print the result using a 3D printer.

The team of researchers announced plans to further develop the MoSculp technology. Developers want to achieve from the processing system more than one object on the video, which is currently impossible. The creators of the technology believe that the program will be used to study group dynamics, social disorders and interpersonal interactions.

The principle of creating a 3D model based on human movements has been used before. For example, in August 2018, scientists at the same University of California at Berkeley demonstrated an algorithm that transfers the movements of one person to another.

Microsoft to Use AI to Create Human Voice

Synthetic voice is nearly indistinguishable from recordings of people
27 September 2018   457

Researchers from Microsoft recorded computer voice, imitating human speech. To overcome the difficulties of the traditional model, they used neural networks for speech synthesis. Microsoft promises to provide support for 49 languages ​​and the ability to create unique voices for the needs of companies in the near future.

Synthesis of speech with the help of neural networks involves comparing the stress and length (so-called prosody) of the speaker's speech units, as well as their synthesis into a computer voice. In systems of traditional speech synthesis, prosody is divided into acoustic and linguistic analysis, controlled by various models. As a result, the speech is noisy and indistinct. Representatives of Microsoft argue that in the model of neural synthesis two stages are combined into one, so the voice sounds like a real one.

The developers are convinced that the synthesis of speech with the help of neural networks will make it more natural to communicate with virtual interlocutors and assistants. Moreover, it will enable you to convert e-books into audiobooks and will allow you to change the scoring of built-in navigators.

Microsoft Neural TTS
Microsoft Neural TTS

Azure computing power is available for real-time use, and Azure Kubernetes is responsible for this. Simultaneous application of neural synthesis of speech together with traditional speaks about expansion and increase of availability of service. At the moment, there are a female voice named Jessa and a man named Guy.

Microsoft is competing in speech recognition and synthesis technologies with Google, which updated its services in late August 2018. Google Cloud announced the release of a stable API for the synthesis of speech Cloud Text-to-Speech with the experimental function of audio profiles and support for several new languages.