Military Technology

Chapter 69 The temperature of speech

Wu Hao smiled and shook his head: "No, it's just an unfinished product, there are still many problems that we need to solve.

For example, in the conversation just now, it is more difficult to understand and deal with the vague context. "

"Ambiguous context?"

Zou Xiaodong was stunned for a moment, then quickly understood: "This seems to be difficult for us real people to understand, let alone a machine program.

Boss, I don't quite understand it. Most of the technology companies are currently doing speech recognition and speech dialogue, and the results are also good.

These voice software also have a high degree of recognition of our normal speech, basically reaching more than 99%.

However, the response speed of these software is far less than the recognition speed of our technology, the comprehension ability is not as strong as it, and the processing power of Lenovo is not comparable.

Also, in terms of voice dialogue, how did you get the machine language to be so close to a human voice?

It is important to know that human hearing is still very sensitive, and whether it is a human or a machine program voice can still be quickly distinguished. "

Wu Hao heard a lot of questions from Zou Xiaodong, and asked him back: "What do you think is the biggest difference between human voice and AI voice?"

Zou Xiaodong thought about it for a while, and then replied: "Less Ping Zhe Su Su?"

Wu Hao shook his head and said, "This is not the most important thing. In fact, some voice software on the market can already perform a simple smoothing of frustration."

"That is……"

Wu Hao looked at Zou Xiaodong's puzzled look, and said with a smile: "Emotion, all the voice programs on the market currently lack emotion."

"Feelings, what a joke, how can a program have feelings? This is what people have." Zou Xiaodong shook his head and couldn't understand.

Wu Hao smiled, then controlled the computer to display the schematic diagram on the big screen and said, "It's more about language temperature than emotion.

When we are talking, the other party can clearly perceive the emotional changes when we speak. This is emotion, and this is also the temperature of language.

The language program, it is in accordance with a fixed formula to respond. Therefore, it cannot understand the temperature of each sentence, and naturally there is no temperature in generating speech.

What we need to do is to add an understanding of the language lexical environment in the process of speech recognition, and analyze the temperature of the speech and the emotional changes of the speaker from different tones. "

"I still can't understand how the program can capture the ever-changing emotions that people show when they speak. It is important to know that sometimes slight changes in language and tone can show two completely different meanings and two emotions, How does the machine distinguish." Zou Xiaodong said his doubts.

Wu Hao smiled and demonstrated the content on the screen, and replied to him: "This is the application of AI technology, everyone's language and intonation are different, and the emotional expression is also ever-changing. If we follow the traditional way, we need to deal with these ever-changing expressions. The language intonation context is collected and analyzed to define it. If this is the case, the workload is too great.

Therefore, the learning and evolution ability of AI technology allows me to find an idea. We can train a set of basic AI voice programs by capturing the massive voice information of love on the Internet.

Of course, this is just a sample of the basic program, we need to adjust accordingly according to the user's habits. Let the program learn to adapt to the user. The longer the user uses it, the more accurate the recognition and understanding of the AI ​​recognition program. "

Speaking of this, Wu Hao said with a smile: "This is actually very similar to the process of getting along with real people in the real world. After two strangers get to know each other, both sides are gradually getting to know each other.

The more time passed, the more familiar they became. Even a simple word, gesture or gaze from one party can be accurately received and understood by the other party. This is the so-called tacit understanding.

What we need to do is to cultivate a tacit understanding between programs and people, but it is very difficult for users to change, and can only have a subtle influence. So we have to start with the program software, let it adapt to the user, and change the user subtly.

Only in this way, the human-computer interaction will be more tacit.

That's why when I was talking to 10 earlier, it couldn't understand my ambiguous context. It didn't adapt to my speaking habits, so it didn't understand what the vague words I said meant.

Like what, how many, how many, then, where, random, these uncertain vague words, the program is difficult to understand and deal with. And this requires us to give these words a basic definition. This definition should not be rigid and rigid, and it must be modified and changed according to the context of the user. "

After saying this, Wu Hao looked at Zou Xiaodong and said seriously: "Only after the program understands the emotional temperature in our real people's words, can the program simulate a voice similar to real people's speech."

"In any case, this is a major breakthrough in the field of AI voice technology. I think once this technology is released, it will definitely shock the world, and it represents the real arrival of the era of intelligent voice.

To be honest, I can't wait. "Zou Xiaodong said excitedly, licking his dry lips.

Wu Hao waved his hand and said, "It's not as exaggerated as you said, but it is indeed a major breakthrough in technology."

"Boss, do you plan to target this technology directly to the mass consumer market, or to cooperate with enterprise users to sell technology and related patents, or to provide services for them with open source relaxation." Zou Xiaodong asked him curiously. This is a heavyweight technology that will shake up the industry no matter who it works with.

"What do you think?" Wu Hao didn't answer directly, but asked instead.

Zou Xiaodong thought about it for a while, and then said seriously to Wu Hao: "If an enterprise wants to become bigger and stronger, it cannot be limited to a single field. Although cooperation with enterprises can save a lot of things, it is very risky. Once a cooperative enterprise With access to more advanced technology, we run the risk of being abandoned.

So I think we should develop mass market, use this technology to build our brand among the people and expand our influence. Only in this way can we reduce unnecessary trouble and resistance in future development. "

"The analysis is in place, but this market has huge potential, and monopoly alone is definitely not enough. We still need to cooperate with those companies. Of course, we can't lag behind in the mass market.

So I'm going to do both, and this smart voice assistant is what I've built for the mass market. How about, put out the video I demonstrated just now, and you say how the society and the industry will react. "Wu Hao asked with a smile.

"You mean... Haha, I'm looking forward to it!"

Tap the screen to use advanced tools Tip: You can use left and right keyboard keys to browse between chapters.

You'll Also Like