We’ve only just ‘scratched the surface of what’s possible’ with Alexa, exec says

Alexa is already one smart cookie. Amazon’s smart assistant has quickly become one of the most popular helpers on the market, and is capable of helping its users with everything from controlling their smart homes to answering pressing questions about life to making announcements to the household. But according to Rohit Prasad, the vice president and head scientist of the Alexa division at Amazon, we ain’t seen nothin’ yet. In fact, according to an interview with Prasad published on the Amazon blog, his team has only “scratched the surface of what’s possible.”

Prasad leads Alexa’s research and development in speech recognition, natural language understanding, and machine learning technologies, all in hopes of bettering users’ experiences with Echo devices. Since November 2014, Prasad and his team have shown that far-field speech recognition, even in loud environments, is possible with a high degree of accuracy. The reason for this, the executive says, is that Amazon has managed to bring together machine learning algorithms, data, and “immense computing power.” While conversational artificial intelligence (A.I.) has been a topic of interest among researchers for nearly five decades, it has historically been difficult for machines to not only understand, but also communicate in human language. As a result, Alexa’s ability to comprehend and respond to a “wide array of intents” makes her particularly impressive, Prasad noted.

So how exactly does Alexa work? First and foremost, your Echo device listens for a spoken audio cue, which is then converted to text by far-field automatic speech recognition (ASR) in the Amazon Web Services (AWS) cloud. Then, Alexa leverages natural language understanding (NLU) to convert these words into what Prasad calls a “structured interpretation of intent that can be used to respond to the user from the more than 30,000 Alexa skills built by first- and third-party developers.” This interpretation is coupled with certain contexts, like what kind of device the speaker is using, who the speaker is, or the most likely skills capable of providing a response. This context ultimately helps decide what Alexa’s next action ought to be, whether that is to respond or to ask for more information.

Alexa then responds using text-to-speech synthesis (TTS), which helps to translate strings of words into intelligible audio. Of course, the challenge here is to ensure that Alexa responds not only accurately, but quickly as well. As Prasad noted, “As scientists and engineers we’re always battling this healthy tension between accuracy and latency from when the user stops speaking to Alexa to when she responds.”
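
For readers who think in code, the flow described above (speech recognition, then intent interpretation, then a context-aware decision, then speech synthesis) can be pictured as a small pipeline. The Python sketch below is purely illustrative and is not Amazon's implementation: every name in it (`recognize_speech`, `understand`, `decide`, `synthesize`, `handle_request`, the `GetWeather` intent) is hypothetical, the "audio" is simply UTF-8 text standing in for sound, and the language understanding is a toy keyword rule rather than a learned model.

```python
from dataclasses import dataclass, field


@dataclass
class Interpretation:
    """A structured interpretation of intent: an intent name plus its slots."""
    intent: str
    slots: dict = field(default_factory=dict)


@dataclass
class Context:
    """Hypothetical request context; a real system would track far more."""
    speaker: str


def recognize_speech(audio: bytes) -> str:
    """Stand-in for ASR: the 'audio' here is just UTF-8 encoded text."""
    return audio.decode("utf-8").lower().strip()


def understand(text: str) -> Interpretation:
    """Toy NLU: a keyword rule that maps words to an intent and slots."""
    words = text.rstrip("?.!").split()
    if "weather" in words:
        slots = {}
        # Treat everything after the word "in" as the city, if present.
        if "in" in words and words.index("in") + 1 < len(words):
            slots["city"] = " ".join(words[words.index("in") + 1:])
        return Interpretation("GetWeather", slots)
    return Interpretation("Unknown")


def decide(interp: Interpretation, ctx: Context) -> str:
    """Use intent plus context to either respond or ask for more information."""
    if interp.intent == "GetWeather":
        if "city" not in interp.slots:
            return "Which city would you like the weather for?"
        return f"{ctx.speaker}, here is the weather for {interp.slots['city']}."
    return "Sorry, I did not understand that."


def synthesize(text: str) -> bytes:
    """Stand-in for TTS: encodes the reply instead of producing audio."""
    return text.encode("utf-8")


def handle_request(audio: bytes, ctx: Context) -> bytes:
    """Run the four stages in order: ASR, NLU, decision, TTS."""
    text = recognize_speech(audio)
    interp = understand(text)
    reply = decide(interp, ctx)
    return synthesize(reply)
```

Calling `handle_request(b"Weather", Context(speaker="Sam"))` in this sketch produces a follow-up question rather than an answer, which mirrors the "ask for more information" branch Prasad describes. Each stage also adds to the time between the end of the request and the reply, which is where the tension between accuracy and latency comes in.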

So what is it that makes Alexa more capable than other smart assistants? Apparently, it has a lot to do with the fact that Alexa lives mostly in the cloud, which means the more you talk to her, the smarter she becomes. The smart assistant employs a range of learning techniques, but Prasad pointed out that Alexa “scientists and engineers are continually applying and inventing new learning techniques,” including what’s called transfer learning, which allows Alexa to apply lessons learned from one skill to another, or even one language to another.

As for what we can look forward to, not only from Alexa but from smart assistants as a whole, Prasad has a few ideas. “A.I. will have deep societal impact and will help humans learn new skills that we can’t even imagine today,” he said. “In the next five years, we will see conversational A.I. get smarter on multiple dimensions as we make further advances with machine learning and reasoning. With these advances, we will see Alexa become more contextually aware in how she recognizes, understands, and responds to requests from users.” Ultimately, Prasad envisions a future in which the smart assistant will be able to engage in conversations regarding current events and other everyday topics. You can already check out what is possible by saying, “Alexa, let’s chat.” You may just be surprised by what you learn.

Lulu Chang
Former Digital Trends Contributor