
Voice recognition for kids isn’t child’s play, but this company has mastered it

Voice recognition plays a big part in making our homes and the products in them smarter, and just as we are still adjusting to speaking confidently to a device, the technology behind it is also changing. It has to deal with different languages, accents, and voice characteristics to function reliably. SoapBox Labs, a speech recognition company, specializes in children’s voices, and has been working on solving the problem of devices recognizing kids when they speak.

The result is a new Application Programming Interface (API) for developers to use inside everything from connected toys and VR games to the Skills that power the Amazon Echo. SoapBox Labs’ API is made specifically to recognize children between 4 and 12 years old, picking up on their unique voices, their tendency to shout instructions, and the speech patterns of someone so young.

Wondering if there is much of a difference between children and adults speaking the same language? The vast majority of voice recognition systems available now were built for adults, by adults, and using speech data collected from adults. SoapBox collected speech data from children, then used its expertise in voice recognition to create custom algorithms and speech models that power the interface.

SoapBox Labs founder and CEO Patricia Scanlon, Ph.D., told Digital Trends: “Young kids are wildly unpredictable in their speech behaviors. While our team previously had significant experience in speech recognition for adults, building our platform specifically for kids over the past few years was a continuous challenge. It is like dealing with a completely different language!”

Uses beyond interactive toys

Where will SoapBox’s API be used? The most obvious place would be inside smart, connected toys. Kids would soon lose interest in a toy that promised to listen and respond to commands, but failed to catch what they said because the voice recognition was adapted from a program designed for adults. SoapBox’s system could effectively turn the toy into another child, in tune with what other kids say and able to converse rather than solely react to commands.

Beyond interactive toys, SoapBox Labs sees great potential in schools and learning tools. Scanlon continued:

“We were motivated to build this technology as parents ourselves and realize this technology will play a big part in our children’s lives. We want to make that experience safer, more enjoyable, and [more] engaging for them. Our technology can not only voice-enable home devices, games, and toys for kids, the same underlying technology can also enable personalized learning for reading and language tutors.”

SoapBox Labs founder and CEO Patricia Scanlon

This is important, and gives SoapBox Labs’ voice system a higher purpose. The system has an assessment tool inside, providing real-time feedback on reading, literacy, and language. Built into tutoring apps, the API could be used in classrooms and other learning environments.

Helping voice-controlled devices better understand children is great when whatever they’re talking to is designed for them, but not so good when they decide to ask Alexa to deliver all those newly released Lego sets and charge them to your credit card. SoapBox is well aware of the problem, and its technology can be used to help avoid this situation.

“Our technology can be used to detect kids’ voices and direct their commands to a dedicated and safe voice interface just for kids,” said Scanlon. “This can be part of an existing home device ecosystem or app, allowing kids to only access specific skills, and the device to respond appropriately to kids’ voices.”
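
For developers curious how that kind of routing might look in practice, here is a minimal sketch in Python. It is not SoapBox Labs’ API, which the company has not detailed here. The `child_score` value, the skill names, and the `route` function are all hypothetical, and the sketch simply assumes some upstream classifier has already estimated how likely it is that a child is speaking.

```python
from dataclasses import dataclass

# Hypothetical allowlist of skills a child is permitted to use.
KID_SAFE_SKILLS = {"story_time", "spelling_game", "animal_sounds"}


@dataclass
class Utterance:
    text: str           # transcribed command
    skill: str          # skill the command is aimed at
    child_score: float  # assumed classifier output: 0.0 (adult) to 1.0 (child)


def route(utterance: Utterance, threshold: float = 0.5) -> str:
    """Decide which interface should handle an utterance."""
    # Speakers scored below the threshold are treated as adults.
    if utterance.child_score < threshold:
        return "adult_interface"
    # Likely child speakers only reach skills on the allowlist.
    if utterance.skill in KID_SAFE_SKILLS:
        return "kids_interface"
    # Anything else, such as a shopping request, is refused.
    return "blocked"
```

In this sketch, a likely child asking for a story would be sent to the kids’ interface, while the same child trying to order Lego sets through a shopping skill would be blocked. The 0.5 threshold is an arbitrary placeholder and would need tuning against real data.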

SoapBox Labs’ cloud-based API is available for developers to use now, and Scanlon is particularly interested in hearing about projects that have a “real social impact,” with the possibility of offering free use of the platform in the right cases. We can expect to see the first products with SoapBox Labs’ voice recognition launch during the first three months of 2018.

Andy Boxall
Senior Mobile Writer
Andy is a Senior Writer at Digital Trends, where he concentrates on mobile technology, a subject he has written about for…