Skip to main content

Google Glass meets Kinect in ARI, a gesture-recognition app for smartglasses

Google Glass meets Kinect in ARI

Google Glass took smartglasses from the realm of pony-tailed academics and Silicon Valley dreamers to mainstream reality. It has dozens of apps. It has new designs from Oakley and Ray-Ban. It has fans that include both runway models and basketball players.

And if you’re in a loud room, it’s almost as worthless as it is geeky. Because talking to your gadgets is nowhere near as smooth at David Hasselhoff made it look in Knight Rider.

Recommended Videos

Sure, you can bark “Glass, take a photo” in the comfort of your living room and snap Mr. Meowsers without setting down your tea. But trying the same thing in a bustling bar is a surefire way to annoy the people around you — and perhaps get a beatdown. Meanwhile, Glass’s touch-sensitive pad handles only a few basic functions.

Quickly evolving smartglass technology could also help put a spring in ARI’s rather lethargic step.

But what if Glass could read hand gestures like Microsoft’s Kinect? Soon, it will. A Portland, Ore., startup called On the Go Platforms is developing a way to control your smartglasses with a form of sign language, and on Wednesday, the company released its first public beta.

“The smartphone is moving up into your vision, and there needs to be a new interface to evolve with the new hardware,” says cofounder Ryan Fink. “That’s where ARI comes in.”

ARI, short for Augmented Reality Interface, requires no buttons, no touchpads, and no speaking. “ARI is the Siri of gesture recognition for smartglasses,” explains Fink.

Hold your fist up in front of your face (like an old-timey boxer looking for a fight) and a box appears superimposed over it in Google Glass. After recognizing it as a command, ARI counts down 3 seconds so you can get your hand out of the way, then snaps a photo. After collecting a few, you can use the wave of your hand to leaf through Glass’s library of photos as if turning invisible pages in a photo album.

That’s just one use. Ultimately, the team hopes ARI will be able to recognize an entire library of gestures, which outside developers can bake into their apps as controls. The Pandora app, for example, could someday interpret a literal thumbs up as your approval for that Whitesnake jam it just played on your 80s station.

A new way to control Google Glass

Besides simply being less irritating than talking to your glasses, Fink and cofounder Gary Peck see it as an essential way to control Glass in scenarios where it would otherwise never work, like a factory floor.

“It’s really loud, they sometimes have stuff in their hands and they want to be able to easily interact with the content or check things off as they go through,” Fink explains. With ARI, a worker could simply wave a hand to move the display in his glasses onto the next instruction in a list, or hold up a fist to mark it done.

The same goes for athletes. A snowboarder with thick gloves might have difficulty swiping the touchpad, and a biker might not want to lift his hands from the handlebars to start a timer.

“I think at first it will be a little weird, especially to people that aren’t familiar with the technology.”

ARI also has the potential to make Glass a platform for games. After all, no one wants to control a game by frantically drumming on their glasses or yelling “Flappy Bird, up!” But as Kinect has proven, gamers don’t seem to mind flailing their arms around.

In theory, ARI solves all these issues. In execution, there are still plenty of kinks to iron out. The early alpha version we tested took a while to recognize gestures, and it takes a purposeful execution to pull off a swipe or fist in just the way ARI is looking to see it.

But quickly evolving smartglass technology could also help put a spring in ARI’s rather lethargic step. Since it constantly analyzes images from the built-in Glass camera, ARI can’t run on your phone; it has to run on the anemic processor built into Glass. As those processors get faster, so will ARI.

Glass’s camera also remains a challenge — because there’s only one. Purpose-built gesture-control systems like Microsoft’s Kinect rely on a pair of cameras to generate stereo images, which lets the computer reading your manic motions determine depth. “With dual cameras, you really get a 3D model of the world in front of you. With a single camera it’s just a plain image,” Peck explains “You have no idea that this hand is a different object from the table, it’s all just different pixels.” Peck has had to work around that primitive input to detect objects based on the look of them alone.

Google Glass meets Kinect in ARI

The challenge is made even more difficult by the need to preserve Google Glass’s limited battery life: The more accurate the gesture detection, the more it drains the battery. “That’s the tradeoff,” Peck said. “How can you get accuracy that’s good enough for the gestures you’re trying to do?” His code, for instance, uses low-res video from the onboard camera even though it can technically capture at 1080p — because he’s not trying to track every finger.

The geek factor here is also impossible to ignore: How do you get people to use a technology that makes you look like the world’s worst mime? “I think at first it will be a little weird, especially to people that aren’t familiar with the technology,” Fink acknowledges, but insists perceptions will change as the technology improves. “It will be much more like Iron Man or Minority Report, where it’s an immersive experience. So I think the stigmatism toward it being weird and awkward will melt away with that.”

The future of smartglasses and augmented reality remains smudgy.

One of the biggest steps forward will come from “waveguide lenses.” While Glass merely displays a small video feed in the corner of your vision, waveguide lenses can literally overlay information over your entire field of view, like true Terminator glasses. Vuzix has produced early prototypes of this technology and some expensive models meant for industrial environments, but they’ve yet to shrink to the size or price of Google Glass. Peck believes it could be at least another year before we see this level of display reach the mainstream.

The future of smartglasses and augmented reality remains smudgy, but the unknown next steps remain part of the appeal for Peck and Fink.

“It’s a new interaction paradigm. There’s a lot of figuring out — how do you present information in the most intuitive way? How do you interact with it? Peck explains. “But as an app developer, there’s no conventions. There’s no knowledge of the best way to present information that’s up here on your screen. Those are kind of interesting challenges.”

Nick Mokey
As Digital Trends’ Managing Editor, Nick Mokey oversees an editorial team delivering definitive reviews, enlightening…
The best portable power stations
EcoFlow DELTA 2 on table at campsite for quick charging.

Affordable and efficient portable power is a necessity these days, keeping our electronic devices operational while on the go. But there are literally dozens of options to choose from, making it abundantly difficult to decide which mobile charging solution is best for you. We've sorted through countless portable power options and came up with six of the best portable power stations to keep your smartphones, tablets, laptops, and other gadgets functioning while living off the grid.
The best overall: Jackery Explorer 1000

Jackery has been a mainstay in the portable power market for several years, and today, the company continues to set the standard. With three AC outlets, two USB-A, and two USB-C plugs, you'll have plenty of options for keeping your gadgets charged.

Read more
CES 2023: HD Hyundai’s Avikus is an A.I. for autonomous boat and marine navigation
Demonstration of NeuBoat level 2 autonomous navigation system at the Fort Lauderdale International Boat Show

This content was produced in partnership with HD Hyundai.
Autonomous vehicle navigation technology is certainly nothing new and has been in the works for the better part of a decade at this point. But one of the most common forms we see and hear about is the type used to control steering in road-based vehicles. That's not the only place where technology can make a huge difference. Autonomous driving systems can offer incredible benefits to boats and marine vehicles, too, which is precisely why HD Hyundai has unveiled its Avikus AI technology -- for marine and watercraft vehicles.

More recently, HD Hyundai participated in the Fort Lauderdale International Boat Show, to demo its NeuBoat level 2 autonomous navigation system for recreational boats. The name mashes together the words "neuron" and "boat" and is quite fitting since the Avikus' A.I. navigation tech is a core component of the solution, it will handle self-recognition, real-time decisions, and controls when on the water. Of course, there are a lot of things happening behind the scenes with HD Hyundai's autonomous navigation solution, which we'll dive into below -- HD Hyundai will also be introducing more about the tech at CES 2023.

Read more
This AI cloned my voice using just three minutes of audio
acapela group voice cloning ad

There's a scene in Mission Impossible 3 that you might recall. In it, our hero Ethan Hunt (Tom Cruise) tackles the movie's villain, holds him at gunpoint, and forces him to read a bizarre series of sentences aloud.

"The pleasure of Busby's company is what I most enjoy," he reluctantly reads. "He put a tack on Miss Yancy's chair, and she called him a horrible boy. At the end of the month, he was flinging two kittens across the width of the room ..."

Read more