Home → News → Project Oxford Apis Offer Facial, Voice Recognition... → Full Text

Project Oxford Apis Offer Facial, Voice Recognition Capabilities

By Government Computer News

May 21, 2015

[article image]

Microsoft's age-guessing site, How-Old.net, is powered by the cloud-based Project Oxford, which now offers tools developers can use to incorporate intelligence into their apps.

Project Oxford's application programming interfaces (APIs) and software development kits enable developers to add intelligence technologies, such as facial and speech recognition, into their solutions that leverage Microsoft's natural data understanding.

The face API, for example, uses algorithms to detect and recognize human faces in images, with the ability to determine gender and age. Developers can check whether two images of faces are similar enough to be the same person (authentication) or to compare an image to a user-provided face database.

The Project Oxford speech API uses algorithms to process spoken language into text and text into audio, which could be used for such tasks as initial call center screening. Files of spoken audio are transmitted to the Microsoft's servers in the cloud, and a single text result is returned.

The vision API generates visual features based on the input image's visual content, reads text from an image, and creates thumbnail versions of the original image that are tailored to specific needs.

Project Oxford also includes the Language Understanding Intelligent Service, offered as an invitation-only beta, that helps applications understand what users mean when they say or type something using natural, everyday language.

From Government Computer News
View Full Article


Abstracts Copyright © 2015 Information Inc., Bethesda, Maryland, USA


No entries found