
Microsoft Boosts AI-Driven Vision, Search Services in Azure Cloud

Microsoft announced several enhancements and new offerings for its Cognitive Services cloud APIs that help developers leverage artificial intelligence (AI) capabilities in mobile apps.

Microsoft Cognitive Services provide the base of the company's AI platform, letting developers incorporate AI functionality for vision, speech, language, knowledge and search into iOS and Android apps and edge devices.

Here's a look at what Microsoft executive Joseph Sirosh announced on March 1.

  • A public preview of the Custom Vision service, available via the Azure Portal. It helps developers train their own classifiers by supplying example images grouped into categories, which are used to create machine learning models for use in apps.

    "We built Custom Vision with state-of-the-art machine learning that offers developers the ability to train their own classifier to recognize what matters in their scenarios," Sirosh said.

    Sirosh listed several real-world scenarios in which this technology can be used:

    • A retailer can easily create models to auto-classify catalog images of different kinds of clothing, such as dresses, shoes and so on.
    • A social media site can more effectively filter and classify images of specific products.
    • A national park can detect whether photos captured by automatic cameras include wild animals or not.

  • The already-available Face API -- which can identify faces and characteristics such as emotion, eyeglasses and gender -- has been improved, primarily with million-scale recognition. That means a collection of identifiable people or faces can scale up to 1 million.

  • Bing Entity Search is now generally available in the Azure Portal. This API lets developers infuse knowledge search into existing content to help identify entities associated with a search term. Such entities can include: famous people; locations; various types of media such as movies, TV shows, video games, books and so on; nearby local businesses; and more.

    Practical examples enabled by this technology include:

    • A messaging app could provide an entity snapshot of a restaurant, making it easier for a group to plan an evening.
    • A social media app could augment users' photos with information about the locations of each photo.
    • A news app could provide entity snapshots for entities in an article.
    • A music app could augment content with snapshots of artists and songs.
    • A camera app could use the Computer Vision API to detect entities in an image and then use the Entity Search API to provide more context about those entities inline, and so on.
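
As a rough illustration of the entity lookups described above, the following Python sketch builds (but does not send) a request for an entity search and pulls entity names out of a decoded response. The endpoint URL, the `q` and `mkt` query parameters, the `Ocp-Apim-Subscription-Key` header and the `entities.value[].name` response shape are assumptions that are not stated in the announcement, and the helper names `build_entity_request` and `entity_names` are hypothetical; check the current service documentation before relying on any of them.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumed endpoint; not taken from the announcement.
ENDPOINT = "https://api.cognitive.microsoft.com/bing/v7.0/entities"


def build_entity_request(query: str, key: str, market: str = "en-US") -> Request:
    """Build (but do not send) a GET request for an entity lookup.

    The query parameters and the subscription-key header are assumptions.
    """
    url = f"{ENDPOINT}?{urlencode({'q': query, 'mkt': market})}"
    return Request(url, headers={"Ocp-Apim-Subscription-Key": key})


def entity_names(response: dict) -> list[str]:
    """Return entity names from a decoded JSON response.

    Assumes the response nests results under entities -> value -> name;
    returns an empty list when that structure is absent.
    """
    return [e["name"] for e in response.get("entities", {}).get("value", [])]
```

A messaging or news app along the lines of the examples above might send the request with `urllib.request.urlopen`, decode the JSON body, and pass the result to `entity_names` to decide which entity snapshots to show.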

"Today's milestones illustrate our commitment to make our AI Platform suitable for every business scenario, with enterprise-grade tools to make application development easier and respecting customers' data," Sirosh said.

About the Author

David Ramel is an editor and writer at Converge 360.
