Close Menu
5gantennas.org5gantennas.org
  • Home
  • 5G
    • 5G Technology
  • 6G
  • AI
  • Data
    • Global 5G
  • Internet
  • WIFI
  • 5G Antennas
  • Legacy

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
5gantennas.org5gantennas.org
  • Home
  • 5G
    1. 5G Technology
    2. View All

    Deutsche Telekom to operate 12,500 5G antennas over 3.6 GHz band

    August 28, 2024

    URCA Releases Draft “Roadmap” for 5G Rollout in the Bahamas – Eye Witness News

    August 23, 2024

    Smart Launches Smart ZTE Blade A75 5G » YugaTech

    August 22, 2024

    5G Drone Integration Denmark – DRONELIFE

    August 21, 2024

    Hughes praises successful private 5G demo for U.S. Navy

    August 29, 2024

    GSA survey reveals 5G FWA has become “mainstream”

    August 29, 2024

    China Mobile expands 5G Advanced, Chunghwa Telecom enters Europe

    August 29, 2024

    Ateme and ORS Boost 5G Broadcast Capacity with “World’s First Trial of IP-Based Statmux over 5G Broadcast” | TV Tech

    August 29, 2024
  • 6G

    India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

    August 29, 2024

    Vodafonewatch Weekly: Rural 4G, Industrial 5G, 6G Patents | Weekly Briefing

    August 29, 2024

    Southeast Asia steps up efforts to build 6G standards

    August 29, 2024

    Energy efficiency as an inherent attribute of 6G networks

    August 29, 2024

    Finnish working group launches push for 6G technology

    August 28, 2024
  • AI

    Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

    August 29, 2024

    Why Honeywell is betting big on Gen AI

    August 29, 2024

    Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

    August 29, 2024

    “Elon Musk and Trump” arrested for burglary in disturbing AI video

    August 29, 2024

    Nvidia CFO says ‘enterprise AI wave’ has begun and Fortune 100 companies are leading the way

    August 29, 2024
  • Data
    1. Global 5G
    2. View All

    Global 5G Enterprise Market is expected to be valued at USD 34.4 Billion by 2032

    August 12, 2024

    Counterpoint predicts 5G will dominate the smartphone market in early 2024

    August 5, 2024

    Qualcomm’s new chipsets will power affordable 5G smartphones

    July 31, 2024

    Best Super Fast Download Companies — TradingView

    July 31, 2024

    Crypto Markets Rise on Strong US Economic Data

    August 29, 2024

    Microsoft approves construction of third section of Mount Pleasant data center campus

    August 29, 2024

    China has invested $6.1 billion in state-run data center projects over two years, with the “East Data, West Computing” initiative aimed at capitalizing on the country’s untapped land.

    August 29, 2024

    What is the size of the clinical data analysis solutions market?

    August 29, 2024
  • Internet

    NATO believes Russia poses a threat to Western internet and GPS services

    August 29, 2024

    Mpeppe grows fast, building traction among Internet computer owners

    August 29, 2024

    Internet Computer Whale Buys Mpeppe (MPEPE) at 340x ROI

    August 29, 2024

    Long-term internet computer investor adds PEPE rival to holdings

    August 29, 2024

    Biden-Harris Administration Approves Initial Internet for All Proposals in Mississippi and South Dakota

    August 29, 2024
  • WIFI

    4 Best Wi-Fi Mesh Networking Systems in 2024

    September 6, 2024

    Best WiFi deal: Save $200 on the Starlink Standard Kit AX

    August 29, 2024

    Sonos Roam 2 review | Good Housekeeping UK

    August 29, 2024

    Popular WiFi extender that eliminates dead zones in your home costs just $12

    August 29, 2024

    North American WiFi 6 Mesh Router Market Size, Share, Forecast, [2030] – அக்னி செய்திகள்

    August 29, 2024
  • 5G Antennas

    Nokia and Claro bring 5G to Argentina

    August 27, 2024

    Nokia expands FWA portfolio with new 5G devices – SatNews

    July 25, 2024

    Deutsche Telekom to operate 12,150 5G antennas over 3.6 GHz band

    July 24, 2024

    Vodafone and Ericsson develop a compact 5G antenna in Germany

    July 12, 2024

    Vodafone and Ericsson unveil new small antennas to power Germany’s 5G network

    July 11, 2024
  • Legacy
5gantennas.org5gantennas.org
Home»AI»Nvidia unveils blueprint for next-generation generative AI
AI

Nvidia unveils blueprint for next-generation generative AI

5gantennas.orgBy 5gantennas.orgAugust 27, 2024No Comments6 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest WhatsApp Email



Hardware is always a main focus of Nvidia’s GPU technology conferences, and this year saw a preview of the “Blackwell” data center GPU, which is the basis of the 2025 platform that includes the “Grace” CPU, NVLink Switch 5 chip, Bluefield-3 DPU, and other components — all components Nvidia will be highlighting again at its Hot Chips 2024 conference this week.

What has received less attention is Nvidia’s NIM strategy to make it easier and faster for developers to create AI applications. There’s been a lot of buzz about Nvidia Inference Microservices, but with the likes of Blackwell on the horizon, it’s hard to get traction.

Still, NIM is important to Nvidia’s larger plans to enable users to develop AI software with generative AI tools like chatbots. Nvidia says NIM delivers everything software engineers need in a container-like environment as pre-built microservices that can be deployed to the cloud, datacenters, workstations and other systems. Built on Kubernetes, the NIM container includes open source large-scale language models, a cloud-native stack, Nvidia’s TensorRT and TensorRT-LLM, Triton inference server and standard APIs, and is part of Nvidia’s larger AI enterprise strategy.

Justin Boitano, vice president of enterprise AI software products at Nvidia, said NIM is part of what he calls the “second wave of generative AI,” occurring in enterprises and enabling companies to leverage organizational knowledge to run their business, engage with customers, and innovate much faster.The first wave, fueled by the enthusiasm following the release of OpenAI’s ChatGPT in late November 2022, was driven by foundational modelers and concerned with embedding generative AI in internet services and improving individual productivity through writing languages ​​and code.

In this new wave, “generative AI will help teams understand complex business processes and supply chain dependencies and bring new products and services to market at a speed no company has been able to achieve before,” Boitano told journalists and analysts in a briefing ahead of the Hot Chips show in California this week. “This started with the introduction of open models such as Meta Platforms’ Lama 3.1. These models represent an incredible advancement, giving companies a new level of intelligence that was almost unimaginable running in the data center just a few years ago.”

NIM was created to make it possible to run such models at scale, in production, and securely, he said, adding that Nvidia is currently working with various AI model builders to use NIM to essentially give their models a high-performance, efficient runtime.

“These NIMs deliver performance optimizations, delivering token throughput efficiency two to five times faster than other solutions, optimizing the total cost of ownership for businesses running generative AI on Nvidia systems,” Boitano said. “By working with an ecosystem of community model builders, proprietary model builders and our own models, we ensure every modality for every business works seamlessly, resulting in the best token efficiency for customers using Nvidia AI Enterprise.”

At Hot Chips, Nvidia is taking another step with NIM, introducing NIM Agent Blueprints for developers who want to create custom generative AI applications. These are reference AI workflows that include sample applications based on NIM and partner microservices, reference code, documentation outlining customizations, and Helm charts (files that detail the resources for a Kubernetes cluster and package them as an application) for deploying the apps. Developers can modify the blueprints.

“It’s a catalog of reference applications built for common use cases, codifying best practices Nvidia has learned from its experience with early adopters. Nvidia NIM blueprints are executable AI workflows that are pre-trained for specific use cases and can be modified by any developer. They’re a starting point for executing what we believe are some of the most critical business tasks in enterprises,” Boitano said.

The NIM Blueprint is part of what Nvidia calls a “data flywheel,” which goes beyond accelerating models. Models must be enhanced and customized to address the specific needs of an organization and its use cases. In the flywheel idea, as an AI application runs and interacts with users, it generates data that can be fed back into the process and used to improve the model in a continuous learning cycle, he said.

“Nvidia NeMo is the engine that powers this flywheel,” Boitano said, adding, “The Nvidia AI Foundry is the factory that powers the NeMo flywheel, and these customized generative AI applications enable businesses to deliver better, higher-quality experiences to their customers and employees.”

He added, “The application building process actually starts with NIM, but to build a data flywheel, the Nvidia NeMo framework can be used to curate data, customize models and evaluate them to power the application and bring it back to production. NeMo accelerates all the compute-intensive stages of the generative AI app development lifecycle, and we have a broad partner ecosystem building on top of NeMo and NIM, making it easy for enterprises to develop their own generative AI applications.”

Since the early days of generative AI efforts, organizations have spoken about the need to be able to customize their AI efforts by incorporating enterprise data into the training and inference mix. This impetus gave birth to Search Augmented Generative (RAG).

Nvidia will initially release blueprints for three scenarios, including Digital Humans for Customer Experience (creating 3D digital humans that can interact with users, enabling multi-channel communication and connecting to the RAG system), and multi-modal PDF data extraction for enterprise RAG.

“Trillions of PDFs are generated across enterprises every year, and these PDFs contain multiple data types, including text, images, graphs, tables, and more,” he said. “The Multimodal PDF Data Extraction Blueprint helps organizations accurately extract the knowledge contained within vast amounts of enterprise data, allowing users to effectively access this data through chat interfaces and quickly turn digital humans into experts on any topic, empowering employees to make smarter, faster decisions.”

Finally, accelerating drug discovery using generative AI to simulate molecules that can target and bind to proteins.

Nvidia has brought on Accenture, Deloitte, SoftServe, Quantiphi and World Wide Technology to provide NIM Agent Blueprints, Dataiku and DataRobot for fine-tuning models and monitoring, LlamaIndex and Langchain for workflow building, Weights and Biases for application assessment, CrowdStrike, Datadog, Fiddler AI, New Relic and Trend Micro for cybersecurity. Enterprise portfolios from Nutanix, Red Hat and Broadcom support the blueprints.

They also run on systems from OEMs such as Cisco, Dell Technologies, Hewlett Packard Enterprise and Lenovo, as well as hyperscale systems from Amazon Web Services, Google Cloud, Azure and Oracle Cloud Infrastructure.

Sign up for our newsletter

We’ll deliver the week’s highlights, analysis and stories straight to your inbox, with no middle ground.
Subscribe now

Related articles



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleHunters International ransomware group threatens to leak US Marshals data
Next Article RANGE launches study to identify internet access needs in the Panhandle
5gantennas.org
  • Website

Related Posts

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Why Honeywell is betting big on Gen AI

August 29, 2024

Ethically questionable or creative genius? How artists are engaging with AI in their work | Art and Design

August 29, 2024

Comments are closed.

Latest Posts

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024

Crypto Markets Rise on Strong US Economic Data

August 29, 2024
Don't Miss

Apple focuses on 6G for future iPhones

By 5gantennas.orgDecember 11, 2023

iPhone 15 Pro and Pro MaxWith Apple’s recent listing of cellular platform architects to work…

All connectivity technologies will be integrated in the 6G era, says Abhay Karandikar, DST Secretary, ET Telecom

January 31, 2024

5G-Advanced and 6G networks require additional spectrum

January 24, 2024

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to 5GAntennas.org, your reliable source for comprehensive information on 5G technology, artificial intelligence (AI), and data-related advancements. We are passionate about staying at the forefront of these cutting-edge fields and bringing you the latest insights, trends, and developments.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

4 Best Wi-Fi Mesh Networking Systems in 2024

September 6, 2024

India is on the brink of a new revolution in telecommunications and can lead the world with 6G: Jyotiraditya Scindia

August 29, 2024

Speaker Pelosi slams California AI bill headed to Governor Newsom as ‘ignorant’

August 29, 2024
Most Popular

Will 5G make 2024 the most connected year in the industry?

December 1, 2023

The current state of 5G in the US and how it can improve

September 28, 2023

How 5G technology will transform gaming on the go

January 31, 2024
© 2025 5gantennas. Designed by 5gantennas.
  • Home
  • About us
  • Contact us
  • DMCA
  • Privacy Policy
  • About Creator

Type above and press Enter to search. Press Esc to cancel.