AI Foundation Models
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/ngc-llm-featured-960x540.jpg)
Jul 25, 2024
Revolutionizing Code Completion with Codestral Mamba, the Next-Gen Coding LLM
In the rapidly evolving field of generative AI, coding models have become indispensable tools for developers, enhancing productivity and precision in software...
5 MIN READ
![Decorative image of a llama in cool sunglasses against a sunny landscape.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/llama-sunglasses-featured-960x540.jpg)
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
Meta's Llama collection of large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
![Illustration representing Phi-3-Medium.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/Phi-3-Medium-960x540.png)
Jul 02, 2024
Phi-3-Medium: Now Available on the NVIDIA API Catalog
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/07/abstract-lines-1-960x540.jpg)
Jul 01, 2024
StarCoder2-15B: A Powerful LLM for Code Generation, Summarization, and Documentation
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/ngc-press-gemma-2-model-1920x10801-1-960x540.jpg)
Jul 01, 2024
Google's New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/abstract-chart-960x540.jpg)
Jun 28, 2024
Transforming Financial Analysis with NVIDIA NIM
In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
13 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/Llama3-NVIDIA-API-catalog-960x540.jpg)
Jun 26, 2024
Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA
Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/medical-imaging-sdg-1-960x540.jpg)
Jun 24, 2024
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/Mistral-Codestral-GenAI-model-NVIDIA-960x540.jpg)
Jun 17, 2024
Simplify and Accelerate Programming Tasks with Mistral’s Codestral GenAI Model
Experience Codestral, packaged as an NVIDIA NIM inference microservice for code completion, writing tests, and debugging in over 80 languages using the NVIDIA...
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/SDXL_Lightning_quicklink-960x540.jpeg)
Jun 10, 2024
Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog
Create high-resolution images with remarkable efficiency with the Advanced text-to-image generation model, SDXL-Lightning, available and optimized now on the...
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/Solar_Quicklink-960x540.jpeg)
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/Breeze_API-NVIDIA-featured-960x540.jpg)
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/06/BGE_APICatalog-NVIDIA-featured-960x540.jpg)
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
![](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/05/NVIDIA-CodeGemma-foundation-model-960x540.jpeg)
May 30, 2024
Convert Natural Language to Code with CodeGemma
Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
![Stylized image of a smartphone chat with a young woman smiling off to one side.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/05/Gipi-TechBlog-Featured-Image-960x540.png)
May 30, 2024
Personalized Learning with Gipi, NVIDIA TensortRT-LLM, and AI Foundation Models
Over 1.2B people are actively learning new languages, with over 500M learners on digital learning platforms such as Duolingo. At the same time, a significant...
6 MIN READ
![Decorative image of overlapping spheres.](https://cdn.statically.io/img/developer-blogs.nvidia.com/wp-content/uploads/2024/05/phi3-llm-featured-960x540.png)
May 28, 2024
Create Content, Conversations, and Code with New Phi-3 and Granite Code Model Families
Generative AI is revolutionizing virtually every use case across every industry, thanks to the constant influx of groundbreaking foundation models capable of...
3 MIN READ