What is Large Scale Generative AI?
Whether you're dealing with large language models or seeking efficient
ways to handle high request volumes, you need to know how to manage and
optimize your AI infrastructure.
Join Aaron Baughman as he explores advanced strategies for scaling
generative AI algorithms across GPUs. Aaron covers batch-based and
cache-based systems, agentic architectures, and model distillation
techniques, and explains how you can use these methods to optimize
performance, reduce latency, and enhance personalization in AI
applications.