The Open Neural Network Exchange (ONNX) provides a standardized format for machine learning models, enabling seamless deployment across various platforms and systems.
While Large Language Models (LLMs) are typically built in Python with frameworks such as PyTorch or TensorFlow before being exported to ONNX for inference, this session challenges that norm by showcasing Java's potential in AI model creation.
Discover how to generate ONNX models directly in Java, a language rarely linked to AI development, using Project Babylon's innovative code reflection features.
Through a practical example, we'll build a generative AI LLM in Java, walk through its conversion to ONNX format, and demonstrate efficient execution, bridging the gap between Java ecosystem and cutting edge AI workflow.
Type: Learning Session (50 min)
Track: Machine Learning and Artificial Intelligence
Audience Level: Expert
Speaker: Adam Sotona