VeloxCon 2024, the premier developer convention that’s devoted to the Velox open-source undertaking, introduced collectively trade leaders, engineers, and lovers to discover the most recent developments and collaborative efforts shaping the way forward for information administration. Hosted by IBM® in partnership with Meta, VeloxCon showcased the most recent innovation in Velox together with undertaking roadmap, Prestissimo (Presto-on-Velox), Gluten (Spark-on-Velox), {hardware} acceleration, and far more.
An summary of Velox
Velox is a unified execution engine that’s constructed and open-sourced by Meta, aimed toward accelerating information administration techniques and streamlining their growth. One of many greatest advantages of Velox is that it consolidates and unifies information administration techniques so that you don’t have to hold rewriting the engine. Immediately Velox is in numerous levels of integration with a number of information techniques together with Presto (Prestissimo), Spark (Gluten), PyTorch (TorchArrow), and Apache Arrow. You’ll be able to learn extra about why Velox was inbuilt Meta’s engineering weblog.
Velox at IBM
Presto is the engine for watsonx.information, IBM’s open information lakehouse platform. During the last yr, we’ve been working arduous on advancing Velox for Presto – Prestissimo – at IBM. Presto Java staff are being changed by a C++ course of based mostly on Velox. We now have a number of committers to the Prestissimo undertaking and proceed to companion intently with Meta as we work on constructing Presto 2.0.
Among the key advantages of Prestissimo embody:
Hugh efficiency increase: question processing will be completed with a lot smaller clusters
No efficiency cliffs: no Java processes, JVM, or rubbish collections, as reminiscence arbitration improves effectivity
Simpler to construct and function at scale: Velox offers you reusable and extensible primitives throughout information engines (like Spark)
This yr, we plan to do much more with Prestissimo together with:
The Iceberg reader
Manufacturing readiness (metrics assortment with Prometheus)
New Velox system implementation
TPC-DS benchmark runs
VeloxCon 2024
We labored intently with Meta to prepare VeloxCon 2024, and it was a improbable neighborhood occasion. We heard audio system from Meta, IBM, Pinterest, Intel, Microsoft, and others share what they’re engaged on and their imaginative and prescient for Velox over two dynamic days.
Day 1 highlights
The convention kicked off with classes from Meta together with Amit Purohit reaffirming Meta’s dedication to open supply and neighborhood collaboration. Pedro Pedreira, alongside Manos Karpathiotakis and Deblina Gupta, delved into the idea of composability in information administration, showcasing Velox’s versatility and its alignment with Arrow.
Amit Dutta of Meta explored Prestissimo’s batch effectivity at Meta, shedding mild on the developments made in optimizing information processing workflows. Remus Lazar, VP Knowledge & AI Software program at IBM introduced Velox’s journey inside IBM and imaginative and prescient for its future. Aditi Pandit of IBM adopted with insights into Prestissimo’s integration at IBM, highlighting function enhancements and future plans.
The afternoon classes have been equally insightful, with Jimmy Lu of Meta unveiling the most recent optimizations and options in Velox. Whereas Binwei Yang of Intel mentioned the mixing of Velox with the Apache Gluten undertaking, emphasizing its world influence. Engineers from Pinterest and Microsoft shared their experiences of unlocking information question efficiency through the use of Velox and Gluten, showcasing tangible efficiency positive factors.
The day concluded with classes from Meta on Velox’s reminiscence administration by Xiaoxuan Meng and a glimpse into the brand new easy aggregation operate interface that was introduced by Wei He.
Day 2 highlights
The second day started with a keynote from Orri Erling, co-creator of Velox. He shared insights into Velox Wave and Accelerators, showcasing its potential for acceleration. Krishna Maheshwari from NeuroBlade highlighted their collaboration with the Velox neighborhood, introducing NeuroBlade’s SPU (SQL Processing Unit) and its transformative influence on Velox’s computational velocity and effectivity.
Sergei Lewis from Rivos explored the potential of offloading work to accelerators to boost Velox’s pipeline efficiency. William Malpica and Amin Aramoon from Voltron Knowledge launched Theseus, a composable, scalable, distributed information analytics engine, utilizing Velox as a CPU backend.
Yoav Helfman from Meta unveiled Nimble, a cutting-edge columnar file format that’s designed to boost information storage and retrieval. Pedro Pedreira and Sridhar Anumandla from Meta elaborated on Velox’s new technical governance mannequin, emphasizing its significance in guiding the undertaking’s growth sustainability.
The day additionally featured classes on Velox’s I/O optimizations by Deepak Majeti from IBM, methods for safeguarding in opposition to Out-Of-Reminiscence (OOM) kills by Vikram Joshi from ComputeAI, and a hands-on demo on debugging Velox functions by Deepak Majeti.
What’s subsequent with Velox
VeloxCon 2024 was a testomony to the colourful ecosystem surrounding the Velox undertaking, showcasing groundbreaking improvements and fostering collaboration amongst trade leaders and builders alike. The convention supplied attendees with beneficial insights, sensible data, and networking alternatives, solidifying Velox’s place as a number one open supply undertaking within the information administration ecosystem.
In case you’re fascinated by studying extra and becoming a member of the Velox neighborhood, listed here are some assets to get began:
Keep tuned for extra updates and developments from the Velox neighborhood, as we proceed to push the boundaries of knowledge administration and speed up innovation collectively.
Strive Presto with a free trial of watsonx.information
Was this text useful?
SureNo