Add a new backend: Bolt #10929
Replies: 13 comments 21 replies
-
|
Thank you to start the discussion, @WangGuangxin. For short term experiment, can we simply replace the VELOX_REPO to enable Bolt? Do we need to change any Gluten code? In long term, we may considered to add a new backend. |
Beta Was this translation helpful? Give feedback.
-
|
@WangGuangxin Great work! What about if we add streaming computation extension to your Bolt? since we are doing gluten on Flink project and it's easyer work with you guys than velox team. |
Beta Was this translation helpful? Give feedback.
-
|
@WangGuangxin Do you have any plan to release |
Beta Was this translation helpful? Give feedback.
-
|
@WangGuangxin also curious to see if the new Bolt backend is compatible with Apache Celeborn. I noticed you mentioned that bolt now handles the shuffle instead of gluten for more parallel processing. |
Beta Was this translation helpful? Give feedback.
-
|
@lgbo-ustc @afterincomparableyum Yes, Bolt source code, as well as the PR on Gluten integration, will be open source before Dec 6th(if not earlier). I will share the links here once it is ready. It is truly amazing to receive overwhelming interests and we would appreciate the test & feedback from the community to make Bolt better. |
Beta Was this translation helpful? Give feedback.
-
|
@WangGuangxin and @frankobe Thank you for this proposal. The community welcome this proposal and aprreciate ByteDance's interest in contributing Bolt to Apache Gluten. We're open to discussing the addition of a new backend. To move forward, please highlight the key differentiations between Bolt and Velox - both in architecture and performance, so the community understands the unique value Bolt brings. Official support for important feature like to add a new backend requires a formal community voting process, as this involves significant design and maintenance effort. Compliance with Apache licensing and IP clearance is essential, given Bolt's origin as a Velox fork. I would encourage sharing additional data for Bolt, including the source code for review, performance benchmarks, and the roadmap for ongoing maintenance to ensure long-term sustainability. |
Beta Was this translation helpful? Give feedback.
-
|
Bolt is open sourced: https://github.com/bytedance/bolt Feedbacks are welcomed! |
Beta Was this translation helpful? Give feedback.
-
|
@liuneng1994 Thanks for sharing your concerns in thread https://lists.apache.org/thread/9ry0jjydsvztnrosymlwzspdv4hdsvp1, I would like to justify it by providing more context
Bolt has proved enterprise-grade maturity through massive internal adoptions within ByteDance on multiple product lines. The ecosystem is strengthening through direct & transparent collaboration with not only Gluten project, but also with OpenSearch / Flink / Paimon / Celeborn etc. The maturity of project is the joint goal between Gluten & Bolt community, instead of the prerequisite on the code merge. Following the project history of Gazelle/Gluten, I witness how Gluten community pioneers to adopt existing backends at early, inmature stage and foster Flink & GPU support even nowadays so I would appreciate the similar standard on the Bolt backend.
Bolt is open to accepts commits from Gluten commits. Internally, every Bolt commit is checked against all DBMS integration including Spark-on-Gluten to ensure the stability. Currently Gluten
Similar to existing maintenance model on backends-clickhouse, Github Actions CI on Bolt backend will run on additional compute resources provided by Bolt project to ensure the task bandwidth. To further enhance the transparency, run history and logs are available for public access. You can check the health on existing Bolt Github action run. On the top of CI resource, as stated in the voting message, Bolt community will assign 3 dedicated members to maintain the stability of Gluten & Bolt integration.
Bolt deeply values the "Apache Way" on community governance since day 1. We are in the process of submitting a proposal to ASF incubator. |
Beta Was this translation helpful? Give feedback.
-
|
@yaooqinn Thanks for sharing the sincere concerns in thread https://lists.apache.org/thread/d4oq3oydrzcndyphvfh3gnr6v08jxvp9. The discussion on Bolt backed is opened on Oct 23, 2025 but I would love to provide more context just in case
Thanks for the correction. We will follow your recommendations in the next round of voting.
Both projects are following Apache 2.0 license, Bolt's dependency is declared in NOTICE.txt. We are here to address any specific legal concerns, though there are no known issues to our knowledge.
Bolt backend PR only fork and modify the existing
Bolt is open to accepts commits from Gluten commits. Internally, every Bolt commit is checked against all DBMS integration including Spark-on-Gluten to ensure the stability. Currently Gluten backends-velox depends on
Assuming you refer to "vendor-neutral", Bolt project is in the process of submitting a proposal to ASF incubator, targeting 26Q1. We deeply value and encourage contributions from Gluten community to build the future of native engine acceleration. The merge of Bolt backend to the main branch is a foundational step to expose Spark-on-Bolt capability for community adoption which in return pushes the project to be "vendor-neutral". |
Beta Was this translation helpful? Give feedback.
-
|
+1 The essence of open source is "all for one and one for all." The reason Gluten has reached its current level of maturity and widespread adoption is precisely because our countless users have resolved numerous bugs during their own deployments, preventing those who follow from repeating the same mistakes. However, the Velox community has consistently struggled with excessively long PR review times, or even a complete lack of reviews. The latest example is Issue #11534, which could have saved us from a massive amount of redundant testing recently. While we could cherry-pick these PRs into ibm/velox (or your specific fork), the current rebase effort is already becoming overwhelming. Consequently, we have had to create an issue to track unmerged PRs from the Gluten community so that customers can selectively pick them as needed. Since I began testing Bolt, I can confirm that everything mentioned by @frankobe is true. Introducing a new, more open backend will only benefit the long-term development of Gluten. |
Beta Was this translation helpful? Give feedback.
-
|
I suggest we postpone the discussion of adding Bolt as a new backend until Gluten itself graduates from the incubator. I have discussed this privately with @WangGuangxin as well. The primary reason is to minimize potential risks and uncertainties during Gluten's critical incubation phase. Regarding the ecosystem, given the issues we've seen with Velox, I strongly hope Bolt can eventually join the Apache Software Foundation or the Linux Foundation. The recent changes in the companies of Gluten maintainers serve as a perfect example: precisely because Gluten is an Apache project with clear governance, these personnel changes did not introduce significant risks to the project's continuity. I believe Bolt following a similar governance model would be crucial for its long-term health and integration. |
Beta Was this translation helpful? Give feedback.
-
|
+1 thanks for the proposal bolt definitely looks promising. It would be great to have these details into a Proposal doc which can outline some of the technical integration with Gluten, and get it reviewed from the community in depth. PR review times and hard to get reviews is a long standing problem, hopefully issues can be addressed |
Beta Was this translation helpful? Give feedback.
-
|
Guys, one thing to clarify and reminder, Bytedance is the first round company who joint Gluten community since we started. There are several Gluten committers from Bytedance and they also contributed much to our community. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone,
Bolt is a velox fork within ByteDance. By integrating with our production environment, Bolt has effectively addressed numerous stability challenges—including off-heap OOM and core dump issues—while also delivering significant performance optimizations, such as those achieved through LLVM-based JIT compilation.
We are now planning to open source Bolt and enable the broader community to use it through Apache Gluten.
In this thread, I’d like to start a discussion with you on how we can better leverage Bolt within the Gluten ecosystem. Any questions, and use cases are highly welcome!
More detailed technical information will be shared soon. Looking forward to your thoughts!
cc @FelixYBW @frankobe @guhaiyan0221
Beta Was this translation helpful? Give feedback.
All reactions