
feat(autoware_lidar_bevfusion): implementation of bevfusion using tensorrt #10024

Open
wants to merge 19 commits into main

Conversation

knzo25
Contributor

@knzo25 knzo25 commented Jan 27, 2025

Description

This PR introduces BEVFusion to Autoware using TensorRT.
I would like to ask reviewers to leave the "integration" into the pipeline/launchers for a later PR 🙏

Related links

Parent Issue:

  • Link

How was this PR tested?

Notes for reviewers

The onnx files can be found here: TIER IV INTERNAL LINK. The models will be uploaded to a public link as the last part of the review (we are currently facing issues regarding the best way to distribute them without affecting CI/CD and image sizes...).

Since this package introduces early fusion, it cannot be directly integrated into Autoware (the lidar-only model can). Such integration should be deferred to the next PR to avoid unnecessarily increasing the number of stakeholders on this PR.

To test the PR, I recommend using the taxi project (I will omit the launch command) and launching bevfusion separately:

ros2 launch autoware_lidar_bevfusion lidar_bevfusion.launch.xml

For now, the models must be placed in the config folder, and the modality (the default is camera-lidar) can be changed by modifying the yaml file. These are the yaml parameters needed for the lidar-only model (a sketch of the camera-lidar overrides follows the block):

/**:
  ros__parameters:
    # modality
    sensor_fusion: false
    # non-network params
    max_camera_lidar_delay: 0.12
    # plugins
    plugins_path: $(find-pkg-share autoware_lidar_bevfusion)/plugins/libautoware_tensorrt_plugins.so
    # network
    trt_precision: fp16
    cloud_capacity: 2000000
    onnx_path: "$(var model_path)/bevfusion_lidar_v2.onnx"
    engine_path: "$(var model_path)/bevfusion_lidar_v2.engine"
    # pre-process params
    densification_num_past_frames: 0
    densification_world_frame_id: map
    # post-process params
    circle_nms_dist_threshold: 0.5
    iou_nms_target_class_names: ["CAR"]
    iou_nms_search_distance_2d: 10.0
    iou_nms_threshold: 0.1
    yaw_norm_thresholds: [0.3, 0.3, 0.3, 0.3, 0.0] # refers to the class_names
    score_threshold: 0.1
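
For the camera-lidar modality, a minimal sketch of the overrides would look like the following; only the fusion flag and the model paths change, and the file names here are placeholders to be replaced with the actual distributed camera-lidar onnx/engine files:

/**:
  ros__parameters:
    # modality: enable camera-lidar early fusion
    sensor_fusion: true
    # placeholder file names, replace with the distributed camera-lidar models
    onnx_path: "$(var model_path)/bevfusion_camera_lidar.onnx"
    engine_path: "$(var model_path)/bevfusion_camera_lidar.engine"

The remaining parameters can stay as in the lidar-only example above.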

Interface changes

None.

Effects on system behavior

None.

@knzo25 knzo25 requested a review from scepter914 January 27, 2025 05:14
@knzo25 knzo25 self-assigned this Jan 27, 2025
@github-actions github-actions bot added component:perception Advanced sensor data processing and environment understanding. (auto-assigned) component:sensing Data acquisition from sensors, drivers, preprocessing. (auto-assigned) labels Jan 27, 2025

github-actions bot commented Jan 27, 2025

Thank you for contributing to the Autoware project!

🚧 If your pull request is in progress, switch it to draft mode.

Please ensure:

@xmfcx
Contributor

xmfcx commented Jan 27, 2025

@knzo25
Contributor Author

knzo25 commented Jan 27, 2025

@xmfcx
Although both use a view transform, they are different papers and modalities:
https://arxiv.org/pdf/2112.11790
https://arxiv.org/pdf/2205.13542

@knzo25
Contributor Author

knzo25 commented Feb 21, 2025

As TensorRT was upgraded and spconv was added (autowarefoundation/autoware#5794), I will be opening this PR 🎉

@knzo25 knzo25 marked this pull request as ready for review February 21, 2025 05:01
@knzo25 knzo25 added the run:build-and-test-differential Mark to enable build-and-test-differential workflow. (used-by-ci) label Feb 21, 2025
@knzo25 knzo25 requested a review from kminoda as a code owner February 25, 2025 01:26
@knzo25 knzo25 removed the request for review from kminoda February 25, 2025 01:28
@github-actions github-actions bot added the type:documentation Creating or refining documentation. (auto-assigned) label Feb 25, 2025

codecov bot commented Feb 26, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 26.26%. Comparing base (c3134c2) to head (fb1ac42).

Additional details and impacted files
@@            Coverage Diff             @@
##             main   #10024      +/-   ##
==========================================
+ Coverage   26.24%   26.26%   +0.02%     
==========================================
  Files        1378     1378              
  Lines      107445   107468      +23     
  Branches    41428    41433       +5     
==========================================
+ Hits        28194    28222      +28     
+ Misses      76433    76425       -8     
- Partials     2818     2821       +3     
Flag                Coverage Δ               *Carryforward flag
differential        3.32% <ø> (?)
differential-cuda   2.23% <ø> (?)
total               26.26% <ø> (+0.02%) ⬆️   Carried forward from e00b2af

*This pull request uses carry forward flags.


@knzo25
Contributor Author

knzo25 commented Mar 11, 2025

@amadeuszsz
As discussed internally, I pasted the links and the instructions on how to execute the model. Please let me know if you need anything 🙏

@freejumperd

@knzo25 thank you for this great work. If I may ask a quick question: in terms of BEVFusion inference through TensorRT, what's the main difference between the version under your development (https://github.com/knzo25/autoware.universe/tree/feat/bevfusion/perception/autoware_lidar_bevfusion)
and the official NVIDIA AI IOT one
(https://github.com/NVIDIA-AI-IOT/Lidar_AI_Solution/tree/master/CUDA-BEVFusion/src/bevfusion)?
And I assume there is no model modification between the Autoware version and the original MIT one? Please suggest. Thank you!

@knzo25
Contributor Author

knzo25 commented Mar 16, 2025

@freejumperd

The main difference with NVIDIA AI IOT's implementation is that they use a closed-source shared library for inference (sparse convolutions), even if it is a public binary. Another PR with that implementation may be sent later by another group of contributors, as far as I understand. A big difference in terms of development and deployment of models is that this PR directly handles models generated by our ml stack (https://github.com/tier4/AWML/tree/main/projects/BEVFusion).

With respect to the original implementation, those questions may be better suited for our ml stack rather than the inference node here. But in a few words, the lidar model more or less remains the same, although we have evaluated bigger models for offline purposes (we have also tweaked some minor things that have increased performance). The camera-lidar model was too focused on nuScenes, so there have been a few other developments as well, but nothing that big.

@freejumperd

@knzo25 thanks for the active response, knzo. So in short, just like the centerpoint inference node already published through autoware universe, the actual inference cpp and cu code has been modified and optimised by you and the rest of the community? The NVIDIA one may be suitable for the exact original model trained on the nuScenes dataset, but certainly not as efficient for an Autoware-retrained centerpoint or bevfusion, should we say? And when would you expect this bevfusion trt inference + ros2 node to be available open source?

@knzo25
Contributor Author

knzo25 commented Mar 16, 2025

@freejumperd
More information about nvidia's implementation vs. this approach was presented in a previous issue (I do not have the link now though). I would separate nvidia's inference code from the original MIT implementation though. I just meant that the original config files and some of the processing in the camera-lidar pipeline do not work well with our vehicle's config.

As for when this will be available open source: in a way, it already is, since you can use this branch under the Apache license. As for when this will be merged, it is really up to the reviewers. If you want to play a part in that, we would really appreciate it!

@freejumperd

@knzo25 thanks for explaining. I am certainly interested and plan to go through the recently published AWML pipeline and then run inference tests to check FPS etc.
One last thing if I may ask here (more relevant to the model side than trt). I was playing with the original MIT model + weights but with our own dataset and a modified data processing pipeline, without touching the model architecture (different number of cameras, different intrinsic + extrinsic 6DOF, different model of top lidar etc.), and the zero-shot performance was extremely bad, almost no detections at all. So I wonder, for the AWML version trained with the T4 + nuScenes dataset (e.g. both with the same setup of 6 cameras + 1 top lidar), if I again test with our own dataset/pipeline (e.g. 7 cameras, 2 lidars etc.) using the available pth (without touching the model), would you expect the AWML bevfusion model zoo to provide good generalization ability? Or if e.g. the sensor suites are different (number of cameras/lidars, 6DOF etc.), would retraining the model with our own dataset be a must? If so, is the best way to use the open pth then to make our vehicle sensor suite as close as possible to nuScenes / the T4 taxi?

@knzo25
Contributor Author

knzo25 commented Mar 16, 2025

@freejumperd
(Current) sensor fusion models, especially early fusion ones, are extremely dependent on the vehicle configuration. That is especially true for BEVFusion since it unprojects points from the camera to the BEV space. That being said, in my experience, for this method the image features do not have so strong an impact that there are no detections at all.
If I had to say, I would suspect an error in your pipeline (beware of intensity profiles and so on). Further support would require seeing the data and formal consulting hours (if our institutions have an agreement, we can continue this conversation on other channels).
Regarding whether retraining is a must, it really depends on how you handle the data pipeline to reduce the mismatches (to some degree this can be done), but since we are not really training a foundation model here, I would always recommend fine-tuning or retraining.

@freejumperd

@knzo25 thanks again, what's been discussed is really helpful! Let me leverage the AWML pipeline first to revise our own data pipeline. I will come back to you later for any potential collaboration on perception topics.

@knzo25
Contributor Author

knzo25 commented Mar 16, 2025

@freejumperd
Ahh, I just forgot something about the open source's repository implementation of BEVFusion. While the repository was finally open-sourced, the final pipeline for the camera-lidar model and export logic have not been updated (was developing in a branch on the private repo). I should send a PR soon-ish.

@freejumperd

@knzo25 amazing! You guys have really done some great work promoting such an advanced early fusion architecture for the open source community. Really looking forward to learning more from it. And also looking forward to any upcoming work leveraging VLMs 😉
