feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454

knzo25 · 2024-11-25T08:31:06Z

Description

This PR is part of a series of PRs that aim to accelerate the Sensing/Perception pipeline through an appropriate use of CUDA.

List of PRs:

feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454 (pointcloud preprocessing)
feat(autoware_pointcloud_preprocessor): cuda-accelerated pointcloud concatenation #9455 (concatenation)
feat(autoware_lidar_centerpoint): added the cuda_blackboard to centerpoint #9453 (centerpoint)
transfusion (TODO)
feat(autoware_probabilistic_occupancy_grid_map): cuda accelerated implementation #9542 (OGM - the first implementation will be independent of the blackboard to ease the transition)
feat: acceleration and transport layer tier4/aip_launcher#348 (aip_launcher)
feat: acceleration and transport layer sample_sensor_kit_launch#111 (sample_sensor_kit_launch)

To use these branches, the following additions to the autoware.repos are necessary:

  vendor/cuda_blackboard:
    type: git
    url: git@github.com:knzo25/cuda_blackboard.git
    version: main
  vendor/negotiated:
    type: git
    url: https://github.com/osrf/negotiated.git
    version: master

Depending on your machine and how many nodes are in a container, the following branch may also be required:
https://github.com/knzo25/launch_ros/tree/fix/load_composable_node
There seems to be a but in ROS where if you send too many services at once some will be lost and ros_launch can not handle that.

How was this PR tested?

The sensing/perception pipeline was tested until centerpoint for TIER IV's taxi using the logging simulator.
The following tests were executed in a laptop equipped with a RTX 4060 (laptop) GPU and a Intel(R) Core(TM) Ultra 7 165H (22 cores)

Node / processing time [ms]	Current	PR
/sensing/lidar/top/crop_box_filter_self/debug/processing_time_ms	5.81	N/A
/sensing/lidar/top/crop_box_filter_mirror/debug/processing_time_ms	4.59	N/A
/sensing/lidar/top/distortion_corrector/debug/processing_time_ms	10.96	N/A
/sensing/lidar/top/ring_outlier_filter/debug/processing_time_ms	10.69	N/A
/sensing/lidar/top/cuda_pointcloud_preprocessor/debug/processing_time_ms	N/A	3.08 (2.03 are H->D copies)
/sensing/lidar/concatenate_data_synchronizer/debug/processing_time_ms	7.83	0.70
Total	38.8	3.78

10.26 speedup!

Notes for reviewers

The main branch that I used for development is feat/cuda_acceleration_and_transport_layer.
However, the changes were too big so I split the PRs. That being said, development, if any will still be on that branch (and then cherrypicked to the respective PRs), and the review changes will be cherrypicked into the development branch.

Interface changes

An additional topic is added to perform type negotiation:
Example: input/pointcloud -> input/pointcloud and input/pointcloud/cuda

Effects on system behavior

Enabling this preprocessing in the launchers should provide a much reduced latency and cpu usage (at the cost of a higher GPU usage)

…sonal repository Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

github-actions · 2024-11-25T08:31:26Z

Thank you for contributing to the Autoware project!

🚧 If your pull request is in progress, switch it to draft mode.

Please ensure:

You've checked our contribution guidelines.
Your PR follows our pull request guidelines.
All required CI checks pass before marking the PR ready for review.

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

…pointcloud changes after the first iteration Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

…ntcloud_preprocessing

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

mojomex

Thank you for the amazing PR, these performance improvements are desperately needed.

I haven't checked the PR for functionality yet, but I'll leave my first round of comments here.

The main points I'd like to address are

memory safety and idiomatic C++ (there is currently a lot of raw-pointer code which should be avoided whenever possible)
modulatiry: currently the pipeline is hard-coded and all in one place. This makes the module hard to adapt to different projects, and hard to maintain individual modules in the pipeline.

Thank you for your time!

sensing/autoware_cuda_pointcloud_preprocessor/README.md

sensing/autoware_cuda_pointcloud_preprocessor/config/cuda_pointcloud_preprocessor.param.yaml

sensing/autoware_cuda_pointcloud_preprocessor/README.md

sensing/autoware_cuda_pointcloud_preprocessor/docs/cuda-pointcloud-preprocessor.md

...reprocessor/src/cuda_organized_pointcloud_adapter/cuda_organized_pointcloud_adapter_node.cpp

...uda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu

manato

@knzo25
Thank you very much for proposing a fantastic PR, and I'm sorry for taking a long time for the review. From a viewpoint of CUDA usage, I left some comments. I'd appreciate it if you could consider them.

manato · 2024-12-26T13:19:45Z

...uda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu

+}
+
+__global__ void transformPointsKernel(
+  const InputPointType * input_points, InputPointType * output_points, int num_points,


Suggested change

const InputPointType * input_points, InputPointType * output_points, int num_points,

const InputPointType * __restrict__ input_points, InputPointType * output_points, int num_points,

To enable "read-only data cache", I would suggest using __restrict__ for read-only input array. This suggestion can be applied to the other kernel arguments.

@knzo25 could you please double check all input arrays across kernels? I think for some of them __restrict__ keyword might be also applicable. I don't know if you missed it or skipped on purpose.
I unresolved this conversation, please resolve it again after reading this comment 🙏🏻

@amadeuszsz
Ahh now I can reply here. Don't know why but before I could not

Answer: the extract kernel could indeed use restrict. The kernel alone, strictly speaking can not, but due to how the indexes are computed there is no problem

Addressed in 68d1e42

...uda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu

…loud-preprocessor.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com>

…oud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com>

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 · 2025-03-06T08:25:42Z

@amadeuszsz

@knzo25 could you please double check all input arrays across kernels? I think for some of them restrict keyword might be also applicable. I don't know if you missed it or skipped on purpose.
I unresolved this conversation, please resolve it again after reading this comment 🙏🏻

Could not answer this one, but the extract kernel could indeed use restrict. The kernel alone, strictly speaking can not, but due to how the indexes are computed there is no problem

knzo25 · 2025-03-06T08:26:51Z

@amadeuszsz
Regarding CI/CD, I have not executed in ob purpose since the blackboard is not yet added to autoware.
I will investigate the loop error tomorrow

knzo25 · 2025-03-10T07:32:52Z

terminate called after throwing an instance of 'std::system_error'
what(): Invalid argument
Aborted (core dumped)
It happens when timer is restarted (--loop) and only if output pointcloud from autoware_cuda_pointcloud_preprocessor is visualized. Could you please check if it happens to you too? If so, I would appreciate if you can investigate this issue 🙏🏻

Sorry, I only experienced it once in several experiments, and the pointcloud itself at the time was valid (checked with ros2 topic echo)

sensing/autoware_cuda_pointcloud_preprocessor/config/cuda_pointcloud_preprocessor.param.yaml

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

amadeuszsz

LGTM!

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

codecov · 2025-03-18T09:48:38Z

Codecov Report

Attention: Patch coverage is 0% with 469 lines in your changes missing coverage. Please review.

Project coverage is 26.23%. Comparing base (7686e5a) to head (2a1ff8c).
Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
...cloud_preprocessor/cuda_pointcloud_preprocessor.cu	0.00%	165 Missing ⚠️
...preprocessor/cuda_pointcloud_preprocessor_node.cpp	0.00%	124 Missing ⚠️
.../cuda_pointcloud_preprocessor/undistort_kernels.cu	0.00%	89 Missing ⚠️
...e/autoware/cuda_pointcloud_preprocessor/memory.hpp	0.00%	43 Missing ⚠️
...src/cuda_pointcloud_preprocessor/common_kernels.cu	0.00%	24 Missing ⚠️
...c/cuda_pointcloud_preprocessor/organize_kernels.cu	0.00%	17 Missing ⚠️
...rc/cuda_pointcloud_preprocessor/outlier_kernels.cu	0.00%	6 Missing ⚠️
...loud_preprocessor/cuda_pointcloud_preprocessor.hpp	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #9454      +/-   ##
==========================================
+ Coverage   26.05%   26.23%   +0.18%     
==========================================
  Files        1374     1387      +13     
  Lines      106351   107189     +838     
  Branches    40877    41227     +350     
==========================================
+ Hits        27709    28124     +415     
- Misses      75940    75996      +56     
- Partials     2702     3069     +367

Flag	Coverage Δ		*Carryforward flag
differential-cuda	`0.00% <0.00%> (?)`
total	`26.38% <ø> (+0.33%)`	⬆️	Carriedforward from 368009e

*This pull request uses carry forward flags. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

Changes had already been addressed

…cloud preprocessor (autowarefoundation#9454) * feat: moved the cuda pointcloud preprocessor and organized from a personal repository Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: fixed incorrect links Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: fixed dead links pt2 Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: fixed spelling errors Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: json schema fixes Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed comments and filled the fields Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * fix: fixed the adapter for the case when the number of points in the pointcloud changes after the first iteration Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * feat: used the cuda host allocators for aster host to device copies Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * Update sensing/autoware_cuda_pointcloud_preprocessor/docs/cuda-pointcloud-preprocessor.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> * Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com> * Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com> * style(pre-commit): autofix * Update sensing/autoware_cuda_pointcloud_preprocessor/docs/cuda-pointcloud-preprocessor.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> * Update sensing/autoware_cuda_pointcloud_preprocessor/README.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> * Update sensing/autoware_cuda_pointcloud_preprocessor/README.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> * Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> * style(pre-commit): autofix * Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com> * style(pre-commit): autofix * Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcloud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com> * style(pre-commit): autofix * chore: fixed code compilation to reflect Hirabayashi-san's memory pool proposal Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * feat: generalized the number of crop boxes. For two at least, the new approach is actually faster Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: updated config, schema, and handled the null case in a specialized way Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * feat: moving the pointcloud organization into gpu Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * feat: reimplemented the organized pointcloud adapter in cuda. the only bottleneck is the H->D copy Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed redundant ternay operator Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: added a temporary memory check. the check will be unified in a later PR Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: refactored the structure to avoid large files Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: updated the copyright year Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * fix: fixed a bug in the undistortion kernel setup. validated it comparing it with the baseline Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed unused packages Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed mentions of the removed adapter Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: fixed missing autoware prefix Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * fix: missing assignment in else branch Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: added cuda/nvcc debug flags on debug builds Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: refactored parameters for the undistortion settings Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed unused headers Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: changed default crop box to no filtering at all Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * feat: added missing restrict keyword Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: spells Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: removed default destructor Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: ocd activated (spelling) Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: fixed the schema Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: improved readibility Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: added dummy crop box Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: added new repositories to ansible Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: CI/CD Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: more CI/CD Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: mode CI/CD. some linters are conflicting Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * style(pre-commit): autofix * chore: ignoring uncrustify Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: ignoring more uncrustify Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: missed one more uncrustify exception Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> * chore: added meta dep Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> --------- Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp> Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com> Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Amadeusz Szymko <amadeusz.szymko.2@tier4.jp>

feat: moved the cuda pointcloud preprocessor and organized from a per…

af2e884

…sonal repository Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

github-actions bot added type:documentation Creating or refining documentation. (auto-assigned) component:sensing Data acquisition from sensors, drivers, preprocessing. (auto-assigned) tag:require-cuda-build-and-test labels Nov 25, 2024

knzo25 self-assigned this Nov 25, 2024

knzo25 added 4 commits November 25, 2024 18:06

chore: fixed incorrect links

774e099

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: fixed dead links pt2

be04f76

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: fixed spelling errors

db02ec7

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: json schema fixes

5218a4a

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

This was referenced Nov 25, 2024

feat(autoware_lidar_centerpoint): added the cuda_blackboard to centerpoint #9453

Open

feat(autoware_pointcloud_preprocessor): cuda-accelerated pointcloud concatenation #9455

Closed

feat: acceleration and transport layer tier4/aip_launcher#348

Open

knzo25 marked this pull request as ready for review November 25, 2024 09:47

yukkysaito requested review from YoshiRi and drwnz November 26, 2024 01:22

knzo25 added 2 commits November 26, 2024 13:40

chore: removed comments and filled the fields

4a9daaf

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

fix: fixed the adapter for the case when the number of points in the …

84a4b9f

…pointcloud changes after the first iteration Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 mentioned this pull request Nov 26, 2024

feat: acceleration and transport layer autowarefoundation/sample_sensor_kit_launch#111

Draft

knzo25 requested review from manato, mojomex and amadeuszsz November 26, 2024 05:38

knzo25 added 2 commits December 23, 2024 14:12

Merge remote-tracking branch 'awf/main' into feat/cuda_blackboard_poi…

1e27534

…ntcloud_preprocessing

feat: used the cuda host allocators for aster host to device copies

b3c1d72

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 mentioned this pull request Dec 23, 2024

Leverage cuda acceleration in the sensing perception pipeline #9722

Open

3 tasks

technolojin self-requested a review December 24, 2024 01:22

mojomex requested changes Dec 24, 2024

View reviewed changes

manato reviewed Jan 7, 2025

View reviewed changes

knzo25 and others added 2 commits January 10, 2025 18:29

Update sensing/autoware_cuda_pointcloud_preprocessor/docs/cuda-pointc…

887f162

…loud-preprocessor.md Co-authored-by: Max Schmeller <6088931+mojomex@users.noreply.github.com>

Update sensing/autoware_cuda_pointcloud_preprocessor/src/cuda_pointcl…

0b46fb6

…oud_preprocessor/cuda_pointcloud_preprocessor.cu Co-authored-by: Manato Hirabayashi <3022416+manato@users.noreply.github.com>

knzo25 added 5 commits March 6, 2025 16:03

chore: spells

5dba7b3

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: removed default destructor

92b04a3

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: ocd activated (spelling)

17af8d8

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: fixed the schema

652d168

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: improved readibility

2449e67

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 requested review from mojomex and amadeuszsz March 10, 2025 07:33

amadeuszsz requested changes Mar 12, 2025

View reviewed changes

sensing/autoware_cuda_pointcloud_preprocessor/config/cuda_pointcloud_preprocessor.param.yaml Outdated Show resolved Hide resolved

chore: added dummy crop box

9ededc5

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

amadeuszsz approved these changes Mar 17, 2025

View reviewed changes

knzo25 mentioned this pull request Mar 17, 2025

feat(autoware.repos): added the cuda_blackboard and negotiated to the repos file autowarefoundation/autoware#5710

Merged

chore: added new repositories to ansible

5b55567

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 added the run:build-and-test-differential Mark to enable build-and-test-differential workflow. (used-by-ci) label Mar 18, 2025

knzo25 and others added 7 commits March 18, 2025 16:22

chore: CI/CD

405e5f8

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: more CI/CD

df442d5

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: mode CI/CD. some linters are conflicting

f4201b6

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

style(pre-commit): autofix

3fcbc42

chore: ignoring uncrustify

47e33bf

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: ignoring more uncrustify

8b4700b

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

chore: missed one more uncrustify exception

368009e

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

knzo25 added 2 commits March 18, 2025 19:05

chore: added meta dep

a2713a0

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

Merge branch 'main' into feat/cuda_blackboard_pointcloud_preprocessing

2a1ff8c

knzo25 merged commit 660ae1a into autowarefoundation:main Mar 18, 2025
35 of 37 checks passed

knzo25 mentioned this pull request Mar 19, 2025

feat(autoware_cuda_pointcloud_preprocessor): pointcloud concatenation #10300

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454

feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454

knzo25 commented Nov 25, 2024 •

edited

Loading

github-actions bot commented Nov 25, 2024 •

edited

Loading

mojomex left a comment

manato left a comment

manato Dec 26, 2024

amadeuszsz Mar 3, 2025

knzo25 Mar 17, 2025

knzo25 Mar 17, 2025

knzo25 commented Mar 6, 2025

knzo25 commented Mar 6, 2025

knzo25 commented Mar 10, 2025

amadeuszsz left a comment

codecov bot commented Mar 18, 2025 •

edited

Loading

	const InputPointType * input_points, InputPointType * output_points, int num_points,
	const InputPointType * __restrict__ input_points, InputPointType * output_points, int num_points,

feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454

feat(autoware_cuda_pointcloud_preprocessor): a cuda-accelerated pointcloud preprocessor #9454

Conversation

knzo25 commented Nov 25, 2024 • edited Loading

Description

Related links

How was this PR tested?

Notes for reviewers

Interface changes

Effects on system behavior

github-actions bot commented Nov 25, 2024 • edited Loading

mojomex left a comment

Choose a reason for hiding this comment

manato left a comment

Choose a reason for hiding this comment

manato Dec 26, 2024

Choose a reason for hiding this comment

amadeuszsz Mar 3, 2025

Choose a reason for hiding this comment

knzo25 Mar 17, 2025

Choose a reason for hiding this comment

knzo25 Mar 17, 2025

Choose a reason for hiding this comment

knzo25 commented Mar 6, 2025

knzo25 commented Mar 6, 2025

knzo25 commented Mar 10, 2025

amadeuszsz left a comment

Choose a reason for hiding this comment

codecov bot commented Mar 18, 2025 • edited Loading

Codecov Report

knzo25 commented Nov 25, 2024 •

edited

Loading

github-actions bot commented Nov 25, 2024 •

edited

Loading

codecov bot commented Mar 18, 2025 •

edited

Loading