Research outputs

2026

Channel-adaptive generative reconstruction and fusion for multi-sensor graph features in few-shot fault diagnosis

You, P., Wang, L., Nguyen, A., Zhang, X., & Huang, B. (2026). Channel-adaptive generative reconstruction and fusion for multi-sensor graph features in few-shot fault diagnosis. Information Fusion, 127. doi:10.1016/j.inffus.2025.103742

DOI: 10.1016/j.inffus.2025.103742
Type: Journal article

Agentic AI for Medicine Preface

Qiu, J., & Huang, B. (2026). Agentic AI for Medicine Preface. Lecture Notes in Computer Science, 16147 LNCS, v.

Type: Journal article

Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-In Gamma Probe

Xu, S., Hu, Y., Su, J., Elson, D. S., & Huang, B. (2026). Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-In Gamma Probe. In Unknown Book (Vol. 16298, pp. 116-125). doi:10.1007/978-3-032-09784-2_12

DOI: 10.1007/978-3-032-09784-2_12
Type: Chapter

SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction

Chen, J., Zhang, X., Hogue, M. I., Vasconcelos, F., Stoyanov, D., Elson, D. S., & Huang, B. (2026). SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction. In Unknown Book (Vol. 15970, pp. 572-582). doi:10.1007/978-3-032-05141-7_55

DOI: 10.1007/978-3-032-05141-7_55
Type: Chapter

2025

Learning Human Motion with Temporally Conditional Mamba

Nguyen, Q., Le, T., Huang, B., Vu, M. N., Le, N., Vo, T., & Nguyen, A. (2025). Learning Human Motion with Temporally Conditional Mamba. In Proceedings of the SIGGRAPH Asia 2025 Conference Papers (pp. 1-10). ACM. doi:10.1145/3757377.3763948

DOI: 10.1145/3757377.3763948
Type: Conference Paper

SplineFormer: An Explainable Transformer Network for Autonomous Endovascular Navigation

Jianu, T., Doust, S., Li, M., Huang, B., Do, T., Nguyen, H., . . . Nguyen, A. (2025). SplineFormer: An Explainable Transformer Network for Autonomous Endovascular Navigation. In 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 7718-7725). IEEE. doi:10.1109/iros60139.2025.11246898

DOI: 10.1109/iros60139.2025.11246898
Type: Conference Paper

RoboDesign1M: A Large-scale Dataset for Robot Design Understanding

Type: Preprint

FedSSL: Federated Learning with Shape-Sensitive Loss for Catheter and Guidewire Segmentation

Kongtongvattana, C., Huang, B., Nguyen, H., Olajide, O., & Nguyen, A. (2025). FedSSL: Federated Learning with Shape-Sensitive Loss for Catheter and Guidewire Segmentation. In 2024 IEEE International Conference on Robotics and Biomimetics (ROBIO) (pp. 2137-2143). IEEE. doi:10.1109/robio64047.2024.10907455

DOI: 10.1109/robio64047.2024.10907455
Type: Conference Paper

FedEFM: Federated Endovascular Foundation Model with Unseen Data

Type: Preprint

SplineFormer: An Explainable Transformer-Based Approach for Autonomous Endovascular Navigation

Type: Preprint

FedEFM: Federated Endovascular Foundation Model with Unseen Data

Tuong, D., Nghia, V., Jianu, T., Huang, B., Minh, V., Su, J., . . . Anh, N. (2025). FedEFM: Federated Endovascular Foundation Model with Unseen Data. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 10072-10079). doi:10.1109/ICRA55743.2025.11127787

DOI: 10.1109/ICRA55743.2025.11127787
Type: Conference Paper

FedPGD: Federated Learning with Projected Gradient Descent for Catheter and Guidewire Segmentation

Kongtongvattana, C., Huang, B., Nguyen, H., Olajide, O., & Nguyen, A. (2025). FedPGD: Federated Learning with Projected Gradient Descent for Catheter and Guidewire Segmentation. In Lecture Notes in Networks and Systems (pp. 80-91). Springer Nature Switzerland. doi:10.1007/978-3-031-92011-0_7

DOI: 10.1007/978-3-031-92011-0_7
Type: Chapter

GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System

Nguyen, Q., Le, T., Nguyen, H., Vo, T., Ta, T. D., Huang, B., . . . Nguyen, A. (2025). GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System. In 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 14939-14946). IEEE. doi:10.1109/iros60139.2025.11246582

DOI: 10.1109/iros60139.2025.11246582
Type: Conference Paper

Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction

Jianu, T., Huang, B., Nguyen, H., Bhattarai, B., Do, T., Tjiputra, E., . . . Nguyen, A. (2025). Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction. In Unknown Book (Vol. 15476, pp. 366-382). doi:10.1007/978-981-96-0917-8_21

DOI: 10.1007/978-981-96-0917-8_21
Type: Chapter

Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

Nguyen, T., Vu, M. N., Huang, B., Vuong, A., Vuong, Q., Le, N., . . . Nguyen, A. (2025). Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance. In Lecture Notes in Computer Science (pp. 363-381). Springer Nature Switzerland. doi:10.1007/978-3-031-72655-2_21

DOI: 10.1007/978-3-031-72655-2_21
Type: Chapter

Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications

Nghia, N., Vu, M. N., Ta, T. D., Huang, B., Vo, T., Le, N., & Anh, N. (2025). Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 5930-5936). doi:10.1109/ICRA55743.2025.11127829

DOI: 10.1109/ICRA55743.2025.11127829
Type: Conference Paper

StereoMamba: Real-Time and Robust Intraoperative Stereo Disparity Estimation via Long-Range Spatial Dependencies

Wang, X., Xu, J., Zhang, S., Huang, B., Stoyanov, D., & Mazomenos, E. B. (2025). StereoMamba: Real-Time and Robust Intraoperative Stereo Disparity Estimation via Long-Range Spatial Dependencies. IEEE Robotics and Automation Letters, 10(10), 10682-10689. doi:10.1109/LRA.2025.3604749

DOI: 10.1109/LRA.2025.3604749
Type: Journal article

Toward Clinically Interpretable Postoperative Prediction: A Diffusion-Based Framework with Unsupervised Geometric Priors

Yun, J., Ma, F., Huang, B., & Wang, C. (2025). Toward Clinically Interpretable Postoperative Prediction: A Diffusion-Based Framework with Unsupervised Geometric Priors. In 2025 10th International Conference on Communication, Image and Signal Processing (CCISP) (pp. 61-67). IEEE. doi:10.1109/ccisp67522.2025.11282335

DOI: 10.1109/ccisp67522.2025.11282335
Type: Conference Paper

Tracking Everything in Robotic-Assisted Surgery

Zhan, B., Zhao, W., Fang, Y., Du, B., Vasconcelos, F., Stoyanov, D., . . . Huang, B. (2025). Tracking Everything in Robotic-Assisted Surgery. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6809-6815). doi:10.1109/ICRA55743.2025.11128141

DOI: 10.1109/ICRA55743.2025.11128141
Type: Conference Paper

Translating Simulation Images to X-Ray Images via Multi-scale Semantic Matching

Kang, J., Jianu, T., Huang, B., Bhattarai, B., Ngan, L., Coenen, F., & Anh, N. (2025). Translating Simulation Images to X-Ray Images via Multi-scale Semantic Matching. In Unknown Book (Vol. 15265, pp. 95-104). doi:10.1007/978-3-031-73748-0_10

DOI: 10.1007/978-3-031-73748-0_10
Type: Chapter

2024

Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction

Type: Preprint

Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications

Type: Preprint

CathAction: A Benchmark for Endovascular Intervention Understanding

Type: Preprint

Language-driven Grasp Detection with Mask-guided Attention

Type: Preprint

Lightweight Language-driven Grasp Detection using Conditional Consistency Model

Type: Preprint

Language-driven Grasp Detection

Type: Preprint

Autonomous Catheterization with Open-source Simulator and Expert Trajectory

Type: Preprint

3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Image

Jianu, T., Huang, B., Berthet-Rayne, P., Fichera, S., & Nguyen, A. (2024). 3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Image. In Unknown Book (Vol. 1132, pp. 84-94). doi:10.1007/978-3-031-70684-4_7

DOI: 10.1007/978-3-031-70684-4_7
Type: Chapter

CathSim: An Open-Source Simulator for Endovascular Intervention

Jianu, T., Huang, B., Vu, M. N., Abdelaziz, M. E. M. K., Fichera, S., Lee, C. -Y., . . . Nguyen, A. (2024). CathSim: An Open-Source Simulator for Endovascular Intervention. IEEE Transactions on Medical Robotics and Bionics, 6(3), 971-979. doi:10.1109/TMRB.2024.3421256

DOI: 10.1109/TMRB.2024.3421256
Type: Journal article

Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

Vuong, A. D., Vu, M. N., Le, H., Huang, B., Binh, H. T. T., Vo, T., . . . Nguyen, A. (2024). Grasp-Anything: Large-scale Grasp Dataset from Foundation Models. In 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) (pp. 14030-14037). doi:10.1109/ICRA57147.2024.10611277

DOI: 10.1109/ICRA57147.2024.10611277
Type: Conference Paper

HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation

An, V., Toan, N., Minh, N. V., Huang, B., Binh, H. T. T., Thieu, V., & Anh, N. (2024). HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 (pp. 5821-5827). doi:10.1109/IROS58592.2024.10801823

DOI: 10.1109/IROS58592.2024.10801823
Type: Conference Paper

Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

Toan, N., Minh, N. V., Huang, B., Tuan, V. V., Vy, T., Ngan, L., . . . Anh, N. (2024). Language-Conditioned Affordance-Pose Detection in 3D Point Clouds. In 2024 IEEE International Conference on Robotics and Automation, ICRA 2024 (pp. 3071-3078). doi:10.1109/ICRA57147.2024.10610008

DOI: 10.1109/ICRA57147.2024.10610008
Type: Conference Paper

Language-driven Grasp Detection

An, D. V., Minh, N. V., Baoru, H., Nghia, N., Hieu, L., Thieu, V., & Anh, N. (2024). Language-driven Grasp Detection. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 17902-17912). doi:10.1109/CVPR52733.2024.01695

DOI: 10.1109/CVPR52733.2024.01695
Type: Conference Paper

Language-driven Grasp Detection with Mask-guided Attention

Nan, V. V., Minh, N. V., Huang, B., An, V., Ngan, L., Thieu, V., & Anh, N. (2024). Language-driven Grasp Detection with Mask-guided Attention. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 (pp. 7492-7498). doi:10.1109/IROS58592.2024.10802256

DOI: 10.1109/IROS58592.2024.10802256
Type: Conference Paper

Lightweight Language-driven Grasp Detection using Conditional Consistency Model

Nghia, N., Minh, N. V., Huang, B., Vuong, A., Ngan, L., Thieu, V., & Anh, N. (2024). Lightweight Language-driven Grasp Detection using Conditional Consistency Model. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024) (pp. 13719-13725). doi:10.1109/IROS58592.2024.10802007

DOI: 10.1109/IROS58592.2024.10802007
Type: Conference Paper

Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation

Tuan, V. V., Minh, N. V., Huang, B., Toan, N., Ngan, L., Thieu, V., & Anh, N. (2024). Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation. In 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) (pp. 13968-13975). doi:10.1109/ICRA57147.2024.10610247

DOI: 10.1109/ICRA57147.2024.10610247
Type: Conference Paper

Shape-Sensitive Loss for Catheter and Guidewire Segmentation

Kongtongvattana, C., Huang, B., Kang, J., Nguyen, H., Olufemi, O., & Nguyen, A. (2024). Shape-Sensitive Loss for Catheter and Guidewire Segmentation. In Unknown Book (Vol. 1132, pp. 95-107). doi:10.1007/978-3-031-70684-4_8

DOI: 10.1007/978-3-031-70684-4_8
Type: Chapter

2023

3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images

Type: Preprint

Shape-Sensitive Loss for Catheter and Guidewire Segmentation

Type: Preprint

Detecting the Sensing Area of a Laparoscopic Probe in Minimally Invasive Cancer Surgery

Huang, B., Hu, Y., Nguyen, A., Giannarou, S., & Elson, D. S. (2023). Detecting the Sensing Area of a Laparoscopic Probe in Minimally Invasive Cancer Surgery. In Unknown Conference (pp. 260-270). Springer Nature Switzerland. doi:10.1007/978-3-031-43996-4_25

DOI: 10.1007/978-3-031-43996-4_25
Type: Conference Paper

Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

Type: Preprint

Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation

Type: Preprint

Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

Type: Preprint

CathSim: An Open-source Simulator for Endovascular Intervention

DOI: 10.48550/arxiv.2208.01455
Type: Preprint

Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching

Type: Preprint

Language-driven Scene Synthesis using Multi-conditional Diffusion Model

An, D. V., Minh, N. V., Toan, T. N., Huang, B., Dzung, N., Thieu, V., & Anh, N. (2023). Language-driven Scene Synthesis using Multi-conditional Diffusion Model. In Advances in Neural Information Processing Systems 36 (NeurIPS 2023). Retrieved from https://www.webofscience.com/

Type: Conference Paper

2022

Self-supervised Depth Estimation in Laparoscopic Image Using 3D Geometric Consistency

Huang, B., Zheng, J. -Q., Nguyen, A., Xu, C., Gkouzionis, I., Vyas, K., . . . Elson, D. S. (2022). Self-supervised Depth Estimation in Laparoscopic Image Using 3D Geometric Consistency. In Medical Image Computing and Computer Assisted Intervention, MICCAI 2022, Part VII, Vol. 13437 (pp. 13-22). doi:10.1007/978-3-031-16449-1_2

DOI: 10.1007/978-3-031-16449-1_2
Type: Conference Paper

Simultaneous Depth Estimation and Surgical Tool Segmentation in Laparoscopic Images

Huang, B., Anh, N., Wang, S., Wang, Z., Mayer, E., Tuch, D., . . . Elson, D. S. (2022). Simultaneous Depth Estimation and Surgical Tool Segmentation in Laparoscopic Images. IEEE Transactions on Medical Robotics and Bionics, 4(2), 335-338. doi:10.1109/TMRB.2022.3170215

DOI: 10.1109/TMRB.2022.3170215
Type: Journal article