Research outputs
2026
Channel-adaptive generative reconstruction and fusion for multi-sensor graph features in few-shot fault diagnosis
You, P., Wang, L., Nguyen, A., Zhang, X., & Huang, B. (2026). Channel-adaptive generative reconstruction and fusion for multi-sensor graph features in few-shot fault diagnosis. Information Fusion, 127. doi:10.1016/j.inffus.2025.103742
Agentic AI for Medicine Preface
Qiu, J., & Huang, B. (2026). Agentic AI for Medicine Preface. Lecture Notes in Computer Science, 16147 LNCS, v.
Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-In Gamma Probe
Xu, S., Hu, Y., Su, J., Elson, D. S., & Huang, B. (2026). Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-In Gamma Probe. In Lecture Notes in Computer Science (pp. 116-125). Springer Nature Switzerland. doi:10.1007/978-3-032-09784-2_12
SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction
Chen, J., Zhang, X., Hogue, M. I., Vasconcelos, F., Stoyanov, D., Elson, D. S., & Huang, B. (2026). SurgicalGS: Dynamic 3D Gaussian Splatting for Accurate Robotic-Assisted Surgical Scene Reconstruction. In Unknown Book (Vol. 15970, pp. 572-582). doi:10.1007/978-3-032-05141-7_55
2025
Learning Human Motion with Temporally Conditional Mamba
Nguyen, Q., Le, T., Huang, B., Vu, M. N., Le, N., Vo, T., & Nguyen, A. (2025). Learning Human Motion with Temporally Conditional Mamba. In Proceedings of the SIGGRAPH Asia 2025 Conference Papers (pp. 1-10). ACM. doi:10.1145/3757377.3763948
Toward Clinically Interpretable Postoperative Prediction: A Diffusion-Based Framework with Unsupervised Geometric Priors
Yun, J., Ma, F., Huang, B., & Wang, C. (2025). Toward Clinically Interpretable Postoperative Prediction: A Diffusion-Based Framework with Unsupervised Geometric Priors. In 2025 10th International Conference on Communication, Image and Signal Processing (CCISP) (pp. 61-67). IEEE. doi:10.1109/ccisp67522.2025.11282335
GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System
Nguyen, Q., Le, T., Nguyen, H., Vo, T., Ta, T. D., Huang, B., . . . Nguyen, A. (2025). GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System. In 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 14939-14946). IEEE. doi:10.1109/iros60139.2025.11246582
SplineFormer: An Explainable Transformer Network for Autonomous Endovascular Navigation
Jianu, T., Doust, S., Li, M., Huang, B., Do, T., Nguyen, H., . . . Nguyen, A. (2025). SplineFormer: An Explainable Transformer Network for Autonomous Endovascular Navigation. In 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 7718-7725). IEEE. doi:10.1109/iros60139.2025.11246898
FedSSL: Federated Learning with Shape-Sensitive Loss for Catheter and Guidewire Segmentation
Kongtongvattana, C., Huang, B., Nguyen, H., Olajide, O., & Nguyen, A. (2025). FedSSL: Federated Learning with Shape-Sensitive Loss for Catheter and Guidewire Segmentation. In 2024 IEEE International Conference on Robotics and Biomimetics (ROBIO) (pp. 2137-2143). IEEE. doi:10.1109/robio64047.2024.10907455
FedEFM: Federated Endovascular Foundation Model with Unseen Data
Tuong, D., Nghia, V., Jianu, T., Huang, B., Minh, V., Su, J., . . . Anh, N. (2025). FedEFM: Federated Endovascular Foundation Model with Unseen Data. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 10072-10079). doi:10.1109/ICRA55743.2025.11127787
FedPGD: Federated Learning with Projected Gradient Descent for Catheter and Guidewire Segmentation
Kongtongvattana, C., Huang, B., Nguyen, H., Olajide, O., & Nguyen, A. (2025). FedPGD: Federated Learning with Projected Gradient Descent for Catheter and Guidewire Segmentation. In Lecture Notes in Networks and Systems (pp. 80-91). Springer Nature Switzerland. doi:10.1007/978-3-031-92011-0_7
Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction
Jianu, T., Huang, B., Nguyen, H., Bhattarai, B., Do, T., Tjiputra, E., . . . Nguyen, A. (2025). Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction. In Unknown Book (Vol. 15476, pp. 366-382). doi:10.1007/978-981-96-0917-8_21
Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance
Nguyen, T., Vu, M. N., Huang, B., Vuong, A., Vuong, Q., Le, N., . . . Nguyen, A. (2025). Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance. In Lecture Notes in Computer Science (pp. 363-381). Springer Nature Switzerland. doi:10.1007/978-3-031-72655-2_21
Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Nghia, N., Vu, M. N., Ta, T. D., Huang, B., Vo, T., Le, N., & Anh, N. (2025). Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 5930-5936). doi:10.1109/ICRA55743.2025.11127829
StereoMamba: Real-Time and Robust Intraoperative Stereo Disparity Estimation via Long-Range Spatial Dependencies
Wang, X., Xu, J., Zhang, S., Huang, B., Stoyanov, D., & Mazomenos, E. B. (2025). StereoMamba: Real-Time and Robust Intraoperative Stereo Disparity Estimation via Long-Range Spatial Dependencies. IEEE Robotics and Automation Letters, 10(10), 10682-10689. doi:10.1109/LRA.2025.3604749
Tracking Everything in Robotic-Assisted Surgery
Zhan, B., Zhao, W., Fang, Y., Du, B., Vasconcelos, F., Stoyanov, D., . . . Huang, B. (2025). Tracking Everything in Robotic-Assisted Surgery. In 2025 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6809-6815). doi:10.1109/ICRA55743.2025.11128141
Translating Simulation Images to X-Ray Images via Multi-scale Semantic Matching
Kang, J., Jianu, T., Huang, B., Bhattarai, B., Ngan, L., Coenen, F., & Anh, N. (2025). Translating Simulation Images to X-Ray Images via Multi-scale Semantic Matching. In Unknown Book (Vol. 15265, pp. 95-104). doi:10.1007/978-3-031-73748-0_10
2024
3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Image
Jianu, T., Huang, B., Berthet-Rayne, P., Fichera, S., & Nguyen, A. (2024). 3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Image. In Unknown Book (Vol. 1132, pp. 84-94). doi:10.1007/978-3-031-70684-4_7
CathSim: An Open-Source Simulator for Endovascular Intervention
Jianu, T., Huang, B., Vu, M. N., Abdelaziz, M. E. M. K., Fichera, S., Lee, C.-Y., . . . Nguyen, A. (2024). CathSim: An Open-Source Simulator for Endovascular Intervention. IEEE Transactions on Medical Robotics and Bionics, 6(3), 971-979. doi:10.1109/TMRB.2024.3421256
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Vuong, A. D., Vu, M. N., Le, H., Huang, B., Binh, H. T. T., Vo, T., . . . Nguyen, A. (2024). Grasp-Anything: Large-scale Grasp Dataset from Foundation Models. In 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) (pp. 14030-14037). doi:10.1109/ICRA57147.2024.10611277
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation
An, V., Toan, N., Minh, N. V., Huang, B., Binh, H. T. T., Thieu, V., & Anh, N. (2024). HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024) (pp. 5821-5827). doi:10.1109/IROS58592.2024.10801823
Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Toan, N., Minh, N. V., Huang, B., Tuan, V. V., Vy, T., Ngan, L., . . . Anh, N. (2024). Language-Conditioned Affordance-Pose Detection in 3D Point Clouds. In 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) (pp. 3071-3078). doi:10.1109/ICRA57147.2024.10610008
Language-driven Grasp Detection
An, D. V., Minh, N. V., Huang, B., Nghia, N., Hieu, L., Thieu, V., & Anh, N. (2024). Language-driven Grasp Detection. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 17902-17912). doi:10.1109/CVPR52733.2024.01695
Language-driven Grasp Detection with Mask-guided Attention
Nan, V. V., Minh, N. V., Huang, B., An, V., Ngan, L., Thieu, V., & Anh, N. (2024). Language-driven Grasp Detection with Mask-guided Attention. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024) (pp. 7492-7498). doi:10.1109/IROS58592.2024.10802256
Lightweight Language-driven Grasp Detection using Conditional Consistency Model
Nghia, N., Minh, N. V., Huang, B., Vuong, A., Ngan, L., Thieu, V., & Anh, N. (2024). Lightweight Language-driven Grasp Detection using Conditional Consistency Model. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024) (pp. 13719-13725). doi:10.1109/IROS58592.2024.10802007
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation
Tuan, V. V., Minh, N. V., Huang, B., Toan, N., Ngan, L., Thieu, V., & Anh, N. (2024). Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation. In 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) (pp. 13968-13975). doi:10.1109/ICRA57147.2024.10610247
Shape-Sensitive Loss for Catheter and Guidewire Segmentation
Kongtongvattana, C., Huang, B., Kang, J., Nguyen, H., Olufemi, O., & Nguyen, A. (2024). Shape-Sensitive Loss for Catheter and Guidewire Segmentation. In Unknown Book (Vol. 1132, pp. 95-107). doi:10.1007/978-3-031-70684-4_8
2023
Detecting the Sensing Area of a Laparoscopic Probe in Minimally Invasive Cancer Surgery
Huang, B., Hu, Y., Nguyen, A., Giannarou, S., & Elson, D. S. (2023). Detecting the Sensing Area of a Laparoscopic Probe in Minimally Invasive Cancer Surgery. In Unknown Conference (pp. 260-270). Springer Nature Switzerland. doi:10.1007/978-3-031-43996-4_25
CathSim: An Open-source Simulator for Endovascular Intervention
Language-driven Scene Synthesis using Multi-conditional Diffusion Model
An, D. V., Minh, N. V., Toan, T. N., Huang, B., Dzung, N., Thieu, V., & Anh, N. (2023). Language-driven Scene Synthesis using Multi-conditional Diffusion Model. In Advances in Neural Information Processing Systems 36 (NeurIPS 2023).
2022
Self-supervised Depth Estimation in Laparoscopic Image Using 3D Geometric Consistency
Huang, B., Zheng, J.-Q., Nguyen, A., Xu, C., Gkouzionis, I., Vyas, K., . . . Elson, D. S. (2022). Self-supervised Depth Estimation in Laparoscopic Image Using 3D Geometric Consistency. In Medical Image Computing and Computer Assisted Intervention, MICCAI 2022, Part VII (Vol. 13437, pp. 13-22). doi:10.1007/978-3-031-16449-1_2
Simultaneous Depth Estimation and Surgical Tool Segmentation in Laparoscopic Images
Huang, B., Anh, N., Wang, S., Wang, Z., Mayer, E., Tuch, D., . . . Elson, D. S. (2022). Simultaneous Depth Estimation and Surgical Tool Segmentation in Laparoscopic Images. IEEE Transactions on Medical Robotics and Bionics, 4(2), 335-338. doi:10.1109/TMRB.2022.3170215