Light detection and ranging (LiDAR) camera systems are becoming increasingly vital for autonomous driving. The monocular three-dimensional (3D) detection task is a critical and challenging aspect of this field. However, most algorithms rely solely on manually labeled images, which is a time-consuming and labor-intensive process, and the resulting detection lacks depth information. To address this problem, a semi-supervised 3D object detection model based on LiDAR camera systems (PVONet) is proposed to improve both the detection accuracy and processing time. First, an innovative data preparation block point-voxel fusion estimation is introduced; it utilizes LiDAR points to generate 3D bounding boxes for unlabeled data, thereby significantly reducing the time compared with manual labeling. Second, a new block based on fully connected neural network for box estimation (feature extraction and 3D object detection) is presented; it is utilized to conduct feature extraction, feature correlation, and 3D box estimation on monocular images. Finally, comprehensive experiments conducted on the popular KITTI 3D detection dataset demonstrate that our PVONet is faster (30 ms on KITTI benchmark) and more accurate [with increases of 4.69%/3.82% (easy), 4.45%/2.79% (moderate), and 4.07%/3.75% (hard) aggregation processes on 3D/bird’s eye view objects compared with the baseline]. This meets the requirements for high real-time performance in autonomous vehicles applications. The results demonstrate the effectiveness of our model based on LiDAR camera systems.
One of the motivations of OptoBot Lab is to train primer students into qualified engineers or researchers. The series training programs have been designed by supervisors and implemented with tutoring for students to test and practice their knowledge from textbooks. An environment perception experiment using a 32 layers laser scanner is described in this paper. The training program design and laboratory operation is introduced. The four parts of the experiments which are preparation, sensor calibration, 3D space reconstruction, and object recognition, are the participating students’ main tasks for different teams. This entire program is one of the series training programs that play significant role in establishing solid research skill foundation for opto-electronic students.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.