Image and Video Technology
11th Pacific-Rim Symposium, PSIVT 2023, Auckland, New Zealand, November 22–24, 2023, Proceedings
Article
In this paper, we present a novel deep learning method for detecting and tracking vehicles within the context of autonomous driving, particularly focusing on scenarios related to vehicle failures. Ensuring the...
Article
Deep learning-based visual object detection is a fundamental aspect of computer vision. These models not only locate and classify multiple objects within an image, but they also identify bounding boxes. The fo...
Article
Traditional models for pose estimation in video surveillance are based on graph structures, in this paper, we propose a method that breaks the limitation of template matching within a range of pose changes to ...
Article
To effectively extract and classify the information from reports or documents and protect the privacy of the extracted results, we propose a privacy classification named Word Embedding Combination Privacy-pres...
Article
Semi-supervised learning offers a solution to the high cost and limited availability of manually labeled samples in supervised learning. In semi-supervised visual object detection, the use of unlabeled data ca...
Article
Waste categorization and recycling are critical approaches for converting waste into valuable and functional materials, thereby significantly aiding in land preservation, reducing pollution, and optimizing res...
Chapter and Conference Paper
In this paper, we propose an improved YOLOv8-based Kiwifruit detection method using Swin Transformer, aiming to address challenges posed by significant scale variation and inaccuracies in multiscale object det...
Chapter and Conference Paper
In this paper, utilizing a multiscale training dataset, YOLOv8 demonstrates rapid inference capabilities and exceptional accuracy in detecting visual objects, particularly smaller ones. This outperforms transf...
Chapter and Conference Paper
Human face mask detection leverages computer vision technology to discern whether individuals in images or videos are wearing masks. Ensuring proper mask usage is crucial in settings such as hospital operating...
Chapter and Conference Paper
Given the prevalence of worldwide pandemics, the need of adhering to appropriate mask use becomes more paramount. Therefore, the importance of develo** a human face mask detection model that is both efficien...
Book and Conference Proceedings
11th Pacific-Rim Symposium, PSIVT 2023, Auckland, New Zealand, November 22–24, 2023, Proceedings
Article
We describe a non-destructive test of apple ripeness using digital images of multiple types of apples. In this paper, fruit images are treated as data samples, artificial intelligence models are employed to im...
Article
Pattern classification has always been essential in computer vision. Transformer paradigm having attention mechanism with global receptive field in computer vision improves the efficiency and effectiveness of ...
Article
Sign language recognition is one of the fundamental ways to assist deaf people to communicate with others. An accurate vision-based sign language recognition system using deep learning is a fundamental goal fo...
Article
Colorectal cancer (CRC) is caused by malignant polyps which must be resected and examined for accurate classification. Biopsy, the manual workflow of polyp classification is time-intensive task and requires an...
Article
Gait-based pedestrian identification has important applications in intelligent surveillance. From anatomical viewpoint, the physical uniqueness of human gait is physiological discriminative of individuals. The...
Chapter and Conference Paper
In New Zealand (NZ), agriculture is an essential industry, Kiwifruits contribute significantly to the country’s overall exports. Traditionally Kiwifruits require manually picking up and heavily relies on human...
Chapter and Conference Paper
Traffic signs are essentially needed to obey the traffic rules. Once a driver ignores the signs, especially those critical signs, due to the complexity of actual traffic scenes or the influence of inclement we...
Chapter and Conference Paper
Autonomous cars can accurately perceive the deployment of traffic scenes and the distance between visual objects in the scenarios through understanding the depth. Therefore, the depth estimation of scenes is a...
Chapter and Conference Paper
In this paper, ConvNeXt is selected as a model for waste classification from digital images. ConvNeXt is a CNN-based backbone network that has been proposed to further improve the performance of models for vis...