-
Article
PCGC: a performance compact graph compiler based on multilevel fusion-splitting rules
The existing deep learning compilers are unable to perform efficient hardware performance-related graph fusion when both time and power consumption are considered. In addition, the compilers optimize the compu...
-
Article
LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs
Channel pruning has recently become a widely used model compression method. However, most existing channel pruning methods only prune to decrease the model size, such as the number of parameters or FLOPs, and ...
-
Article
High anti-interference and FPGA-oriented method for real-time ship detection based on structured LBP features
With the continuous enhancement of remote sensing technology, using satellite to detect and identify targets has important research significance in military and civil fields. Due to the influence of many inter...
-
Article
High-efficient MPSoC-based CNNs accelerator with optimized storage and dataflow
The convolutional neural networks (CNNs) are widely used in modern AI systems for their superior accuracy but at the cost of high computational complexity, which involve enormous communication bandwidth and st...
-
Article
Physical-barrier detection based collective motion analysis
Collective motion is one of the most fascinating phenomena and mainly caused by the interactions between individuals. Physical-barriers, as the particular facilities which divide the crowd into different lanes...