Hypersonic Glide Vehicles(HGVs)are advanced aircraft that can achieve extremely high speeds(generally over 5 Mach)and maneuverability within the Earth's atmosphere.HGV trajectory prediction is crucial for effectiv...Hypersonic Glide Vehicles(HGVs)are advanced aircraft that can achieve extremely high speeds(generally over 5 Mach)and maneuverability within the Earth's atmosphere.HGV trajectory prediction is crucial for effective defense planning and interception strategies.In recent years,HGV trajectory prediction methods based on deep learning have the great potential to significantly enhance prediction accuracy and efficiency.However,it's still challenging to strike a balance between improving prediction performance and reducing computation costs of the deep learning trajectory prediction models.To solve this problem,we propose a new deep learning framework(FECA-LSMN)for efficient HGV trajectory prediction.The model first uses a Frequency Enhanced Channel Attention(FECA)module to facilitate the fusion of different HGV trajectory features,and then subsequently employs a Light Sampling-oriented Multi-Layer Perceptron Network(LSMN)based on simple MLP-based structures to extract long/shortterm HGV trajectory features for accurate trajectory prediction.Also,we employ a new data normalization method called reversible instance normalization(RevIN)to enhance the prediction accuracy and training stability of the network.Compared to other popular trajectory prediction models based on LSTM,GRU and Transformer,our FECA-LSMN model achieves leading or comparable performance in terms of RMSE,MAE and MAPE metrics while demonstrating notably faster computation time.The ablation experiments show that the incorporation of the FECA module significantly improves the prediction performance of the network.The RevIN data normalization technique outperforms traditional min-max normalization as well.展开更多
The SDR-to-HDR translation technique can convert the abundant standard-dynamic-range (SDR) media resources to high-dynamic-range (HDR) ones, which can represent high-contrast scenes, providing more realistic visual ex...The SDR-to-HDR translation technique can convert the abundant standard-dynamic-range (SDR) media resources to high-dynamic-range (HDR) ones, which can represent high-contrast scenes, providing more realistic visual experiences. While recent vision Transformers have achieved promising performance in many low-level vision tasks, there are few works attempting to leverage Transformers for SDR-to-HDR translation. In this paper, we are among the first to investigate the performance of Transformers for SDR-to-HDR translation. We find that directly using the self-attention mechanism may involve artifacts in the results due to the inappropriate way to model long-range dependencies between the low-frequency and high-frequency components. Taking this into account, we advance the self-attention mechanism and present a dual frequency attention (DFA), which leverages the self-attention mechanism to separately encode the low-frequency structural information and high-frequency detail information. Based on the proposed DFA, we further design a multi-scale feature fusion network, named dual frequency Transformer (DFT), for efficient SDR-to-HDR translation. Extensive experiments on the HDRTV1K dataset demonstrate that our DFT can achieve better quantitative and qualitative performance than the recent state-of-the-art methods. The code of our DFT is made publicly available at https://github.com/CS-GangXu/DFT.展开更多
文摘Hypersonic Glide Vehicles(HGVs)are advanced aircraft that can achieve extremely high speeds(generally over 5 Mach)and maneuverability within the Earth's atmosphere.HGV trajectory prediction is crucial for effective defense planning and interception strategies.In recent years,HGV trajectory prediction methods based on deep learning have the great potential to significantly enhance prediction accuracy and efficiency.However,it's still challenging to strike a balance between improving prediction performance and reducing computation costs of the deep learning trajectory prediction models.To solve this problem,we propose a new deep learning framework(FECA-LSMN)for efficient HGV trajectory prediction.The model first uses a Frequency Enhanced Channel Attention(FECA)module to facilitate the fusion of different HGV trajectory features,and then subsequently employs a Light Sampling-oriented Multi-Layer Perceptron Network(LSMN)based on simple MLP-based structures to extract long/shortterm HGV trajectory features for accurate trajectory prediction.Also,we employ a new data normalization method called reversible instance normalization(RevIN)to enhance the prediction accuracy and training stability of the network.Compared to other popular trajectory prediction models based on LSTM,GRU and Transformer,our FECA-LSMN model achieves leading or comparable performance in terms of RMSE,MAE and MAPE metrics while demonstrating notably faster computation time.The ablation experiments show that the incorporation of the FECA module significantly improves the prediction performance of the network.The RevIN data normalization technique outperforms traditional min-max normalization as well.
基金supported by National Natural Science Foundation of China(Nos.61922046 and 62276145)the National Key Research and Development Program of China(No.2018AAA0100400)Fundamental Research Funds for Central Universities,China(No.63223049).
文摘The SDR-to-HDR translation technique can convert the abundant standard-dynamic-range (SDR) media resources to high-dynamic-range (HDR) ones, which can represent high-contrast scenes, providing more realistic visual experiences. While recent vision Transformers have achieved promising performance in many low-level vision tasks, there are few works attempting to leverage Transformers for SDR-to-HDR translation. In this paper, we are among the first to investigate the performance of Transformers for SDR-to-HDR translation. We find that directly using the self-attention mechanism may involve artifacts in the results due to the inappropriate way to model long-range dependencies between the low-frequency and high-frequency components. Taking this into account, we advance the self-attention mechanism and present a dual frequency attention (DFA), which leverages the self-attention mechanism to separately encode the low-frequency structural information and high-frequency detail information. Based on the proposed DFA, we further design a multi-scale feature fusion network, named dual frequency Transformer (DFT), for efficient SDR-to-HDR translation. Extensive experiments on the HDRTV1K dataset demonstrate that our DFT can achieve better quantitative and qualitative performance than the recent state-of-the-art methods. The code of our DFT is made publicly available at https://github.com/CS-GangXu/DFT.