In global navigation satellite system denial environment,cross-view geo-localization based on image retrieval presents an exceedingly critical visual localization solution for Unmanned Aerial Vehicle(UAV)systems.The e...In global navigation satellite system denial environment,cross-view geo-localization based on image retrieval presents an exceedingly critical visual localization solution for Unmanned Aerial Vehicle(UAV)systems.The essence of cross-view geo-localization resides in matching images containing the same geographical targets from disparate platforms,such as UAV-view and satellite-view images.However,images of the same geographical targets may suffer from occlusions and geometric distortions due to variations in the capturing platform,view,and timing.The existing methods predominantly extract features by segmenting feature maps,which overlook the holistic semantic distribution and structural information of objects,resulting in loss of image information.To address these challenges,dilated neighborhood attention Transformer is employed as the feature extraction backbone,and Multi-feature representations based on Multi-scale Hierarchical Contextual Aggregation(MMHCA)is proposed.In the proposed MMHCA method,the multiscale hierarchical contextual aggregation method is utilized to extract contextual information from local to global across various granularity levels,establishing feature associations of contextual information with global and local information in the image.Subsequently,the multi-feature representations method is utilized to obtain rich discriminative feature information,bolstering the robustness of model in scenarios characterized by positional shifts,varying distances,and scale ambiguities.Comprehensive experiments conducted on the extensively utilized University-1652 and SUES-200 benchmarks indicate that the MMHCA method surpasses the existing techniques.showing outstanding results in UAV localization and navigation.展开更多
The goal of street-to-aerial cross-view image geo-localization is to determine the location of the query street-view image by retrieving the aerial-view image from the same place.The drastic viewpoint and appearance g...The goal of street-to-aerial cross-view image geo-localization is to determine the location of the query street-view image by retrieving the aerial-view image from the same place.The drastic viewpoint and appearance gap between the aerial-view and the street-view images brings a huge challenge against this task.In this paper,we propose a novel multiscale attention encoder to capture the multiscale contextual information of the aerial/street-view images.To bridge the domain gap between these two view images,we first use an inverse polar transform to make the street-view images approximately aligned with the aerial-view images.Then,the explored multiscale attention encoder is applied to convert the image into feature representation with the guidance of the learnt multiscale information.Finally,we propose a novel global mining strategy to enable the network to pay more attention to hard negative exemplars.Experiments on standard benchmark datasets show that our approach obtains 81.39%top-1 recall rate on the CVUSA dataset and 71.52%on the CVACT dataset,achieving the state-of-the-art performance and outperforming most of the existing methods significantly.展开更多
Numerous news or event pictures are taken and shared on the internet every day that have abundant information worth being mined,but only a small fraction of them are geotagged.The visual content of the news image hint...Numerous news or event pictures are taken and shared on the internet every day that have abundant information worth being mined,but only a small fraction of them are geotagged.The visual content of the news image hints at clues of the geographical location because they are usually taken at the site of the incident,which provides a prerequisite for geo-localization.This paper proposes an automated pipeline based on deep learning for the geo-localization of news pictures in a large-scale urban environment using geotagged street view images as a reference dataset.The approach obtains location information by constructing an attention-based feature extraction network.Then,the image features are aggregated,and the candidate street view image results are retrieved by the selective matching kernel function.Finally,the coordinates of the news images are estimated by the kernel density prediction method.The pipeline is tested in the news pictures in Hong Kong.In the comparison experiments,the proposed pipeline shows stable performance and generalizability in the large-scale urban environment.In addition,the performance analysis of components in the pipeline shows the ability to recognize localization features of partial areas in pictures and the effectiveness of the proposed solution in news picture geo-localization.展开更多
Consider the geo-localization task of finding the pose of a camera in a large 3 D scene from a single image.Most existing CNN-based methods use as input textured images.We aim to experimentally explore whether texture...Consider the geo-localization task of finding the pose of a camera in a large 3 D scene from a single image.Most existing CNN-based methods use as input textured images.We aim to experimentally explore whether texture and correlation between nearby images are necessary in a CNN-based solution for the geo-localization task.To do so,we consider lean images,textureless projections of a simple 3 D model of a city.They only contain information related to the geometry of the scene viewed(edges,faces,and relative depth).The main contributions of this paper are:(i)to demonstrate the ability of CNNs to recover camera pose using lean images;and(ii)to provide insight into the role of geometry in the CNN learning process.展开更多
The problem of associating the agricultural market names on web sites with their locations is essential for geographical analysis of the agricultural products. In this paper, an algorithm which employs the administrat...The problem of associating the agricultural market names on web sites with their locations is essential for geographical analysis of the agricultural products. In this paper, an algorithm which employs the administrative ontology and the statistics from the search results were proposed. The experiments with 100 market names collected from web sites were conducted. The experimental results demonstrate that the algorithm proposed obtains satisfactory performance in resolving the problem above, thus the effectiveness of the method is verified.展开更多
Support Vector Machine (SVM) is a powerful methodology for solving problems in non-linear classification, function estimation and density estimation, which has also led to many other recent developments in kernel base...Support Vector Machine (SVM) is a powerful methodology for solving problems in non-linear classification, function estimation and density estimation, which has also led to many other recent developments in kernel based methods in general. This paper presents a highaccuracy and fault-tolerant SVM for the mobile geo-location problem, which is an important component of pervasive computing. Simulation results show its basic location performance, and illustrate impacts of the number of training samples and training area on test location error.展开更多
In a Multiple-Input Multiple-Output (MIMO) Orthogonal Frequency Division Multiplexing (OFDM) based Wireless Local Area Network (WLAN) system, both Access Points (APs) and Mobile Termi-nals (MTs) are configured with mu...In a Multiple-Input Multiple-Output (MIMO) Orthogonal Frequency Division Multiplexing (OFDM) based Wireless Local Area Network (WLAN) system, both Access Points (APs) and Mobile Termi-nals (MTs) are configured with multiple antennas, to make novel indoor geo-location method possible. In this paper, we presented a novel Least Square Support Vector Machine (LS-SVM) based data fusion algorithm to fuse signal strength measurements for indoor geo-location using only a single AP with MIMO arrays. We evaluate our proposed algorithms under indoor environments by MATLAB simulations. Simulation results show that our MIMO-based algorithm is superior to conventional least square algorithm.展开更多
Emergency ambulance services in the UK are tasked with providing pre-hospital patient care and clinical services with a target response time between call connect to on-scene attendance.In 2017,NHS England introduced f...Emergency ambulance services in the UK are tasked with providing pre-hospital patient care and clinical services with a target response time between call connect to on-scene attendance.In 2017,NHS England introduced four new response time categories based on patient needs.The most challenging is to be on-scene for a life-threatening situation within seven minutes of the call being connected when such calls are random in terms of time and place throughout a large territory.Recent evidence indicates emergency ambulance services regularly fall short of achieving the target ambulance response times set by the National Health Service(NHS).To achieve these targets,they need to undertake transformational change and apply statistical,operations research and artificial intelligence techniques in the form of five separate modules covering demand forecasting,plus locate,allocate,dispatch,monitoring and re-deployment of resources.These modules should be linked in real-time employing a data warehouse to minimise computational data and generate accurate,meaningful and timely decisions ensuring patients receive an appropriate and timely response.A simulation covering a limited geographical area,time and operational data concluded that this form of integration of the five modules provides accurate and timely data upon which to make decisions that effectively improve ambulance response times.展开更多
基金supported by the National Natural Science Foundation of China(Nos.12072027,62103052,61603346 and 62103379)the Henan Key Laboratory of General Aviation Technology,China(No.ZHKF-230201)+3 种基金the Funding for the Open Research Project of the Rotor Aerodynamics Key Laboratory,China(No.RAL20200101)the Key Research and Development Program of Henan Province,China(Nos.241111222000 and 241111222900)the Key Science and Technology Program of Henan Province,China(No.232102220067)the Scholarship Funding from the China Scholarship Council(No.202206030079).
文摘In global navigation satellite system denial environment,cross-view geo-localization based on image retrieval presents an exceedingly critical visual localization solution for Unmanned Aerial Vehicle(UAV)systems.The essence of cross-view geo-localization resides in matching images containing the same geographical targets from disparate platforms,such as UAV-view and satellite-view images.However,images of the same geographical targets may suffer from occlusions and geometric distortions due to variations in the capturing platform,view,and timing.The existing methods predominantly extract features by segmenting feature maps,which overlook the holistic semantic distribution and structural information of objects,resulting in loss of image information.To address these challenges,dilated neighborhood attention Transformer is employed as the feature extraction backbone,and Multi-feature representations based on Multi-scale Hierarchical Contextual Aggregation(MMHCA)is proposed.In the proposed MMHCA method,the multiscale hierarchical contextual aggregation method is utilized to extract contextual information from local to global across various granularity levels,establishing feature associations of contextual information with global and local information in the image.Subsequently,the multi-feature representations method is utilized to obtain rich discriminative feature information,bolstering the robustness of model in scenarios characterized by positional shifts,varying distances,and scale ambiguities.Comprehensive experiments conducted on the extensively utilized University-1652 and SUES-200 benchmarks indicate that the MMHCA method surpasses the existing techniques.showing outstanding results in UAV localization and navigation.
基金National Natural Science Foundation of China,Grant/Award Number:62106177supported by the Central University Basic Research Fund of China(No.2042020KF0016)supported by the supercomputing system in the Supercomputing Center of Wuhan University.
文摘The goal of street-to-aerial cross-view image geo-localization is to determine the location of the query street-view image by retrieving the aerial-view image from the same place.The drastic viewpoint and appearance gap between the aerial-view and the street-view images brings a huge challenge against this task.In this paper,we propose a novel multiscale attention encoder to capture the multiscale contextual information of the aerial/street-view images.To bridge the domain gap between these two view images,we first use an inverse polar transform to make the street-view images approximately aligned with the aerial-view images.Then,the explored multiscale attention encoder is applied to convert the image into feature representation with the guidance of the learnt multiscale information.Finally,we propose a novel global mining strategy to enable the network to pay more attention to hard negative exemplars.Experiments on standard benchmark datasets show that our approach obtains 81.39%top-1 recall rate on the CVUSA dataset and 71.52%on the CVACT dataset,achieving the state-of-the-art performance and outperforming most of the existing methods significantly.
文摘Numerous news or event pictures are taken and shared on the internet every day that have abundant information worth being mined,but only a small fraction of them are geotagged.The visual content of the news image hints at clues of the geographical location because they are usually taken at the site of the incident,which provides a prerequisite for geo-localization.This paper proposes an automated pipeline based on deep learning for the geo-localization of news pictures in a large-scale urban environment using geotagged street view images as a reference dataset.The approach obtains location information by constructing an attention-based feature extraction network.Then,the image features are aggregated,and the candidate street view image results are retrieved by the selective matching kernel function.Finally,the coordinates of the news images are estimated by the kernel density prediction method.The pipeline is tested in the news pictures in Hong Kong.In the comparison experiments,the proposed pipeline shows stable performance and generalizability in the large-scale urban environment.In addition,the performance analysis of components in the pipeline shows the ability to recognize localization features of partial areas in pictures and the effectiveness of the proposed solution in news picture geo-localization.
文摘Consider the geo-localization task of finding the pose of a camera in a large 3 D scene from a single image.Most existing CNN-based methods use as input textured images.We aim to experimentally explore whether texture and correlation between nearby images are necessary in a CNN-based solution for the geo-localization task.To do so,we consider lean images,textureless projections of a simple 3 D model of a city.They only contain information related to the geometry of the scene viewed(edges,faces,and relative depth).The main contributions of this paper are:(i)to demonstrate the ability of CNNs to recover camera pose using lean images;and(ii)to provide insight into the role of geometry in the CNN learning process.
基金supported by the Knowledge Innovation Program of the Chinese Academy of Sciences
文摘The problem of associating the agricultural market names on web sites with their locations is essential for geographical analysis of the agricultural products. In this paper, an algorithm which employs the administrative ontology and the statistics from the search results were proposed. The experiments with 100 market names collected from web sites were conducted. The experimental results demonstrate that the algorithm proposed obtains satisfactory performance in resolving the problem above, thus the effectiveness of the method is verified.
文摘Support Vector Machine (SVM) is a powerful methodology for solving problems in non-linear classification, function estimation and density estimation, which has also led to many other recent developments in kernel based methods in general. This paper presents a highaccuracy and fault-tolerant SVM for the mobile geo-location problem, which is an important component of pervasive computing. Simulation results show its basic location performance, and illustrate impacts of the number of training samples and training area on test location error.
文摘In a Multiple-Input Multiple-Output (MIMO) Orthogonal Frequency Division Multiplexing (OFDM) based Wireless Local Area Network (WLAN) system, both Access Points (APs) and Mobile Termi-nals (MTs) are configured with multiple antennas, to make novel indoor geo-location method possible. In this paper, we presented a novel Least Square Support Vector Machine (LS-SVM) based data fusion algorithm to fuse signal strength measurements for indoor geo-location using only a single AP with MIMO arrays. We evaluate our proposed algorithms under indoor environments by MATLAB simulations. Simulation results show that our MIMO-based algorithm is superior to conventional least square algorithm.
文摘Emergency ambulance services in the UK are tasked with providing pre-hospital patient care and clinical services with a target response time between call connect to on-scene attendance.In 2017,NHS England introduced four new response time categories based on patient needs.The most challenging is to be on-scene for a life-threatening situation within seven minutes of the call being connected when such calls are random in terms of time and place throughout a large territory.Recent evidence indicates emergency ambulance services regularly fall short of achieving the target ambulance response times set by the National Health Service(NHS).To achieve these targets,they need to undertake transformational change and apply statistical,operations research and artificial intelligence techniques in the form of five separate modules covering demand forecasting,plus locate,allocate,dispatch,monitoring and re-deployment of resources.These modules should be linked in real-time employing a data warehouse to minimise computational data and generate accurate,meaningful and timely decisions ensuring patients receive an appropriate and timely response.A simulation covering a limited geographical area,time and operational data concluded that this form of integration of the five modules provides accurate and timely data upon which to make decisions that effectively improve ambulance response times.