Sign language dataset is essential in sign language recognition and translation(SLRT). Current public sign language datasets are small and lack diversity, which does not meet the practical application requirements for...Sign language dataset is essential in sign language recognition and translation(SLRT). Current public sign language datasets are small and lack diversity, which does not meet the practical application requirements for SLRT. However, making a large-scale and diverse sign language dataset is difficult as sign language data on the Internet is scarce. In making a large-scale and diverse sign language dataset, some sign language data qualities are not up to standard. This paper proposes a two information streams transformer(TIST) model to judge whether the quality of sign language data is qualified. To verify that TIST effectively improves sign language recognition(SLR), we make two datasets, the screened dataset and the unscreened dataset. In this experiment, this paper uses visual alignment constraint(VAC) as the baseline model. The experimental results show that the screened dataset can achieve better word error rate(WER) than the unscreened dataset.展开更多
Unmanned Aerial Vehicles(UAVs)coupled with deep learning such as Convolutional Neural Networks(CNNs)have been widely applied across numerous domains,including agriculture,smart city monitoring,and fire rescue operatio...Unmanned Aerial Vehicles(UAVs)coupled with deep learning such as Convolutional Neural Networks(CNNs)have been widely applied across numerous domains,including agriculture,smart city monitoring,and fire rescue operations,owing to their malleability and versatility.However,the computation-intensive and latency-sensitive natures of CNNs present a formidable obstacle to their deployment on resource-constrained UAVs.Some early studies have explored a hybrid approach that dynamically switches between lightweight and complex models to balance accuracy and latency.However,they often overlook scenarios involving multiple concurrent CNN streams,where competition for resources between streams can substantially impact latency and overall system performance.In this paper,we first investigate the deployment of both lightweight and complex models for multiple CNN streams in UAV swarm.Specifically,we formulate an optimization problem to minimize the total latency across multiple CNN streams,under the constraints on UAV memory and the accuracy requirement of each stream.To address this problem,we propose an algorithm called Adaptive Model Switching of collaborative inference for MultiCNN streams(AMSM)to identify the inference strategy with a low latency.Simulation results demonstrate that the proposed AMSM algorithm consistently achieves the lowest latency while meeting the accuracy requirements compared to benchmark algorithms.展开更多
For oviparous species with little or no parental care, oviposition site choice may be a powerful agent of natural selection, playing an important role in the evolution of species fitness and life history. Identifying ...For oviparous species with little or no parental care, oviposition site choice may be a powerful agent of natural selection, playing an important role in the evolution of species fitness and life history. Identifying and clarifying the microhabitat characteristics of oviposition sites are key to understanding life history strategies and local adaptations for amphibians with complex biphasic life cycles. However, oviposition strategies for amphibians in alpine streams remain poorly understood. Here, we focused on oviposition site selection for the Chinting alpine toad(Scutiger chintingensis) on Mount Wawu on the eastern margin of the Qinghai-Xizang Plateau. We identified the microhabitat differences between oviposition sites and non-oviposition sites for this species, predicted oviposition site selection patterns,and evaluated the relative importance of each microhabitat variable as well as the effects of the variables on the model prediction probabilities. This study revealed that the microhabitat characteristics of oviposition sites in this species included high water velocity, shallow water, and high-water surface coverage. Water velocity and rock volume were the most important variables explaining the categories of oviposition sites. Additionally, we observed nearly half the proportion of overlapping oviposition in this species, and the benefits of this behavior in terms of microhabitat characteristics need to be explored in the future. Our study revealed an oviposition site selection strategy for amphibians within alpine streams, and contributed to understanding the life history and environmental adaptation strategies of an endemic alpine stream species, and thereby providing guidance for effective conservation strategies for alpine stream amphibians under global change.展开更多
This study describes the gradient analysis of the freshwater macroinvertebrate assemblages in eight streams of Tenerife and La Gomera (Canary Islands) over a 16-year period. During this period, a total of 75 taxa belo...This study describes the gradient analysis of the freshwater macroinvertebrate assemblages in eight streams of Tenerife and La Gomera (Canary Islands) over a 16-year period. During this period, a total of 75 taxa belonging to 34 taxonomic families were found. Endemism has an important presence in the streams on both islands, especially regarding Trichoptera and Coleoptera. The overall status of freshwater macroinvertebrates is rather uncertain as recent data on these communities are scarce and focused on a limited number of sites. Overexploitation of aquifers and the diversion of natural water flows for irrigation have resulted in the drying up of numerous natural streams, inevitably endangering the fauna that inhabits them. A reduction in number and abundance of endemic and sensitive species was observed in the majority of the sampled streams resulting in a lower ecological rating. Therefore, it is proposed that the protection of streams of high conservation value is essential to conserve freshwater macroinvertebrate fauna native to the Canary Islands.展开更多
Cryogenic block streams consist of a stream of rocks superficially resembling a stream deposit but lacking a matrix, usually occurring on a valley or gully floor or on slopes that are less steep than the maximum angle...Cryogenic block streams consist of a stream of rocks superficially resembling a stream deposit but lacking a matrix, usually occurring on a valley or gully floor or on slopes that are less steep than the maximum angle of repose of coarse sediments. They are usually formed on perennially frozen ground, but can also occur as relict landforms. There are three main active kinds forming today, viz., Siberian and Tibetan dynamic rock streams and lag block streams. During their formation, the blocks in the active Siberian and Tibetan dynamic block streams move downslope at up to 1 rn/a. They are forming today on the Tibetan Plateau and in the more arid parts of south-central Siberia, although the processes involved in the movement are different. In the case of the Tibetan type, individual blocks slide downslope over the substrate in winter on an icy coating in areas of minimal winter precipitation. The Siberian type develops in areas of 15-80 cm of winter snow cover and an MAAT (mean annual air temperature) of-4 ~C to -17 ~C. The movement is due to creep of snow and ice and collapse of the blocks downslope during thawing. Lag block streams are formed by meltwater flowing over the surface of sediment consisting primarily of larger blocks with a limited amount of interstitial sediment. The erosion of the matrix is primarily in the spring in areas of higher winter precipitation on 10^-30~ slopes. The blocks remain stationary, but the interstitial sediment is washed out by strong seasonal flows of meltwater or rain to form an alluvial fan. The boulders undergo weathering and become more rounded in the process. Lag block streams can also develop without the presence of permafrost in areas with cold climates or glaciers. Block streams also occur as relict deposits in older deposits under various climatic regimes that are unsuitable for their formation today. An example of relict lag block streams with subangular to subrounded blocks occurs in gullies on the forested mountainsides at Felsen in Germany, and is the original "felsenmeer". Similar examples occur near Vitosha Mountain in Bulgaria. The "stone runs" in the Falkland Islands are examples of the more angular relict lag block streams. In both Tasmania and the Falkland Islands, they mask a more complex history, the underlying soils indicating periods of tropical and temperate soil formation resulting from weathering during and since the Tertiary Period. Block streams have also been reported from beneath cold-based glaciers in Sweden, and below till in Canada, and when ex- humed, can continue to develop.展开更多
Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions i...Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd dataset serves as a catalyst for the development ofmore robust and effective fitnesstracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.展开更多
A new algorithm for clustering multiple data streams is proposed.The algorithm can effectively cluster data streams which show similar behavior with some unknown time delays.The algorithm uses the autoregressive (AR...A new algorithm for clustering multiple data streams is proposed.The algorithm can effectively cluster data streams which show similar behavior with some unknown time delays.The algorithm uses the autoregressive (AR) modeling technique to measure correlations between data streams.It exploits estimated frequencies spectra to extract the essential features of streams.Each stream is represented as the sum of spectral components and the correlation is measured component-wise.Each spectral component is described by four parameters,namely,amplitude,phase,damping rate and frequency.The ε-lag-correlation between two spectral components is calculated.The algorithm uses such information as similarity measures in clustering data streams.Based on a sliding window model,the algorithm can continuously report the most recent clustering results and adjust the number of clusters.Experiments on real and synthetic streams show that the proposed clustering method has a higher speed and clustering quality than other similar methods.展开更多
A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry.The first step of this method is to parallel sample the data,wh...A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry.The first step of this method is to parallel sample the data,which is implemented as an extended reservoir-sampling algorithm.A skip factor based on the change ratio of data-values is introduced to describe the distribution characteristics of data-values adaptively.The second step of this method is to partition the fluxes of data streams averagely,which is implemented with two alternative equal-depth histogram generating algorithms that fit the different cases:one for incremental maintenance based on heuristics and the other for periodical updates to generate an approximate partition vector.The experimental results on actual data prove that the method is efficient,practical and suitable for time-varying data streams processing.展开更多
Ephemeral and perennial streams of mountainous catchments in Sabaragamuwa Province of Sri Lanka and Hong Kong of China were studied for two years on vegetation dynamics.Each year,sampling was conducted during a period...Ephemeral and perennial streams of mountainous catchments in Sabaragamuwa Province of Sri Lanka and Hong Kong of China were studied for two years on vegetation dynamics.Each year,sampling was conducted during a period when ephemeral streams had low surface flows.Sampling was realized contiguously using belt transects.The standing crop biomass(hereafter biomass)of herbaceous vegetation in ephemeral channels was comparatively lower than perennials and so was the herb diversity.Herb diversity showed a peak from 1.5 to 4.5 m from the centerline/thalweg of ephemeral and perennial streams.Out of 24 herbs,only three were common for both.A peak herb biomass zone was observed in perennials in the same region where diversity peaked.In ephemerals,herb biomass increased laterally up to^1.5 m,and was constant thereafter.Seedling experiment results tallied with the field diversity observations of both stream types,and suggested that seed dispersion was the main reason for herb colonization.Furthermore,it showed sapling emergence to be significantly higher in perennials than ephemerals.Return period of annual maximum monthly rainfall was a strong indicator of age of trees in ephemeral streams,and elucidated the possibility of hindcasting past flow episodes.Electrical conductivity was significantly high in ephemeral streams among all the water quality parameters.The contents of the water nutrients were approximately the same in both stream types.While recommending further studies on eco-hydrology of ephemerals,we recognize ephemeral streams to be valuable references in climate change studies due to their responsiveness and representativeness in long term hydrological changes.展开更多
In order to avoid the redundant and inconsistent information in distributed data streams, a sampling method based on min-wise hash functions is designed and the practical semantics of the union of distributed data str...In order to avoid the redundant and inconsistent information in distributed data streams, a sampling method based on min-wise hash functions is designed and the practical semantics of the union of distributed data streams is defined. First, for each family of min-wise hash functions, the data with the minimum hash value are selected as local samples and the biased effect caused by frequent updates in a single data stream is filtered out. Secondly, for the same hash function, the sample with the minimum hash value is selected as the global sample and the local samples are combined at the center node to filter out the biased effect of duplicated updates. Finally, based on the obtained uniform samples, several aggregations on the defined semantics of the union of data streams are precisely estimated. The results of comparison tests on synthetic and real-life data streams demonstrate the effectiveness of this method.展开更多
基金supported by the National Language Commission to research on sign language data specifications for artificial intelligence applications and test standards for language service translation systems (No.ZDI145-70)。
文摘Sign language dataset is essential in sign language recognition and translation(SLRT). Current public sign language datasets are small and lack diversity, which does not meet the practical application requirements for SLRT. However, making a large-scale and diverse sign language dataset is difficult as sign language data on the Internet is scarce. In making a large-scale and diverse sign language dataset, some sign language data qualities are not up to standard. This paper proposes a two information streams transformer(TIST) model to judge whether the quality of sign language data is qualified. To verify that TIST effectively improves sign language recognition(SLR), we make two datasets, the screened dataset and the unscreened dataset. In this experiment, this paper uses visual alignment constraint(VAC) as the baseline model. The experimental results show that the screened dataset can achieve better word error rate(WER) than the unscreened dataset.
基金supported by the National Natural Science Foundation of China(No.61931011)the Jiangsu Provincial Key Research and Development Program,China(No.BE2021013-4)the Fundamental Research Project in University Characteristic Disciplines,China(No.ILF240071A24)。
文摘Unmanned Aerial Vehicles(UAVs)coupled with deep learning such as Convolutional Neural Networks(CNNs)have been widely applied across numerous domains,including agriculture,smart city monitoring,and fire rescue operations,owing to their malleability and versatility.However,the computation-intensive and latency-sensitive natures of CNNs present a formidable obstacle to their deployment on resource-constrained UAVs.Some early studies have explored a hybrid approach that dynamically switches between lightweight and complex models to balance accuracy and latency.However,they often overlook scenarios involving multiple concurrent CNN streams,where competition for resources between streams can substantially impact latency and overall system performance.In this paper,we first investigate the deployment of both lightweight and complex models for multiple CNN streams in UAV swarm.Specifically,we formulate an optimization problem to minimize the total latency across multiple CNN streams,under the constraints on UAV memory and the accuracy requirement of each stream.To address this problem,we propose an algorithm called Adaptive Model Switching of collaborative inference for MultiCNN streams(AMSM)to identify the inference strategy with a low latency.Simulation results demonstrate that the proposed AMSM algorithm consistently achieves the lowest latency while meeting the accuracy requirements compared to benchmark algorithms.
基金supported by the National Natural Science Foundation of China (32271737, 32071544)the Interdisciplinary Innovation Team of the Chinese Academy of Sciences (CAS) “Light of West China” Program (xbzg-zdsys-202207)+1 种基金the Shenzhen Zhilan Foundation (2023010291A)Nature Science Foundation of Sichuan Province (24NSFSC2934)。
文摘For oviparous species with little or no parental care, oviposition site choice may be a powerful agent of natural selection, playing an important role in the evolution of species fitness and life history. Identifying and clarifying the microhabitat characteristics of oviposition sites are key to understanding life history strategies and local adaptations for amphibians with complex biphasic life cycles. However, oviposition strategies for amphibians in alpine streams remain poorly understood. Here, we focused on oviposition site selection for the Chinting alpine toad(Scutiger chintingensis) on Mount Wawu on the eastern margin of the Qinghai-Xizang Plateau. We identified the microhabitat differences between oviposition sites and non-oviposition sites for this species, predicted oviposition site selection patterns,and evaluated the relative importance of each microhabitat variable as well as the effects of the variables on the model prediction probabilities. This study revealed that the microhabitat characteristics of oviposition sites in this species included high water velocity, shallow water, and high-water surface coverage. Water velocity and rock volume were the most important variables explaining the categories of oviposition sites. Additionally, we observed nearly half the proportion of overlapping oviposition in this species, and the benefits of this behavior in terms of microhabitat characteristics need to be explored in the future. Our study revealed an oviposition site selection strategy for amphibians within alpine streams, and contributed to understanding the life history and environmental adaptation strategies of an endemic alpine stream species, and thereby providing guidance for effective conservation strategies for alpine stream amphibians under global change.
文摘This study describes the gradient analysis of the freshwater macroinvertebrate assemblages in eight streams of Tenerife and La Gomera (Canary Islands) over a 16-year period. During this period, a total of 75 taxa belonging to 34 taxonomic families were found. Endemism has an important presence in the streams on both islands, especially regarding Trichoptera and Coleoptera. The overall status of freshwater macroinvertebrates is rather uncertain as recent data on these communities are scarce and focused on a limited number of sites. Overexploitation of aquifers and the diversion of natural water flows for irrigation have resulted in the drying up of numerous natural streams, inevitably endangering the fauna that inhabits them. A reduction in number and abundance of endemic and sensitive species was observed in the majority of the sampled streams resulting in a lower ecological rating. Therefore, it is proposed that the protection of streams of high conservation value is essential to conserve freshwater macroinvertebrate fauna native to the Canary Islands.
文摘Cryogenic block streams consist of a stream of rocks superficially resembling a stream deposit but lacking a matrix, usually occurring on a valley or gully floor or on slopes that are less steep than the maximum angle of repose of coarse sediments. They are usually formed on perennially frozen ground, but can also occur as relict landforms. There are three main active kinds forming today, viz., Siberian and Tibetan dynamic rock streams and lag block streams. During their formation, the blocks in the active Siberian and Tibetan dynamic block streams move downslope at up to 1 rn/a. They are forming today on the Tibetan Plateau and in the more arid parts of south-central Siberia, although the processes involved in the movement are different. In the case of the Tibetan type, individual blocks slide downslope over the substrate in winter on an icy coating in areas of minimal winter precipitation. The Siberian type develops in areas of 15-80 cm of winter snow cover and an MAAT (mean annual air temperature) of-4 ~C to -17 ~C. The movement is due to creep of snow and ice and collapse of the blocks downslope during thawing. Lag block streams are formed by meltwater flowing over the surface of sediment consisting primarily of larger blocks with a limited amount of interstitial sediment. The erosion of the matrix is primarily in the spring in areas of higher winter precipitation on 10^-30~ slopes. The blocks remain stationary, but the interstitial sediment is washed out by strong seasonal flows of meltwater or rain to form an alluvial fan. The boulders undergo weathering and become more rounded in the process. Lag block streams can also develop without the presence of permafrost in areas with cold climates or glaciers. Block streams also occur as relict deposits in older deposits under various climatic regimes that are unsuitable for their formation today. An example of relict lag block streams with subangular to subrounded blocks occurs in gullies on the forested mountainsides at Felsen in Germany, and is the original "felsenmeer". Similar examples occur near Vitosha Mountain in Bulgaria. The "stone runs" in the Falkland Islands are examples of the more angular relict lag block streams. In both Tasmania and the Falkland Islands, they mask a more complex history, the underlying soils indicating periods of tropical and temperate soil formation resulting from weathering during and since the Tertiary Period. Block streams have also been reported from beneath cold-based glaciers in Sweden, and below till in Canada, and when ex- humed, can continue to develop.
文摘Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd dataset serves as a catalyst for the development ofmore robust and effective fitnesstracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.
基金The National Natural Science Foundation of China(No.60673060)the Natural Science Foundation of Jiangsu Province(No.BK2005047)
文摘A new algorithm for clustering multiple data streams is proposed.The algorithm can effectively cluster data streams which show similar behavior with some unknown time delays.The algorithm uses the autoregressive (AR) modeling technique to measure correlations between data streams.It exploits estimated frequencies spectra to extract the essential features of streams.Each stream is represented as the sum of spectral components and the correlation is measured component-wise.Each spectral component is described by four parameters,namely,amplitude,phase,damping rate and frequency.The ε-lag-correlation between two spectral components is calculated.The algorithm uses such information as similarity measures in clustering data streams.Based on a sliding window model,the algorithm can continuously report the most recent clustering results and adjust the number of clusters.Experiments on real and synthetic streams show that the proposed clustering method has a higher speed and clustering quality than other similar methods.
基金The High Technology Research Plan of Jiangsu Prov-ince (No.BG2004034)the Foundation of Graduate Creative Program ofJiangsu Province (No.xm04-36).
文摘A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry.The first step of this method is to parallel sample the data,which is implemented as an extended reservoir-sampling algorithm.A skip factor based on the change ratio of data-values is introduced to describe the distribution characteristics of data-values adaptively.The second step of this method is to partition the fluxes of data streams averagely,which is implemented with two alternative equal-depth histogram generating algorithms that fit the different cases:one for incremental maintenance based on heuristics and the other for periodical updates to generate an approximate partition vector.The experimental results on actual data prove that the method is efficient,practical and suitable for time-varying data streams processing.
基金funded by the Research Grants Council Fund of Hong Kong(Project number:Poly U152161/14E)Environment and Conservation Fund,Hong Kong(Project number:39/2011)。
文摘Ephemeral and perennial streams of mountainous catchments in Sabaragamuwa Province of Sri Lanka and Hong Kong of China were studied for two years on vegetation dynamics.Each year,sampling was conducted during a period when ephemeral streams had low surface flows.Sampling was realized contiguously using belt transects.The standing crop biomass(hereafter biomass)of herbaceous vegetation in ephemeral channels was comparatively lower than perennials and so was the herb diversity.Herb diversity showed a peak from 1.5 to 4.5 m from the centerline/thalweg of ephemeral and perennial streams.Out of 24 herbs,only three were common for both.A peak herb biomass zone was observed in perennials in the same region where diversity peaked.In ephemerals,herb biomass increased laterally up to^1.5 m,and was constant thereafter.Seedling experiment results tallied with the field diversity observations of both stream types,and suggested that seed dispersion was the main reason for herb colonization.Furthermore,it showed sapling emergence to be significantly higher in perennials than ephemerals.Return period of annual maximum monthly rainfall was a strong indicator of age of trees in ephemeral streams,and elucidated the possibility of hindcasting past flow episodes.Electrical conductivity was significantly high in ephemeral streams among all the water quality parameters.The contents of the water nutrients were approximately the same in both stream types.While recommending further studies on eco-hydrology of ephemerals,we recognize ephemeral streams to be valuable references in climate change studies due to their responsiveness and representativeness in long term hydrological changes.
基金The National Natural Science Foundation of China(No60973023,60603040)the Natural Science Foundation of Southeast University(NoKJ2009362)
文摘In order to avoid the redundant and inconsistent information in distributed data streams, a sampling method based on min-wise hash functions is designed and the practical semantics of the union of distributed data streams is defined. First, for each family of min-wise hash functions, the data with the minimum hash value are selected as local samples and the biased effect caused by frequent updates in a single data stream is filtered out. Secondly, for the same hash function, the sample with the minimum hash value is selected as the global sample and the local samples are combined at the center node to filter out the biased effect of duplicated updates. Finally, based on the obtained uniform samples, several aggregations on the defined semantics of the union of data streams are precisely estimated. The results of comparison tests on synthetic and real-life data streams demonstrate the effectiveness of this method.