Experimental and theoretical studies of the mechanisms of vibration stimulation of oil recovery in watered fields lead to the conclusion that resonance oscillations develop in fractured-block formations. These oscilla...Experimental and theoretical studies of the mechanisms of vibration stimulation of oil recovery in watered fields lead to the conclusion that resonance oscillations develop in fractured-block formations. These oscillations, caused by weak but long-lasting and frequency-stable influences, create the conditions for ultrasonic wave’s generation in the layers, which are capable of destroying thickened oil membranes in reservoir cracks. For fractured-porous reservoirs in the process of exploitation by the method of water high-pressure oil displacement, the possibility of intensifying ultrasonic vibrations can have an important technological significance. Even a very weak ultrasound can destroy, over a long period of time, the viscous oil membranes formed in the cracks between the blocks, which can be the reason for lowering the permeability of the layers and increasing the oil recovery. To describe these effects, it is necessary to consider the wave process in a hierarchically blocky environment and theoretically simulate the mechanism of the appearance of self-oscillations under the action of relaxation shear stresses. For the analysis of seism acoustic response in time on fixed intervals along the borehole an algorithm of phase diagrams of the state of many-phase medium is suggested.展开更多
Clock difference between the ensemble pulsar timescale(PT)and the International Atomic Time(TAI)PT-TAI derived from the International Pulsar Timing Array(IPTA)data set indicates a very similar variation trend with the...Clock difference between the ensemble pulsar timescale(PT)and the International Atomic Time(TAI)PT-TAI derived from the International Pulsar Timing Array(IPTA)data set indicates a very similar variation trend with the Terrestrial Time TT(BIPMXXXX)-TAI but PT has larger measurement error.In this paper,we discuss the smoothing method of PT using a combined smoothing filter and compare the results with that from other filters.The clock difference sequence between PT-TAI and the first time derivative series of the TT(BIPMXXXX)-TAI can be combined by a combined smoothing filter to yield two smooth curves tied by the constraints assuring that the latter is the derivative of the former.The ensemble pulsar time IPTA2016 with respect to TAI published by G.Hobbs et al.and first time derivative series of the TT(BIPM2017)-TAI with quadratic polynomial terms removed are processed by combined smoothing filter in order to demonstrate the properties of the smoothed results.How to correctly estimate two smoothing coefficients is described and the output results of the combined smoothing filter are analyzed.The results show that the combined smoothing method efficiently removes high frequency noises of two input data series and the smoothed data of the PT-TAI combine long term fractional frequency stability of the pulsar time and frequency accuracy of the terrestrial time.Fractional frequency stability analysis indicates that both short and medium time interval stability of the smoothed PT-TAI is improved while keeping its original long term frequency stability level.The combined smoothing filter is more suitable for smoothing observational pulsar timescale data than any filter that only performs smoothing of a single pulsar time series.The smoothed pulsar time by combined smoothing filter is a pulsar atomic time combined timescale.This kind of combined timescale can also be used as terrestrial time.展开更多
A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of inter...A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of interest are first identified in geomagnetic time series by inspection. Time series of these "training events" are represented in matrix form and transpose-multiplied to generate time- domain covariance matrices. The ranked eigenvectors of this matrix are stored as a feature of the pulsation. In the second stage of the algorithm, a sliding window (approxi- mately the width of the training event) is moved across the vector-valued time-series comprising the channels on which the training event was observed. At each window position, the data covariance matrix and associated eigen- vectors are calculated. We compare the orientation of the dominant eigenvectors of the training data to those from the windowed data and flag windows where the dominant eigenvectors directions are similar. This was successful in automatically identifying pulses which share polarization and appear to be from the same source process. We apply the method to a case study of continuously sampled (50 Hz) data from six observatories, each equipped with three- component induction coil magnetometers. We examine a 90-day interval of data associated with a cluster of four observatories located within 50 km of Napa, California, together with two remote reference stations-one 100 km to the north of the cluster and the other 350 km south. When the training data contains signals present in the remote reference observatories, we are reliably able to identify and extract global geomagnetic signals such as solar-generated noise. When training data contains pulsations only observed in the cluster of local observatories, we identify several types of non-plane wave signals having similar polarization.展开更多
As an essential component of bus dwelling time, passenger boarding time has a significant impact on bus running reliability and service quality. In order to understand the passengers’ boarding process and mitigate pa...As an essential component of bus dwelling time, passenger boarding time has a significant impact on bus running reliability and service quality. In order to understand the passengers’ boarding process and mitigate passenger boarding time, a regression analysis framework is proposed to capture the difference and influential factors of boarding time for adult and elderly passengers based on smart card data from Changzhou. Boarding gap, the time difference between two consecutive smart card tapping records, is calculated to approximate passenger boarding time. Analysis of variance is applied to identify whether the difference in boarding time between adults and seniors is statistically significant. The multivariate regression modeling approach is implemented to analyze the influences of passenger types, marginal effects of each additional boarding passenger and bus floor types on the total boarding time at each stop. Results show that a constant difference exists in boarding time between adults and seniors even without considering the specific bus characteristics. The average passenger boarding time decreases when the number of passenger increases. The existence of two entrance steps delays the boarding process, especially for elderly passengers.展开更多
This study aims at assessing the relative impacts of four major components of the tropical Pacific Ocean observing system on assimilation of temperature and salinity fields. Observations were collected over a period b...This study aims at assessing the relative impacts of four major components of the tropical Pacific Ocean observing system on assimilation of temperature and salinity fields. Observations were collected over a period between January 2001 through June 2003 including temperature data from the expendable bathythermographs (XBT), thermistor data from the Tropical Ocean Global Atmosphere Tropical Atmosphere-Ocean (TOGA-TAO) mooring array, sea level anomalies from the Topex/Poseidon and Jason-1 altimetry (T/P-J), and temperature and salinity profiles from the Array for Real-time Geostrophic Oceanography (ARGO) floats. An efficient three-dimensional variational analysis-based method was introduced to assimilate the above data into the tropical-Pacific circulation model. To evaluate the impact of the individual component of the observing system, four observation system experiments were carried out. The experiment that assimilated all four components of the observing system was taken as the reference. The other three experiments were implemented by withholding one of the four components. Results show that the spatial distribution of the data influences its relative contribution. XBT observations produce the most distinguished effects on temperature analyses in the off-equatorial region due to the large amount of measurements and high quality. Similarly, the impact of TAO is dominant in the equatorial region due to the focus of the spatial distribution. The Topex/Poseidon-Jason-1 can be highly complementary where the XBT and TAO observations are sparse. The contribution of XBT or TAO on the assimilated salinity is made by the model dynamics because no salinity observations from them are assimilated. Therefore, T/P-J, as a main source for providing salinity data, has been shown to have greater impacts than either XBT or TAO on the salinity analysis. Although ARGO includes the subsurface observations, the relatively smaller number of observation makes it have the smallest contribution to the assimilation system.展开更多
In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze ...In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze the collected data and the second to conduct exploratory data analysis of the collected vehicle data.In particular,we analyzed the dangerous driving patterns of motorists based on the safety standards of the Korea Transportation Safety Authority.We also analyzed seasonal fuel efciency(four seasons)and mileage of vehicles,and identied rapid acceleration,rapid deceleration,sudden stopping(harsh braking),quick starting,sudden left turn,sudden right turn and sudden U-turn driving patterns of vehicles.We implemented the density-based spatial clustering of applications with a noise algorithm for trajectory analysis based on GPS(Global Positioning System)data and designed a long shortterm memory algorithm and an auto-regressive integrated moving average model for time-series data analysis.In this paper,we mainly describe the development environment of the analysis software,the structure and data ow of the overall analysis platform,the conguration of the collected vehicle data,and the various algorithms used in the analysis.Finally,we present illustrative results of our analysis,such as dangerous driving patterns that were detected.展开更多
The aim of this study is to establish the prevailing conditions of changing climatic trends and change point dates in four selected meteorological stations of Uyo, Benin, Port Harcourt, and Warri in the Niger Delta re...The aim of this study is to establish the prevailing conditions of changing climatic trends and change point dates in four selected meteorological stations of Uyo, Benin, Port Harcourt, and Warri in the Niger Delta region of Nigeria. Using daily or 24-hourly annual maximum series (AMS) data with the Indian Meteorological Department (IMD) and the modified Chowdury Indian Meteorological Department (MCIMD) models were adopted to downscale the time series data. Mann-Kendall (MK) trend and Sen’s Slope Estimator (SSE) test showed a statistically significant trend for Uyo and Benin, while Port Harcourt and Warri showed mild trends. The Sen’s Slope magnitude and variation rate were 21.6, 10.8, 6.00 and 4.4 mm/decade, respectively. The trend change-point analysis showed the initial rainfall change-point dates as 2002, 2005, 1988, and 2000 for Uyo, Benin, Port Harcourt, and Warri, respectively. These prove positive changing climatic conditions for rainfall in the study area. Erosion and flood control facilities analysis and design in the Niger Delta will require the application of Non-stationary IDF modelling.展开更多
Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algor...Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.展开更多
Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with o...Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with opportunities to discover valuable intelligence from the massive user generated text streams. However, the traditional content analysis frameworks are inefficient to handle the unprecedentedly big volume of unstructured text streams and the complexity of text analysis tasks for the real time opinion analysis on the big data streams. In this paper, we propose a parallel real time sentiment analysis system: Social Media Data Stream Sentiment Analysis Service (SMDSSAS) that performs multiple phases of sentiment analysis of social media text streams effectively in real time with two fully analytic opinion mining models to combat the scale of text data streams and the complexity of sentiment analysis processing on unstructured text streams. We propose two aspect based opinion mining models: Deterministic and Probabilistic sentiment models for a real time sentiment analysis on the user given topic related data streams. Experiments on the social media Twitter stream traffic captured during the pre-election weeks of the 2016 Presidential election for real-time analysis of public opinions toward two presidential candidates showed that the proposed system was able to predict correctly Donald Trump as the winner of the 2016 Presidential election. The cross validation results showed that the proposed sentiment models with the real-time streaming components in our proposed framework delivered effectively the analysis of the opinions on two presidential candidates with average 81% accuracy for the Deterministic model and 80% for the Probabilistic model, which are 1% - 22% improvements from the results of the existing literature.展开更多
Weighted priority queueing is a modification of priority queueing that eliminates the possibility of blocking lower priority traffic. The weights assigned to priority classes determine the fractions of the bandwith th...Weighted priority queueing is a modification of priority queueing that eliminates the possibility of blocking lower priority traffic. The weights assigned to priority classes determine the fractions of the bandwith that are guaranteed for individual traffic classes, similarly as in weighted fair queueing. The paper describes a timed Petri net model of weighted priority queueing and uses discrete-event simulation of this model to obtain performance characteristics of simple queueing systems. The model is also used to analyze the effects of finite queue capacity on the performance of queueing systems.展开更多
In this in-depth exploration, I delve into the complex implications and costs of cybersecurity breaches. Venturing beyond just the immediate repercussions, the research unearths both the overt and concealed long-term ...In this in-depth exploration, I delve into the complex implications and costs of cybersecurity breaches. Venturing beyond just the immediate repercussions, the research unearths both the overt and concealed long-term consequences that businesses encounter. This study integrates findings from various research, including quantitative reports, drawing upon real-world incidents faced by both small and large enterprises. This investigation emphasizes the profound intangible costs, such as trade name devaluation and potential damage to brand reputation, which can persist long after the breach. By collating insights from industry experts and a myriad of research, the study provides a comprehensive perspective on the profound, multi-dimensional impacts of cybersecurity incidents. The overarching aim is to underscore the often-underestimated scope and depth of these breaches, emphasizing the entire timeline post-incident and the urgent need for fortified preventative and reactive measures in the digital domain.展开更多
The time-frequency analysis and anomaly detection of wavelet transformation make the method irresistibly advan- tageous in non-stable signal processing. In the paper, the two characteristics are analyzed and demonstr...The time-frequency analysis and anomaly detection of wavelet transformation make the method irresistibly advan- tageous in non-stable signal processing. In the paper, the two characteristics are analyzed and demonstrated with synthetic signal. By applying wavelet transformation to deformation data processing, we find that about 4 months before strong earthquakes, several deformation stations near the epicenter received at the same time the abnormal signal with the same frequency and the period from several days to more than ten days. The GPS observation sta- tions near the epicenter all received the abnormal signal whose period is from 3 months to half a year. These ab- normal signals are possibly earthquake precursors.展开更多
The time cost of ridesharing rental represents a crucial factor influencing users'decisions to rent a car.Researchers have explored this aspect through text analysis and questionnaires.However,the current research...The time cost of ridesharing rental represents a crucial factor influencing users'decisions to rent a car.Researchers have explored this aspect through text analysis and questionnaires.However,the current research faces limitations in terms of data quantity and analysis methods,preventing the extraction of key information.Therefore,there is a need to further optimize the level of public opinion analysis.This study aimed to investigate user perspectives concerning travel time in ridesharing,both pre and post-pandemic,within the Twitter application.Our analysis focused on a dataset from users residing in the USA and India,with considerations for demographic variables such as age and gender.To accomplish our research objectives,we employed Latent Dirichlet Allocation for topic modeling and BERT for sentiment analysis.Our findings revealed significant influences of the pandemic and the user's country of origin on sentiment.Notably,there was a discernible increase in positive sentiment among users from both countries following the pandemic,particularly among older individuals.These findings bear relevance to the ridesharing industry,offering insights that can aid in establishing benchmarks for improving travel time.Such improvements are instrumental in enabling ridesharing companies to effectively compete with other public transportation alternatives.展开更多
This paper presents some installation and data analysis issues from an ongoing urban air temperature and humidity measurement campaign in Hangzhou and Ningbo, China. The location of the measurement sites, the position...This paper presents some installation and data analysis issues from an ongoing urban air temperature and humidity measurement campaign in Hangzhou and Ningbo, China. The location of the measurement sites, the positioning of the sensors and the harsh conditions in an urban environment can result in missing values and observations that are unrepresentative of the local urban microclimate. Missing data and erroneous values in micro-scale weather time series can produce bias in the data analysis, false correlations and wrong conclusions when deriving the specific local weather patterns. A methodology is presented for the identification of values that could be false and for determining whether these are “noise”. Seven statistical methods were evaluated in their performance for replacing missing and erroneous values in urban weather time series. The two methods that proposed replacement with the mean values from sensors in locations with a Sky View Factor similar to that of the target sensor and the sensors closest to the target’s location performed well for all Day-Night and Cold-Warm days scenarios. However, during night time in warm weather the replacement with the mean values for air temperature of the nearest locations outperformed all other methods. The results give some initial evidence of the distinctive urban microclimate development in time and space under different regional weather forcings.展开更多
Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspon...Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspondence analysis to a large space-time data set for multiple environmental variables is shown. In particular, nitrogen dioxide and carbon monoxide hourly concentrations measured during January 1999 at several monitored stations in a district of Northern Italy are analyzed. The procedure consists in transforming the continuous variables into categorical ones by the means of appropriate indicator variables, generating special contingency tables and applying correspondence analysis. The use of this classical multivariate technique allows the identification of important relationships among pollution levels and monitoring stations and/or relationships among pollution levels and observation times.展开更多
The objective of this paper is to utilize images of spatial and temporal fluctuations of temperature over the Earth to study the global climate variation. We illustrated that monthly temperature observations from weat...The objective of this paper is to utilize images of spatial and temporal fluctuations of temperature over the Earth to study the global climate variation. We illustrated that monthly temperature observations from weather stations could be decomposed as components with different time scales based on their spectral distribution. Kolmogorov-Zurbenko (KZ) filters were applied to smooth and interpolate gridded temperature data to construct global maps for long-term (≥ 6 years) trends and El Nino-like (2 to 5 years) movements over the time period of 1893 to 2008. Annual temperature seasonality, latitude and altitude effects have been carefully accounted for to capture meaningful spatiotemporal patterns of climate variability. The result revealed striking facts about global temperature anomalies for specific regions. Correlation analysis and the movie of thermal maps for El Nino-like component clearly supported the existence of such climate fluctuations in time and space.展开更多
In order to provide important parameters for schedule designing, decision-making bases for transit operation management and references for passengers traveling by bus, bus transit travel time reliability is analyzed a...In order to provide important parameters for schedule designing, decision-making bases for transit operation management and references for passengers traveling by bus, bus transit travel time reliability is analyzed and evaluated based on automatic vehicle location (AVL) data. Based on the statistical analysis of the bus transit travel time, six indices including the coefficient of variance, the width of travel time distribution, the mean commercial speed, the congestion frequency, the planning time index and the buffer time index are proposed. Moreover, a framework for evaluating bus transit travel time reliability is constructed. Finally, a case study on a certain bus route in Suzhou is conducted. Results show that the proposed evaluation index system is simple and intuitive, and it can effectively reflect the efficiency and stability of bus operations. And a distinguishing feature of bus transit travel time reliability is the temporal pattern. It varies across different time periods.展开更多
This paper examines how cybersecurity is developing and how it relates to more conventional information security. Although information security and cyber security are sometimes used synonymously, this study contends t...This paper examines how cybersecurity is developing and how it relates to more conventional information security. Although information security and cyber security are sometimes used synonymously, this study contends that they are not the same. The concept of cyber security is explored, which goes beyond protecting information resources to include a wider variety of assets, including people [1]. Protecting information assets is the main goal of traditional information security, with consideration to the human element and how people fit into the security process. On the other hand, cyber security adds a new level of complexity, as people might unintentionally contribute to or become targets of cyberattacks. This aspect presents moral questions since it is becoming more widely accepted that society has a duty to protect weaker members of society, including children [1]. The study emphasizes how important cyber security is on a larger scale, with many countries creating plans and laws to counteract cyberattacks. Nevertheless, a lot of these sources frequently neglect to define the differences or the relationship between information security and cyber security [1]. The paper focus on differentiating between cybersecurity and information security on a larger scale. The study also highlights other areas of cybersecurity which includes defending people, social norms, and vital infrastructure from threats that arise from online in addition to information and technology protection. It contends that ethical issues and the human factor are becoming more and more important in protecting assets in the digital age, and that cyber security is a paradigm shift in this regard [1].展开更多
To understand the variations in vegetation and their correlation with climate factors in the upper catchments of the Yellow River, China, Normalized Difference Vegetation Index(NDVI) time series data from 2000 to 20...To understand the variations in vegetation and their correlation with climate factors in the upper catchments of the Yellow River, China, Normalized Difference Vegetation Index(NDVI) time series data from 2000 to 2010 were collected based on the MOD13Q1 product. The coefficient of variation, Theil–Sen median trend analysis and the Mann–Kendall test were combined to investigate the volatility characteristic and trend characteristic of the vegetation. Climate data sets were then used to analyze the correlation between variations in vegetation and climate change. In terms of the temporal variations, the vegetation in this study area improved slightly from 2000 to 2010, although the volatility characteristic was larger in 2000–2005 than in 2006–2010. In terms of the spatial variation, vegetation which is relatively stable and has a significantly increasing trend accounts for the largest part of the study area. Its spatial distribution is highly correlated with altitude, which ranges from about 2000 to 3000 m in this area. Highly fluctuating vegetation and vegetation which showed a significantly decreasing trend were mostly distributed around the reservoirs and in the reaches of the river with hydropower developments. Vegetation with a relatively stable and significantly decreasing trend and vegetation with a highly fluctuating and significantly increasing trend are widely dispersed. With respect to the response of vegetation to climate change, about 20–30% of the vegetation has a significant correlation with climatic factors and the correlations in most areas are positive: regions with precipitation as the key influencing factor account for more than 10% of the area; regions with temperature as the key influencing factor account for less than 10% of the area; and regions with precipitation and temperature as the key influencing factors together account for about 5% of the total area. More than 70% of the vegetation has an insignificant correlation with climatic factors.展开更多
文摘Experimental and theoretical studies of the mechanisms of vibration stimulation of oil recovery in watered fields lead to the conclusion that resonance oscillations develop in fractured-block formations. These oscillations, caused by weak but long-lasting and frequency-stable influences, create the conditions for ultrasonic wave’s generation in the layers, which are capable of destroying thickened oil membranes in reservoir cracks. For fractured-porous reservoirs in the process of exploitation by the method of water high-pressure oil displacement, the possibility of intensifying ultrasonic vibrations can have an important technological significance. Even a very weak ultrasound can destroy, over a long period of time, the viscous oil membranes formed in the cracks between the blocks, which can be the reason for lowering the permeability of the layers and increasing the oil recovery. To describe these effects, it is necessary to consider the wave process in a hierarchically blocky environment and theoretically simulate the mechanism of the appearance of self-oscillations under the action of relaxation shear stresses. For the analysis of seism acoustic response in time on fixed intervals along the borehole an algorithm of phase diagrams of the state of many-phase medium is suggested.
基金supported by the Strategic Priority Research Program of Chinese Academy of Sciences(grant No.XDA0350502)the National SKA Program of China(grant No.2020SKA0120103)the National Natural Science Foundation of China(NSFC,Grant Nos.U1831130 and 11973046)。
文摘Clock difference between the ensemble pulsar timescale(PT)and the International Atomic Time(TAI)PT-TAI derived from the International Pulsar Timing Array(IPTA)data set indicates a very similar variation trend with the Terrestrial Time TT(BIPMXXXX)-TAI but PT has larger measurement error.In this paper,we discuss the smoothing method of PT using a combined smoothing filter and compare the results with that from other filters.The clock difference sequence between PT-TAI and the first time derivative series of the TT(BIPMXXXX)-TAI can be combined by a combined smoothing filter to yield two smooth curves tied by the constraints assuring that the latter is the derivative of the former.The ensemble pulsar time IPTA2016 with respect to TAI published by G.Hobbs et al.and first time derivative series of the TT(BIPM2017)-TAI with quadratic polynomial terms removed are processed by combined smoothing filter in order to demonstrate the properties of the smoothed results.How to correctly estimate two smoothing coefficients is described and the output results of the combined smoothing filter are analyzed.The results show that the combined smoothing method efficiently removes high frequency noises of two input data series and the smoothed data of the PT-TAI combine long term fractional frequency stability of the pulsar time and frequency accuracy of the terrestrial time.Fractional frequency stability analysis indicates that both short and medium time interval stability of the smoothed PT-TAI is improved while keeping its original long term frequency stability level.The combined smoothing filter is more suitable for smoothing observational pulsar timescale data than any filter that only performs smoothing of a single pulsar time series.The smoothed pulsar time by combined smoothing filter is a pulsar atomic time combined timescale.This kind of combined timescale can also be used as terrestrial time.
文摘A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of interest are first identified in geomagnetic time series by inspection. Time series of these "training events" are represented in matrix form and transpose-multiplied to generate time- domain covariance matrices. The ranked eigenvectors of this matrix are stored as a feature of the pulsation. In the second stage of the algorithm, a sliding window (approxi- mately the width of the training event) is moved across the vector-valued time-series comprising the channels on which the training event was observed. At each window position, the data covariance matrix and associated eigen- vectors are calculated. We compare the orientation of the dominant eigenvectors of the training data to those from the windowed data and flag windows where the dominant eigenvectors directions are similar. This was successful in automatically identifying pulses which share polarization and appear to be from the same source process. We apply the method to a case study of continuously sampled (50 Hz) data from six observatories, each equipped with three- component induction coil magnetometers. We examine a 90-day interval of data associated with a cluster of four observatories located within 50 km of Napa, California, together with two remote reference stations-one 100 km to the north of the cluster and the other 350 km south. When the training data contains signals present in the remote reference observatories, we are reliably able to identify and extract global geomagnetic signals such as solar-generated noise. When training data contains pulsations only observed in the cluster of local observatories, we identify several types of non-plane wave signals having similar polarization.
基金The National Natural Science Foundation of China(No.51338003,71801041)
文摘As an essential component of bus dwelling time, passenger boarding time has a significant impact on bus running reliability and service quality. In order to understand the passengers’ boarding process and mitigate passenger boarding time, a regression analysis framework is proposed to capture the difference and influential factors of boarding time for adult and elderly passengers based on smart card data from Changzhou. Boarding gap, the time difference between two consecutive smart card tapping records, is calculated to approximate passenger boarding time. Analysis of variance is applied to identify whether the difference in boarding time between adults and seniors is statistically significant. The multivariate regression modeling approach is implemented to analyze the influences of passenger types, marginal effects of each additional boarding passenger and bus floor types on the total boarding time at each stop. Results show that a constant difference exists in boarding time between adults and seniors even without considering the specific bus characteristics. The average passenger boarding time decreases when the number of passenger increases. The existence of two entrance steps delays the boarding process, especially for elderly passengers.
基金supported by the 973 Program(Grant No.2006CB403606)the National Natural Science Foundation of China(Grant No.40606008).
文摘This study aims at assessing the relative impacts of four major components of the tropical Pacific Ocean observing system on assimilation of temperature and salinity fields. Observations were collected over a period between January 2001 through June 2003 including temperature data from the expendable bathythermographs (XBT), thermistor data from the Tropical Ocean Global Atmosphere Tropical Atmosphere-Ocean (TOGA-TAO) mooring array, sea level anomalies from the Topex/Poseidon and Jason-1 altimetry (T/P-J), and temperature and salinity profiles from the Array for Real-time Geostrophic Oceanography (ARGO) floats. An efficient three-dimensional variational analysis-based method was introduced to assimilate the above data into the tropical-Pacific circulation model. To evaluate the impact of the individual component of the observing system, four observation system experiments were carried out. The experiment that assimilated all four components of the observing system was taken as the reference. The other three experiments were implemented by withholding one of the four components. Results show that the spatial distribution of the data influences its relative contribution. XBT observations produce the most distinguished effects on temperature analyses in the off-equatorial region due to the large amount of measurements and high quality. Similarly, the impact of TAO is dominant in the equatorial region due to the focus of the spatial distribution. The Topex/Poseidon-Jason-1 can be highly complementary where the XBT and TAO observations are sparse. The contribution of XBT or TAO on the assimilated salinity is made by the model dynamics because no salinity observations from them are assimilated. Therefore, T/P-J, as a main source for providing salinity data, has been shown to have greater impacts than either XBT or TAO on the salinity analysis. Although ARGO includes the subsurface observations, the relatively smaller number of observation makes it have the smallest contribution to the assimilation system.
基金supported by the Technology Innovation Program(10083633,Development on Big Data Analysis Technology and Business Service for Connected Vehicles)funded by the Ministry of Trade,Industry&Energy(MOTIE,Korea)。
文摘In this study,we developed software for vehicle big data analysis to analyze the time-series data of connected vehicles.We designed two software modules:The rst to derive the Pearson correlation coefcients to analyze the collected data and the second to conduct exploratory data analysis of the collected vehicle data.In particular,we analyzed the dangerous driving patterns of motorists based on the safety standards of the Korea Transportation Safety Authority.We also analyzed seasonal fuel efciency(four seasons)and mileage of vehicles,and identied rapid acceleration,rapid deceleration,sudden stopping(harsh braking),quick starting,sudden left turn,sudden right turn and sudden U-turn driving patterns of vehicles.We implemented the density-based spatial clustering of applications with a noise algorithm for trajectory analysis based on GPS(Global Positioning System)data and designed a long shortterm memory algorithm and an auto-regressive integrated moving average model for time-series data analysis.In this paper,we mainly describe the development environment of the analysis software,the structure and data ow of the overall analysis platform,the conguration of the collected vehicle data,and the various algorithms used in the analysis.Finally,we present illustrative results of our analysis,such as dangerous driving patterns that were detected.
文摘The aim of this study is to establish the prevailing conditions of changing climatic trends and change point dates in four selected meteorological stations of Uyo, Benin, Port Harcourt, and Warri in the Niger Delta region of Nigeria. Using daily or 24-hourly annual maximum series (AMS) data with the Indian Meteorological Department (IMD) and the modified Chowdury Indian Meteorological Department (MCIMD) models were adopted to downscale the time series data. Mann-Kendall (MK) trend and Sen’s Slope Estimator (SSE) test showed a statistically significant trend for Uyo and Benin, while Port Harcourt and Warri showed mild trends. The Sen’s Slope magnitude and variation rate were 21.6, 10.8, 6.00 and 4.4 mm/decade, respectively. The trend change-point analysis showed the initial rainfall change-point dates as 2002, 2005, 1988, and 2000 for Uyo, Benin, Port Harcourt, and Warri, respectively. These prove positive changing climatic conditions for rainfall in the study area. Erosion and flood control facilities analysis and design in the Niger Delta will require the application of Non-stationary IDF modelling.
文摘Clustering is used to gain an intuition of the struc tures in the data.Most of the current clustering algorithms pro duce a clustering structure even on data that do not possess such structure.In these cases,the algorithms force a structure in the data instead of discovering one.To avoid false structures in the relations of data,a novel clusterability assessment method called density-based clusterability measure is proposed in this paper.I measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningfu insight to the relationships in the data.This is especially useful in time-series data since visualizing the structure in time-series data is hard.The performance of the clusterability measure is evalu ated against several synthetic data sets and time-series data sets which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
文摘Opinion (sentiment) analysis on big data streams from the constantly generated text streams on social media networks to hundreds of millions of online consumer reviews provides many organizations in every field with opportunities to discover valuable intelligence from the massive user generated text streams. However, the traditional content analysis frameworks are inefficient to handle the unprecedentedly big volume of unstructured text streams and the complexity of text analysis tasks for the real time opinion analysis on the big data streams. In this paper, we propose a parallel real time sentiment analysis system: Social Media Data Stream Sentiment Analysis Service (SMDSSAS) that performs multiple phases of sentiment analysis of social media text streams effectively in real time with two fully analytic opinion mining models to combat the scale of text data streams and the complexity of sentiment analysis processing on unstructured text streams. We propose two aspect based opinion mining models: Deterministic and Probabilistic sentiment models for a real time sentiment analysis on the user given topic related data streams. Experiments on the social media Twitter stream traffic captured during the pre-election weeks of the 2016 Presidential election for real-time analysis of public opinions toward two presidential candidates showed that the proposed system was able to predict correctly Donald Trump as the winner of the 2016 Presidential election. The cross validation results showed that the proposed sentiment models with the real-time streaming components in our proposed framework delivered effectively the analysis of the opinions on two presidential candidates with average 81% accuracy for the Deterministic model and 80% for the Probabilistic model, which are 1% - 22% improvements from the results of the existing literature.
文摘Weighted priority queueing is a modification of priority queueing that eliminates the possibility of blocking lower priority traffic. The weights assigned to priority classes determine the fractions of the bandwith that are guaranteed for individual traffic classes, similarly as in weighted fair queueing. The paper describes a timed Petri net model of weighted priority queueing and uses discrete-event simulation of this model to obtain performance characteristics of simple queueing systems. The model is also used to analyze the effects of finite queue capacity on the performance of queueing systems.
文摘In this in-depth exploration, I delve into the complex implications and costs of cybersecurity breaches. Venturing beyond just the immediate repercussions, the research unearths both the overt and concealed long-term consequences that businesses encounter. This study integrates findings from various research, including quantitative reports, drawing upon real-world incidents faced by both small and large enterprises. This investigation emphasizes the profound intangible costs, such as trade name devaluation and potential damage to brand reputation, which can persist long after the breach. By collating insights from industry experts and a myriad of research, the study provides a comprehensive perspective on the profound, multi-dimensional impacts of cybersecurity incidents. The overarching aim is to underscore the often-underestimated scope and depth of these breaches, emphasizing the entire timeline post-incident and the urgent need for fortified preventative and reactive measures in the digital domain.
基金Joint Seismological Science Foundation of China (604021) and National Natural Science Foundation of China(40074024).
文摘The time-frequency analysis and anomaly detection of wavelet transformation make the method irresistibly advan- tageous in non-stable signal processing. In the paper, the two characteristics are analyzed and demonstrated with synthetic signal. By applying wavelet transformation to deformation data processing, we find that about 4 months before strong earthquakes, several deformation stations near the epicenter received at the same time the abnormal signal with the same frequency and the period from several days to more than ten days. The GPS observation sta- tions near the epicenter all received the abnormal signal whose period is from 3 months to half a year. These ab- normal signals are possibly earthquake precursors.
基金supported by the Chinese National Natural Science Foundation(52172348)the Postdoctoral Research Foundation of China.
文摘The time cost of ridesharing rental represents a crucial factor influencing users'decisions to rent a car.Researchers have explored this aspect through text analysis and questionnaires.However,the current research faces limitations in terms of data quantity and analysis methods,preventing the extraction of key information.Therefore,there is a need to further optimize the level of public opinion analysis.This study aimed to investigate user perspectives concerning travel time in ridesharing,both pre and post-pandemic,within the Twitter application.Our analysis focused on a dataset from users residing in the USA and India,with considerations for demographic variables such as age and gender.To accomplish our research objectives,we employed Latent Dirichlet Allocation for topic modeling and BERT for sentiment analysis.Our findings revealed significant influences of the pandemic and the user's country of origin on sentiment.Notably,there was a discernible increase in positive sentiment among users from both countries following the pandemic,particularly among older individuals.These findings bear relevance to the ridesharing industry,offering insights that can aid in establishing benchmarks for improving travel time.Such improvements are instrumental in enabling ridesharing companies to effectively compete with other public transportation alternatives.
基金L.B.would like to thank the“Liveable Cities Project”for funding a visit to Hangzhou and Ningbo in China for researching on the urban micro-climate and to collabo-rate with the Centre for Sustainable Energy Technologies at the University of Nottingham Ningbo(EPSRC funded:EP/J017698/1)The installation work of the sensors’network in Hangzhou and Ningbo is supported by the Ningbo Natu-ral Science Foundation(No.2012A610173)the Ningbo Housing and Urban-Rural Development Committee(No.201206).
文摘This paper presents some installation and data analysis issues from an ongoing urban air temperature and humidity measurement campaign in Hangzhou and Ningbo, China. The location of the measurement sites, the positioning of the sensors and the harsh conditions in an urban environment can result in missing values and observations that are unrepresentative of the local urban microclimate. Missing data and erroneous values in micro-scale weather time series can produce bias in the data analysis, false correlations and wrong conclusions when deriving the specific local weather patterns. A methodology is presented for the identification of values that could be false and for determining whether these are “noise”. Seven statistical methods were evaluated in their performance for replacing missing and erroneous values in urban weather time series. The two methods that proposed replacement with the mean values from sensors in locations with a Sky View Factor similar to that of the target sensor and the sensors closest to the target’s location performed well for all Day-Night and Cold-Warm days scenarios. However, during night time in warm weather the replacement with the mean values for air temperature of the nearest locations outperformed all other methods. The results give some initial evidence of the distinctive urban microclimate development in time and space under different regional weather forcings.
文摘Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspondence analysis to a large space-time data set for multiple environmental variables is shown. In particular, nitrogen dioxide and carbon monoxide hourly concentrations measured during January 1999 at several monitored stations in a district of Northern Italy are analyzed. The procedure consists in transforming the continuous variables into categorical ones by the means of appropriate indicator variables, generating special contingency tables and applying correspondence analysis. The use of this classical multivariate technique allows the identification of important relationships among pollution levels and monitoring stations and/or relationships among pollution levels and observation times.
文摘The objective of this paper is to utilize images of spatial and temporal fluctuations of temperature over the Earth to study the global climate variation. We illustrated that monthly temperature observations from weather stations could be decomposed as components with different time scales based on their spectral distribution. Kolmogorov-Zurbenko (KZ) filters were applied to smooth and interpolate gridded temperature data to construct global maps for long-term (≥ 6 years) trends and El Nino-like (2 to 5 years) movements over the time period of 1893 to 2008. Annual temperature seasonality, latitude and altitude effects have been carefully accounted for to capture meaningful spatiotemporal patterns of climate variability. The result revealed striking facts about global temperature anomalies for specific regions. Correlation analysis and the movie of thermal maps for El Nino-like component clearly supported the existence of such climate fluctuations in time and space.
基金The Soft Science Research Project of Ministry of Housing and Urban-Rural Development of China (No. 2008-k5-14)
文摘In order to provide important parameters for schedule designing, decision-making bases for transit operation management and references for passengers traveling by bus, bus transit travel time reliability is analyzed and evaluated based on automatic vehicle location (AVL) data. Based on the statistical analysis of the bus transit travel time, six indices including the coefficient of variance, the width of travel time distribution, the mean commercial speed, the congestion frequency, the planning time index and the buffer time index are proposed. Moreover, a framework for evaluating bus transit travel time reliability is constructed. Finally, a case study on a certain bus route in Suzhou is conducted. Results show that the proposed evaluation index system is simple and intuitive, and it can effectively reflect the efficiency and stability of bus operations. And a distinguishing feature of bus transit travel time reliability is the temporal pattern. It varies across different time periods.
文摘This paper examines how cybersecurity is developing and how it relates to more conventional information security. Although information security and cyber security are sometimes used synonymously, this study contends that they are not the same. The concept of cyber security is explored, which goes beyond protecting information resources to include a wider variety of assets, including people [1]. Protecting information assets is the main goal of traditional information security, with consideration to the human element and how people fit into the security process. On the other hand, cyber security adds a new level of complexity, as people might unintentionally contribute to or become targets of cyberattacks. This aspect presents moral questions since it is becoming more widely accepted that society has a duty to protect weaker members of society, including children [1]. The study emphasizes how important cyber security is on a larger scale, with many countries creating plans and laws to counteract cyberattacks. Nevertheless, a lot of these sources frequently neglect to define the differences or the relationship between information security and cyber security [1]. The paper focus on differentiating between cybersecurity and information security on a larger scale. The study also highlights other areas of cybersecurity which includes defending people, social norms, and vital infrastructure from threats that arise from online in addition to information and technology protection. It contends that ethical issues and the human factor are becoming more and more important in protecting assets in the digital age, and that cyber security is a paradigm shift in this regard [1].
基金National Natural Science Foundation of China,No.41171318 National Key Technology Support Program,No.2012BAH32B03+1 种基金No.2012BAH33B05 The Remote Sensing Investigation and Assessment Project for Decade-Change of the National Ecological Environment(2000–2010)
文摘To understand the variations in vegetation and their correlation with climate factors in the upper catchments of the Yellow River, China, Normalized Difference Vegetation Index(NDVI) time series data from 2000 to 2010 were collected based on the MOD13Q1 product. The coefficient of variation, Theil–Sen median trend analysis and the Mann–Kendall test were combined to investigate the volatility characteristic and trend characteristic of the vegetation. Climate data sets were then used to analyze the correlation between variations in vegetation and climate change. In terms of the temporal variations, the vegetation in this study area improved slightly from 2000 to 2010, although the volatility characteristic was larger in 2000–2005 than in 2006–2010. In terms of the spatial variation, vegetation which is relatively stable and has a significantly increasing trend accounts for the largest part of the study area. Its spatial distribution is highly correlated with altitude, which ranges from about 2000 to 3000 m in this area. Highly fluctuating vegetation and vegetation which showed a significantly decreasing trend were mostly distributed around the reservoirs and in the reaches of the river with hydropower developments. Vegetation with a relatively stable and significantly decreasing trend and vegetation with a highly fluctuating and significantly increasing trend are widely dispersed. With respect to the response of vegetation to climate change, about 20–30% of the vegetation has a significant correlation with climatic factors and the correlations in most areas are positive: regions with precipitation as the key influencing factor account for more than 10% of the area; regions with temperature as the key influencing factor account for less than 10% of the area; and regions with precipitation and temperature as the key influencing factors together account for about 5% of the total area. More than 70% of the vegetation has an insignificant correlation with climatic factors.