N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insi...N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level.Although a variety of identification algorithms have been proposed recently,most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences,while ignoring the structural dependencies of nucleotides in their threedimensional structures.To overcome this issue,we propose a cross-species end-to-end deep learning model,namely CR-NSSD,which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification.Specifically,CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory.It then constructs a crossdomain reconstruction encoder to learn the sequential and structural dependencies between nucleotides.By minimizing the reconstruction and binary cross-entropy losses,CR-NSSD is trained to complete the task of m6A site identification.Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms.Moreover,the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species,thus improving the accuracy of cross-species identification.展开更多
Thucydides asserts that the occupation of Decelea by the Spartans in 413 BC made the grain supply for Athens costly by forcing the transport from land onto the sea.This calls into question the well-established consens...Thucydides asserts that the occupation of Decelea by the Spartans in 413 BC made the grain supply for Athens costly by forcing the transport from land onto the sea.This calls into question the well-established consensus that sea transport was far cheaper than land transport.This paper contends that the cost of protecting supply lines-specifically the expenses associated with the warships which escorted the supply ships-rendered the grain transported on the new route exceptionally costly.In this paper,the benefits and drawbacks of a maritime economy,including transaction costs,trade dependencies,and the capabilities of warships and supply ships are discussed.展开更多
The escalation of compound extreme events has resulted in noteworthy economic and property losses.Recognizing the intricate interconnections among these events has become imperative.To tackle this challenge,we have fo...The escalation of compound extreme events has resulted in noteworthy economic and property losses.Recognizing the intricate interconnections among these events has become imperative.To tackle this challenge,we have formulated a comprehensive framework for the systematic analysis of their dependencies.This framework consists of three steps.(1)Define extreme events using Mahalanobis distance thresholds.(2)Represent dependencies among multiple extreme events through a point process-based method.(3)Verify dependencies with residual tail coefficients,determining thefinal dependency structure.Applying this framework to assess the extreme dependence of precipitation on wind speed and temperature in China,revealed four distinct dependency structures.In northern,Jianghuai,and southern China,precipitation heavily relies on wind speed,while tempera-tures maintain relative independence.In northeastern and northwestern China,precipitation exhibits relative independence,yet a notable dependence exists between temperatures and wind speed.In southwestern China,precipitation strongly depends on temperature,while wind speed remains relatively indepen-dent.The Qinghai–Tibet Plateau region displays a significant dependence relationship among precipitation,wind speed,and temperature,with weaker dependence between extreme wind speed and temperature.This framework is instrumental for analyzing dependencies among extreme values in compound events.展开更多
In this paper, the definition of approximate XFDs based on value equality is proposed. Two metrics, sup port and strength, are presented for measuring the degree of approximate XFD. A basic algorithm is designed for e...In this paper, the definition of approximate XFDs based on value equality is proposed. Two metrics, sup port and strength, are presented for measuring the degree of approximate XFD. A basic algorithm is designed for extracting minimal set of approximate XFDs, and then two optimized strategies are proposed to improve the performance. Finally, the experimental results show that the optimized algorithms are correct and effective.展开更多
Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is v...Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data Anomalies may be due to the poverty of their semantic descriptions, or even the absence of their description. In this paper, we propose an approach to better understand the semantics and the structure of the data. Our approach helps to correct automatically the intra-column anomalies and the inter-col- umns ones. We aim to improve the quality of data by processing the null values and the semantic dependencies between columns.展开更多
According to the analysis of existing complicated functional dependencies constraint, we conclude the conditions of defining functional dependency in XML, and then we introduce the concept of the node value equality. ...According to the analysis of existing complicated functional dependencies constraint, we conclude the conditions of defining functional dependency in XML, and then we introduce the concept of the node value equality. A new path language and a new definition of functional dependencies in XML (XFD) are proposed XFD includes the relative XFD and the absolute XFD, in which absolute key and relative key are the particular cases. We focus on the logical implication and the closure problems, and propose a group of inference rules. Finally, some proofs of the correctness and completeness are given. XFD is powerful on expressing functional dependencies in XML causing data redundancy, and has a complete axiom system.展开更多
Theory of rough sets, proposed by Zdzislaw Pawlak in 1982, is a model of approximate reasoning. In applications, rough set methodology focuses on approximate representation of knowledge derivable from data. It leads t...Theory of rough sets, proposed by Zdzislaw Pawlak in 1982, is a model of approximate reasoning. In applications, rough set methodology focuses on approximate representation of knowledge derivable from data. It leads to significant results in many areas including, for example, finance, industry, multimedia, medicine, and most recently bioinformatics.展开更多
In the field of data-driven bearing fault diagnosis,convolutional neural network(CNN)has been widely researched and applied due to its superior feature extraction and classification ability.However,the convolutional o...In the field of data-driven bearing fault diagnosis,convolutional neural network(CNN)has been widely researched and applied due to its superior feature extraction and classification ability.However,the convolutional operation could only process a local neighborhood at a time and thus lack the ability of capturing long-range dependencies.Therefore,building an efficient learning method for long-range dependencies is crucial to comprehend and express signal features considering that the vibration signals obtained in a real industrial environment always have strong instability,periodicity,and temporal correlation.This paper introduces nonlocal mean to the CNN and presents a 1D nonlocal block(1D-NLB)to extract long-range dependencies.The 1D-NLB computes the response at a position as a weighted average value of the features at all positions.Based on it,we propose a nonlocal 1D convolutional neural network(NL-1DCNN)aiming at rolling bearing fault diagnosis.Furthermore,the 1D-NLB could be simply plugged into most existing deep learning architecture to improve their fault diagnosis ability.Under multiple noise conditions,the 1D-NLB improves the performance of the CNN on the wheelset bearing data set of high-speed train and the Case Western Reserve University bearing data set.The experiment results show that the NL-1DCNN exhibits superior results compared with six state-of-the-art fault diagnosis methods.展开更多
This study examines the time and regime dependencies of sensitive areas identified by the conditional nonlinear optiflml perturbation (CNOP) method for forecasts of two typhoons. Typhoon Meari (2004) was weakly no...This study examines the time and regime dependencies of sensitive areas identified by the conditional nonlinear optiflml perturbation (CNOP) method for forecasts of two typhoons. Typhoon Meari (2004) was weakly nonlinear and is herein referred to as the linear case, while Typhoon Matsa (2005) was strongly nonlinear and is herein referred to as the nonlinear case. In the linear case, the sensitive areas identified for special forecast times when the initial time was fixed resembled those identified for other forecast times. Targeted observations deployed to improve a special time forecast would thus also benefit forecasts at other times. In the nonlinear case, the similarities among the sensitive areas identified for different forecast times were more limited. The deployment of targeted observations in the nonlinear case would therefore need to be adapted to achieve large improvements for different targeted forecasts. For both cases, the closer the forecast time, the higher the similarities of the sensitive areas. When the forecast time was fixed, the sensitive areas in the linear case diverged continuously from the verification area as the forecast period lengthened, while those in the nonlinear case were always located around the initial cyclones. The deployment of targeted observations to improve a special forecast depends strongly on the time of deployment. An examination of the efficiency gained by reducing initial errors within the identified sensitive areas confirmed these results. In general, the greatest improvement in a special time forecast was obtained by identifying the sensitive areas for the corresponding forecast time period.展开更多
This paper summarizes the main instrumental and methodological points of the tidal research which was performed in the framework of the National Scientific Research Fund Project K101603. Since the project is still run...This paper summarizes the main instrumental and methodological points of the tidal research which was performed in the framework of the National Scientific Research Fund Project K101603. Since the project is still running the tidal analysis results published here are only preliminary. Unmodelled tidal effects have been highlighted in some recent absolute gravity measurements carried out in the Pannonian basin resulting in a periodic modulation exceeding the typical standard deviations (±1microGal) of the drop sets. Since the most dominant source of the daily gravity variation is the bulk tidal effect, the goal of the project is to check its location dependency at BGal level. Unfortunately Hungary has had no dedicated instrumentation, so an effort was made to make the available LaCoste- Romberg spring G meters capable for continuous recording. As a reference instrument the GWR SG025 operated in the Conrad Observatory, Austria was also used and in the mean time of the project, a Scintrex CG-5 became also available, Eventually 6 instruments at 5 different locations were operated for 3 9 months mainly in co-located configuration. Although many experiments (moving mass calibrations) were done to determine the scale factors and scale functions of the instruments, the direct comparison of the tidal parameters obtained from the observations is still questionable. Therefore the ratio of the delta factors of O1 and M2 tidal constituents was investigated supposing that M2 is much more influenced by the ocean loading effect than O1. The slight detected increase of δ(O1 )/δ(M2) (≈0.2%) toward east does not contradict to theory. This result has to be validated in the near future by analyzing available ocean loading models.展开更多
With the growing popularity of data-intensive services on the Internet, the traditional process-centric model for business process meets challenges due to the lack of abilities to describe data semantics and dependenc...With the growing popularity of data-intensive services on the Internet, the traditional process-centric model for business process meets challenges due to the lack of abilities to describe data semantics and dependencies, resulting in the inflexibility of the design and implement for the processes. This paper proposes a novel data-aware business process model which is able to describe both explicit control flow and implicit data flow. Data model with dependencies which are formulated by Linear-time Temporal Logic(LTL) is presented, and their satisfiability is validated by an automaton-based model checking algorithm. Data dependencies are fully considered in modeling phase, which helps to improve the efficiency and reliability of programming during developing phase. Finally, a prototype system based on j BPM for data-aware workflow is designed using such model, and has been deployed to Beijing Kingfore heating management system to validate the flexibility, efficacy and convenience of our approach for massive coding and large-scale system management in reality.展开更多
This study addresses whether gold exhibits the function of a hedge or safe haven as often referred to in academia.It contributes to the existing literature by(i)revisiting this question for the principal stock markets...This study addresses whether gold exhibits the function of a hedge or safe haven as often referred to in academia.It contributes to the existing literature by(i)revisiting this question for the principal stock markets in the Middle East and North Africa(MENA)region and(ii)using the copula-quantile-on-quantile and conditional value at risk methods to detail the risks facing market participants provided with accurate information about various gold and stock market scenarios(i.e.,bear,normal,bull).The results provide strong evidence of quantile dependence between gold and stock returns.Positive correlations are found between MENA gold and stock markets when both are bullish.Conversely,when stock returns are bearish,gold markets show negative correlations with MENA stock markets.The risk spillover from gold to stock markets intensified during the global financial and European crises.Given the risk spillover between gold and stock markets,investors in MENA markets should be careful when considering gold as a safe haven because its effectiveness as a hedge is not the same in all MENA stock markets.Investors and portfolio managers should rebalance their portfolio compositions under various gold and stock market conditions.Overall,such precise insights about the heterogeneous linkages and spillovers between gold and MENA stock returns provide potential input for developing effective hedging strategies and optimal portfolio allocations.展开更多
In uncertain data management, lineages are often used for probability computation of result tuples. However, most of existing works focus on tuple level lineage, which results in imprecise data derivation. Besides, co...In uncertain data management, lineages are often used for probability computation of result tuples. However, most of existing works focus on tuple level lineage, which results in imprecise data derivation. Besides, correlations among attributes cannot be captured. In this paper, for base tuples with multiple uncertain attributes, we define attribute level annotation to annotate each attribute. Utilizing these annotations to generate lineages of result tuples can realize more precise derivation. Simultaneously,they can be used for dependency graph construction. Utilizing dependency graph, we can represent not only constraints on schemas but also correlations among attributes. Combining the dependency graph and attribute level lineage, we can correctly compute probabilities of result tuples and precisely derivate data. In experiments, comparing lineage on tuple level and attribute level, it shows that our method has advantages on derivation precision and storage cost.展开更多
Temperature and doping dependencies of the transport properties have been calculated using an ensemble Monte Carlo simulation. We consider the polar optical phonon, acoustic phonons, piezoelectric, intervalley scatter...Temperature and doping dependencies of the transport properties have been calculated using an ensemble Monte Carlo simulation. We consider the polar optical phonon, acoustic phonons, piezoelectric, intervalley scatterings and Charged impurity scattering model of Ridley;furthermore, a non nonparabolic three-valley model is used. Our simulation results have shown that the electron velocity in GaN is less sensitive to changes in temperature than that associated with GaAs. Also it is found that GaN exhibits high peak drift velocity at room temperature, 2.8 × 105m/s, at doping concentration of 1 × 1020 m–3and the electron drift velocity relaxes to the saturation value of 1.3 × 105 m/s which is much larger than that of GaAs. The weakening of the phonon emission rate at low temperature explains the extremely high low field mobility. Our results suggest that the transport characteristics of GaN are superior to that of GaAs, over a wide range of temperatures, from 100 K to 700 K, and doping concentrations, up to 1 × 1025展开更多
Conspecific negative density dependencies(CNDDs)foster biodiversity through reducing the chances of competitive exclusion in plant communities and have therefore fascinated ecologists.A major driver of CNDDs is plant-...Conspecific negative density dependencies(CNDDs)foster biodiversity through reducing the chances of competitive exclusion in plant communities and have therefore fascinated ecologists.A major driver of CNDDs is plant-soil feedback,and a lot of the literature assumes that the triggers of CNDDs concur with those for plant-soil feedback.Here,we suggest that a core assumption of a lot of the literature on CNDDs,that CNDDs are stronger in AM-associated than ECM-associated trees,is not quite as well supported as widely claimed.We think that dismissing this very important consideration prevents us from identifying a major gap in the literature on CNDDs.The vast majority of the literature on mycorrhiza-induced CNDDs originates from temperate systems,but the findings are extrapolated across divergent ecosystems.We then develop the argument that likely propagule limitations for arbuscular mycorrhizal trees in temperate forests might be inducing stronger CNDDs than they do at propagule sufficiency,which arbuscular mycorrhizal trees usually experience in other systems.We are thus contributing a new hypothesis in the field of mycorrhizal ecology with the potential to unify observations across scales and biomes.展开更多
Precise determination of the Higgs boson self-couplings is essential for understanding the mechanism underlying electroweak symmetry breaking.However,owing to the limited number of Higgs boson pair events at the LHC,o...Precise determination of the Higgs boson self-couplings is essential for understanding the mechanism underlying electroweak symmetry breaking.However,owing to the limited number of Higgs boson pair events at the LHC,only loose constraints have been established to date.Current constraints are based on the assumption that the cross section is a quadratic function of the trilinear Higgs self-coupling within the framework.Incorporating higher-order quantum corrections from virtual Higgs bosons would significantly alter this functional form,introducing new quartic and cubic power dependencies on the trilinear Higgs self-coupling.To derive this new functional form,we propose a specialized renormalization procedure that tracks all Higgs self-couplings at each calculation step.Additionally,we introduce renormalization constants for coupling modifiers within the framework to ensure the cancellation of all ultraviolet divergences.With new functional forms of the cross sections in both the gluon-gluon fusion and vector boson fusion channels,the upper limit of kλ_(3H)=λ_(3H)^(SM)set by the ATLAS(CMS)collaboration is reduced from 6.6(6.49)to 5.4(5.37).However,extracting a meaningful constraint on the quartic Higgs self-coupling from Higgs boson pair production data remains challenging.We also present the invariant mass distributions of the Higgs boson pair at different values of the self-couplings,which could assist in setting optimal cuts for experimental analysis.展开更多
Automation of plant phenotyping using data from high-dimensional imaging sensors is on the forefront of agricultural research for its potential to improve seasonal yield by monitoring crop health and accelerating bree...Automation of plant phenotyping using data from high-dimensional imaging sensors is on the forefront of agricultural research for its potential to improve seasonal yield by monitoring crop health and accelerating breeding programs.A common challenge when capturing images in the field relates to the spectral reflection of sunlight(glare)from crop leaves that,at certain solar incidences and sensor viewing angles,presents unwanted signals.The research presented here involves the convergence of 2 parallel projects to develop a facile algorithm that can use polarization data to decouple light reflected from the surface of the leaves and light scattered from the leaf's tissue.展开更多
Predicting human motion based on historical motion sequences is a fundamental problem in computer vision,which is at the core of many applications.Existing approaches primarily focus on encoding spatial dependencies a...Predicting human motion based on historical motion sequences is a fundamental problem in computer vision,which is at the core of many applications.Existing approaches primarily focus on encoding spatial dependencies among human joints while ignoring the temporal cues and the complex relationships across non-consecutive frames.These limitations hinder the model’s ability to generate accurate predictions over longer time horizons and in scenarios with complex motion patterns.To address the above problems,we proposed a novel multi-level spatial and temporal learning model,which consists of a Cross Spatial Dependencies Encoding Module(CSM)and a Dynamic Temporal Connection Encoding Module(DTM).Specifically,the CSM is designed to capture complementary local and global spatial dependent information at both the joint level and the joint pair level.We further present DTM to encode diverse temporal evolution contexts and compress motion features to a deep level,enabling the model to capture both short-term and long-term dependencies efficiently.Extensive experiments conducted on the Human 3.6M and CMU Mocap datasets demonstrate that our model achieves state-of-the-art performance in both short-term and long-term predictions,outperforming existing methods by up to 20.3% in accuracy.Furthermore,ablation studies confirm the significant contributions of the CSM and DTM in enhancing prediction accuracy.展开更多
Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of cer- tain attribute values in pairs of database tuples when some similarity conditions on other values are satisfie...Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of cer- tain attribute values in pairs of database tuples when some similarity conditions on other values are satisfied. Their en- forcement can be seen as a natural generalization of entity resolution. In what we call the pure case of MD enforce- ment, an arbitrary value from the underlying data domain can be used for the value in common that is used for a match- ing. However, the overall number of changes of attribute val- ues is expected to be kept to a minimum. We investigate this case in terms of semantics and the properties of data clean- ing through the enforcement of MDs. We characterize the in- tended clean instances, and also the clean answers to queries, as those that are invariant under the cleaning process. The complexity of computing clean instances and clean query an- swering is investigated. Tractable and intractable cases de- pending on the MDs are identified and characterized.展开更多
Backfill is often employed in mining operations for ground support,with its positive impact on ground stability acknowledged in many underground mines.However,existing studies have predominantly focused only on the st...Backfill is often employed in mining operations for ground support,with its positive impact on ground stability acknowledged in many underground mines.However,existing studies have predominantly focused only on the stress development within the backfill material,leaving the influence of stope backfilling on stress distribution in surrounding rock mass and ground stability largely unexplored.Therefore,this paper presents numerical models in FLAC3D to investigate,for the first time,the time-dependent stress redistribution around a vertical backfilled stope and its implications on ground stability,considering the creep of surrounding rock mass.Using the Soft Soil constitutive model,the compressibility of backfill under large pressure was captured.It is found that the creep deformation of rock mass exercises compression on backfill and results in a less void ratio and increased modulus for fill material.The compacted backfill conversely influenced the stress distribution and ground stability of rock mass which was a combined effect of wall creep and compressibility of backfill.With the increase of time or/and creep deformation,the minimum principal stress in the rocks surrounding the backfilled stope increased towards the pre-mining stress state,while the deviatoric stress reduces leading to an increased factor of safety and improved ground stability.This improvement effect of backfill on ground stability increased with the increase of mine depth and stope height,while it is also more pronounced for the narrow stope,the backfill with a smaller compression index,and the soft rocks with a smaller viscosity coefficient.Furthermore,the results emphasize the importance of minimizing empty time and backfilling extracted stope as soon as possible for ground control.Reduction of filling gap height enhances the local stability around the roof of stope.展开更多
基金supported in part by the National Natural Science Foundation of China(62373348)the Natural Science Foundation of Xinjiang Uygur Autonomous Region(2021D01D05)+1 种基金the Tianshan Talent Training Program(2023TSYCLJ0021)the Pioneer Hundred Talents Program of Chinese Academy of Sciences.
文摘N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level.Although a variety of identification algorithms have been proposed recently,most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences,while ignoring the structural dependencies of nucleotides in their threedimensional structures.To overcome this issue,we propose a cross-species end-to-end deep learning model,namely CR-NSSD,which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification.Specifically,CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory.It then constructs a crossdomain reconstruction encoder to learn the sequential and structural dependencies between nucleotides.By minimizing the reconstruction and binary cross-entropy losses,CR-NSSD is trained to complete the task of m6A site identification.Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms.Moreover,the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species,thus improving the accuracy of cross-species identification.
文摘Thucydides asserts that the occupation of Decelea by the Spartans in 413 BC made the grain supply for Athens costly by forcing the transport from land onto the sea.This calls into question the well-established consensus that sea transport was far cheaper than land transport.This paper contends that the cost of protecting supply lines-specifically the expenses associated with the warships which escorted the supply ships-rendered the grain transported on the new route exceptionally costly.In this paper,the benefits and drawbacks of a maritime economy,including transaction costs,trade dependencies,and the capabilities of warships and supply ships are discussed.
基金National Key R&D Program of China,Grant/Award Number:2022YFC3002705National Natural Science Foundation of China,Grant/Award Number:5220904China Institute of Water Resources and Hydropower Research,Grant/Award Number:SKL2022TS11。
文摘The escalation of compound extreme events has resulted in noteworthy economic and property losses.Recognizing the intricate interconnections among these events has become imperative.To tackle this challenge,we have formulated a comprehensive framework for the systematic analysis of their dependencies.This framework consists of three steps.(1)Define extreme events using Mahalanobis distance thresholds.(2)Represent dependencies among multiple extreme events through a point process-based method.(3)Verify dependencies with residual tail coefficients,determining thefinal dependency structure.Applying this framework to assess the extreme dependence of precipitation on wind speed and temperature in China,revealed four distinct dependency structures.In northern,Jianghuai,and southern China,precipitation heavily relies on wind speed,while tempera-tures maintain relative independence.In northeastern and northwestern China,precipitation exhibits relative independence,yet a notable dependence exists between temperatures and wind speed.In southwestern China,precipitation strongly depends on temperature,while wind speed remains relatively indepen-dent.The Qinghai–Tibet Plateau region displays a significant dependence relationship among precipitation,wind speed,and temperature,with weaker dependence between extreme wind speed and temperature.This framework is instrumental for analyzing dependencies among extreme values in compound events.
基金Supported by the National Natural Science Foun-dation of China (60173051) , Teaching and Research Award Programfor Outstanding Young Teachers in Higher Education Institution ofthe Ministry of Education,the National Research Foundation for theDoctoral Programof Higher Education of China(20030145029) ,andthe Natural Science Foundationfor Doctoral Career Award of LiaoningProvince(20041016)
文摘In this paper, the definition of approximate XFDs based on value equality is proposed. Two metrics, sup port and strength, are presented for measuring the degree of approximate XFD. A basic algorithm is designed for extracting minimal set of approximate XFDs, and then two optimized strategies are proposed to improve the performance. Finally, the experimental results show that the optimized algorithms are correct and effective.
文摘Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data Anomalies may be due to the poverty of their semantic descriptions, or even the absence of their description. In this paper, we propose an approach to better understand the semantics and the structure of the data. Our approach helps to correct automatically the intra-column anomalies and the inter-col- umns ones. We aim to improve the quality of data by processing the null values and the semantic dependencies between columns.
基金Supported by the National Natural Science Foundation of China (60573089)the National High Technology Research and Development Program of China (2006AA09Z139)
文摘According to the analysis of existing complicated functional dependencies constraint, we conclude the conditions of defining functional dependency in XML, and then we introduce the concept of the node value equality. A new path language and a new definition of functional dependencies in XML (XFD) are proposed XFD includes the relative XFD and the absolute XFD, in which absolute key and relative key are the particular cases. We focus on the logical implication and the closure problems, and propose a group of inference rules. Finally, some proofs of the correctness and completeness are given. XFD is powerful on expressing functional dependencies in XML causing data redundancy, and has a complete axiom system.
文摘Theory of rough sets, proposed by Zdzislaw Pawlak in 1982, is a model of approximate reasoning. In applications, rough set methodology focuses on approximate representation of knowledge derivable from data. It leads to significant results in many areas including, for example, finance, industry, multimedia, medicine, and most recently bioinformatics.
基金supported by the State Key Laboratory of Traction Power,Southwest Jiaotong University (TPL2104)the National Natural Science Foundation of China (61833002).
文摘In the field of data-driven bearing fault diagnosis,convolutional neural network(CNN)has been widely researched and applied due to its superior feature extraction and classification ability.However,the convolutional operation could only process a local neighborhood at a time and thus lack the ability of capturing long-range dependencies.Therefore,building an efficient learning method for long-range dependencies is crucial to comprehend and express signal features considering that the vibration signals obtained in a real industrial environment always have strong instability,periodicity,and temporal correlation.This paper introduces nonlocal mean to the CNN and presents a 1D nonlocal block(1D-NLB)to extract long-range dependencies.The 1D-NLB computes the response at a position as a weighted average value of the features at all positions.Based on it,we propose a nonlocal 1D convolutional neural network(NL-1DCNN)aiming at rolling bearing fault diagnosis.Furthermore,the 1D-NLB could be simply plugged into most existing deep learning architecture to improve their fault diagnosis ability.Under multiple noise conditions,the 1D-NLB improves the performance of the CNN on the wheelset bearing data set of high-speed train and the Case Western Reserve University bearing data set.The experiment results show that the NL-1DCNN exhibits superior results compared with six state-of-the-art fault diagnosis methods.
基金supported by the National Natural Science Foundation of China(Grant Nos.41105038and40830955)the NationalKey Technology R&D Program(Grant No.2012BAC22B03)
文摘This study examines the time and regime dependencies of sensitive areas identified by the conditional nonlinear optiflml perturbation (CNOP) method for forecasts of two typhoons. Typhoon Meari (2004) was weakly nonlinear and is herein referred to as the linear case, while Typhoon Matsa (2005) was strongly nonlinear and is herein referred to as the nonlinear case. In the linear case, the sensitive areas identified for special forecast times when the initial time was fixed resembled those identified for other forecast times. Targeted observations deployed to improve a special time forecast would thus also benefit forecasts at other times. In the nonlinear case, the similarities among the sensitive areas identified for different forecast times were more limited. The deployment of targeted observations in the nonlinear case would therefore need to be adapted to achieve large improvements for different targeted forecasts. For both cases, the closer the forecast time, the higher the similarities of the sensitive areas. When the forecast time was fixed, the sensitive areas in the linear case diverged continuously from the verification area as the forecast period lengthened, while those in the nonlinear case were always located around the initial cyclones. The deployment of targeted observations to improve a special forecast depends strongly on the time of deployment. An examination of the efficiency gained by reducing initial errors within the identified sensitive areas confirmed these results. In general, the greatest improvement in a special time forecast was obtained by identifying the sensitive areas for the corresponding forecast time period.
基金the financial support of NKFIH-OTKA in the framework of contract K101603
文摘This paper summarizes the main instrumental and methodological points of the tidal research which was performed in the framework of the National Scientific Research Fund Project K101603. Since the project is still running the tidal analysis results published here are only preliminary. Unmodelled tidal effects have been highlighted in some recent absolute gravity measurements carried out in the Pannonian basin resulting in a periodic modulation exceeding the typical standard deviations (±1microGal) of the drop sets. Since the most dominant source of the daily gravity variation is the bulk tidal effect, the goal of the project is to check its location dependency at BGal level. Unfortunately Hungary has had no dedicated instrumentation, so an effort was made to make the available LaCoste- Romberg spring G meters capable for continuous recording. As a reference instrument the GWR SG025 operated in the Conrad Observatory, Austria was also used and in the mean time of the project, a Scintrex CG-5 became also available, Eventually 6 instruments at 5 different locations were operated for 3 9 months mainly in co-located configuration. Although many experiments (moving mass calibrations) were done to determine the scale factors and scale functions of the instruments, the direct comparison of the tidal parameters obtained from the observations is still questionable. Therefore the ratio of the delta factors of O1 and M2 tidal constituents was investigated supposing that M2 is much more influenced by the ocean loading effect than O1. The slight detected increase of δ(O1 )/δ(M2) (≈0.2%) toward east does not contradict to theory. This result has to be validated in the near future by analyzing available ocean loading models.
基金supported by the National Natural Science Foundation of China (No. 61502043, No. 61132001)Beijing Natural Science Foundation (No. 4162042)BeiJing Talents Fund (No. 2015000020124G082)
文摘With the growing popularity of data-intensive services on the Internet, the traditional process-centric model for business process meets challenges due to the lack of abilities to describe data semantics and dependencies, resulting in the inflexibility of the design and implement for the processes. This paper proposes a novel data-aware business process model which is able to describe both explicit control flow and implicit data flow. Data model with dependencies which are formulated by Linear-time Temporal Logic(LTL) is presented, and their satisfiability is validated by an automaton-based model checking algorithm. Data dependencies are fully considered in modeling phase, which helps to improve the efficiency and reliability of programming during developing phase. Finally, a prototype system based on j BPM for data-aware workflow is designed using such model, and has been deployed to Beijing Kingfore heating management system to validate the flexibility, efficacy and convenience of our approach for massive coding and large-scale system management in reality.
文摘This study addresses whether gold exhibits the function of a hedge or safe haven as often referred to in academia.It contributes to the existing literature by(i)revisiting this question for the principal stock markets in the Middle East and North Africa(MENA)region and(ii)using the copula-quantile-on-quantile and conditional value at risk methods to detail the risks facing market participants provided with accurate information about various gold and stock market scenarios(i.e.,bear,normal,bull).The results provide strong evidence of quantile dependence between gold and stock returns.Positive correlations are found between MENA gold and stock markets when both are bullish.Conversely,when stock returns are bearish,gold markets show negative correlations with MENA stock markets.The risk spillover from gold to stock markets intensified during the global financial and European crises.Given the risk spillover between gold and stock markets,investors in MENA markets should be careful when considering gold as a safe haven because its effectiveness as a hedge is not the same in all MENA stock markets.Investors and portfolio managers should rebalance their portfolio compositions under various gold and stock market conditions.Overall,such precise insights about the heterogeneous linkages and spillovers between gold and MENA stock returns provide potential input for developing effective hedging strategies and optimal portfolio allocations.
基金Supported by the Key Program of National Natural Science Foundation of China(61232002)The National Natural Science Foundation of China(61202033)+2 种基金The Program for Innovative Research Team of Wuhan(2014070504020237)The Ph.D.Seed Foundation of Wuhan University(2012211020207)The Science and Technology Support Program of Hubei Province(2015BAA127)
文摘In uncertain data management, lineages are often used for probability computation of result tuples. However, most of existing works focus on tuple level lineage, which results in imprecise data derivation. Besides, correlations among attributes cannot be captured. In this paper, for base tuples with multiple uncertain attributes, we define attribute level annotation to annotate each attribute. Utilizing these annotations to generate lineages of result tuples can realize more precise derivation. Simultaneously,they can be used for dependency graph construction. Utilizing dependency graph, we can represent not only constraints on schemas but also correlations among attributes. Combining the dependency graph and attribute level lineage, we can correctly compute probabilities of result tuples and precisely derivate data. In experiments, comparing lineage on tuple level and attribute level, it shows that our method has advantages on derivation precision and storage cost.
文摘Temperature and doping dependencies of the transport properties have been calculated using an ensemble Monte Carlo simulation. We consider the polar optical phonon, acoustic phonons, piezoelectric, intervalley scatterings and Charged impurity scattering model of Ridley;furthermore, a non nonparabolic three-valley model is used. Our simulation results have shown that the electron velocity in GaN is less sensitive to changes in temperature than that associated with GaAs. Also it is found that GaN exhibits high peak drift velocity at room temperature, 2.8 × 105m/s, at doping concentration of 1 × 1020 m–3and the electron drift velocity relaxes to the saturation value of 1.3 × 105 m/s which is much larger than that of GaAs. The weakening of the phonon emission rate at low temperature explains the extremely high low field mobility. Our results suggest that the transport characteristics of GaN are superior to that of GaAs, over a wide range of temperatures, from 100 K to 700 K, and doping concentrations, up to 1 × 1025
基金the National Natural Science Foundation of China(Grant:“MycoDisp:Implications of connectance of mycorrhizal habitats for the functioning of ecosystems”,with the grant agreement number C0311-32371721).
文摘Conspecific negative density dependencies(CNDDs)foster biodiversity through reducing the chances of competitive exclusion in plant communities and have therefore fascinated ecologists.A major driver of CNDDs is plant-soil feedback,and a lot of the literature assumes that the triggers of CNDDs concur with those for plant-soil feedback.Here,we suggest that a core assumption of a lot of the literature on CNDDs,that CNDDs are stronger in AM-associated than ECM-associated trees,is not quite as well supported as widely claimed.We think that dismissing this very important consideration prevents us from identifying a major gap in the literature on CNDDs.The vast majority of the literature on mycorrhiza-induced CNDDs originates from temperate systems,but the findings are extrapolated across divergent ecosystems.We then develop the argument that likely propagule limitations for arbuscular mycorrhizal trees in temperate forests might be inducing stronger CNDDs than they do at propagule sufficiency,which arbuscular mycorrhizal trees usually experience in other systems.We are thus contributing a new hypothesis in the field of mycorrhizal ecology with the potential to unify observations across scales and biomes.
基金Supported in part by the National Natural Science Foundation of China(12275156,12321005,12375076)and the Taishan Scholar Foundation of Shandong province(tsqn201909011)。
文摘Precise determination of the Higgs boson self-couplings is essential for understanding the mechanism underlying electroweak symmetry breaking.However,owing to the limited number of Higgs boson pair events at the LHC,only loose constraints have been established to date.Current constraints are based on the assumption that the cross section is a quadratic function of the trilinear Higgs self-coupling within the framework.Incorporating higher-order quantum corrections from virtual Higgs bosons would significantly alter this functional form,introducing new quartic and cubic power dependencies on the trilinear Higgs self-coupling.To derive this new functional form,we propose a specialized renormalization procedure that tracks all Higgs self-couplings at each calculation step.Additionally,we introduce renormalization constants for coupling modifiers within the framework to ensure the cancellation of all ultraviolet divergences.With new functional forms of the cross sections in both the gluon-gluon fusion and vector boson fusion channels,the upper limit of kλ_(3H)=λ_(3H)^(SM)set by the ATLAS(CMS)collaboration is reduced from 6.6(6.49)to 5.4(5.37).However,extracting a meaningful constraint on the quartic Higgs self-coupling from Higgs boson pair production data remains challenging.We also present the invariant mass distributions of the Higgs boson pair at different values of the self-couplings,which could assist in setting optimal cuts for experimental analysis.
基金supported by Division of Electrical,Communications and Cyber Systems(1809753)National Institute of Food and Agriculture(2020-67021-31961).
文摘Automation of plant phenotyping using data from high-dimensional imaging sensors is on the forefront of agricultural research for its potential to improve seasonal yield by monitoring crop health and accelerating breeding programs.A common challenge when capturing images in the field relates to the spectral reflection of sunlight(glare)from crop leaves that,at certain solar incidences and sensor viewing angles,presents unwanted signals.The research presented here involves the convergence of 2 parallel projects to develop a facile algorithm that can use polarization data to decouple light reflected from the surface of the leaves and light scattered from the leaf's tissue.
基金supported by the Urgent Need for Overseas Talent Project of Jiangxi Province(Grant No.20223BCJ25040)the Thousand Talents Plan of Jiangxi Province(Grant No.jxsg2023101085)+3 种基金the National Natural Science Foundation of China(Grant No.62106093)the Natural Science Foundation of Jiangxi(Grant Nos.20224BAB212011,20232BAB212008,20242BAB25078,and 20232BAB202051)The Youth Talent Cultivation Innovation Fund Project of Nanchang University(Grant No.XX202506030015)funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2025R759),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘Predicting human motion based on historical motion sequences is a fundamental problem in computer vision,which is at the core of many applications.Existing approaches primarily focus on encoding spatial dependencies among human joints while ignoring the temporal cues and the complex relationships across non-consecutive frames.These limitations hinder the model’s ability to generate accurate predictions over longer time horizons and in scenarios with complex motion patterns.To address the above problems,we proposed a novel multi-level spatial and temporal learning model,which consists of a Cross Spatial Dependencies Encoding Module(CSM)and a Dynamic Temporal Connection Encoding Module(DTM).Specifically,the CSM is designed to capture complementary local and global spatial dependent information at both the joint level and the joint pair level.We further present DTM to encode diverse temporal evolution contexts and compress motion features to a deep level,enabling the model to capture both short-term and long-term dependencies efficiently.Extensive experiments conducted on the Human 3.6M and CMU Mocap datasets demonstrate that our model achieves state-of-the-art performance in both short-term and long-term predictions,outperforming existing methods by up to 20.3% in accuracy.Furthermore,ablation studies confirm the significant contributions of the CSM and DTM in enhancing prediction accuracy.
文摘Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of cer- tain attribute values in pairs of database tuples when some similarity conditions on other values are satisfied. Their en- forcement can be seen as a natural generalization of entity resolution. In what we call the pure case of MD enforce- ment, an arbitrary value from the underlying data domain can be used for the value in common that is used for a match- ing. However, the overall number of changes of attribute val- ues is expected to be kept to a minimum. We investigate this case in terms of semantics and the properties of data clean- ing through the enforcement of MDs. We characterize the in- tended clean instances, and also the clean answers to queries, as those that are invariant under the cleaning process. The complexity of computing clean instances and clean query an- swering is investigated. Tractable and intractable cases de- pending on the MDs are identified and characterized.
基金the funding support from the National Natural Science Foundation of China(Grant Nos.52304101 and 52004206)the China Postdoctoral Science Foundation(Grant No.2023MD734215)。
文摘Backfill is often employed in mining operations for ground support,with its positive impact on ground stability acknowledged in many underground mines.However,existing studies have predominantly focused only on the stress development within the backfill material,leaving the influence of stope backfilling on stress distribution in surrounding rock mass and ground stability largely unexplored.Therefore,this paper presents numerical models in FLAC3D to investigate,for the first time,the time-dependent stress redistribution around a vertical backfilled stope and its implications on ground stability,considering the creep of surrounding rock mass.Using the Soft Soil constitutive model,the compressibility of backfill under large pressure was captured.It is found that the creep deformation of rock mass exercises compression on backfill and results in a less void ratio and increased modulus for fill material.The compacted backfill conversely influenced the stress distribution and ground stability of rock mass which was a combined effect of wall creep and compressibility of backfill.With the increase of time or/and creep deformation,the minimum principal stress in the rocks surrounding the backfilled stope increased towards the pre-mining stress state,while the deviatoric stress reduces leading to an increased factor of safety and improved ground stability.This improvement effect of backfill on ground stability increased with the increase of mine depth and stope height,while it is also more pronounced for the narrow stope,the backfill with a smaller compression index,and the soft rocks with a smaller viscosity coefficient.Furthermore,the results emphasize the importance of minimizing empty time and backfilling extracted stope as soon as possible for ground control.Reduction of filling gap height enhances the local stability around the roof of stope.