A personalized outfit recommendation has emerged as a hot research topic in the fashion domain.However,existing recommendations do not fully exploit user style preferences.Typically,users prefer particular styles such...A personalized outfit recommendation has emerged as a hot research topic in the fashion domain.However,existing recommendations do not fully exploit user style preferences.Typically,users prefer particular styles such as casual and athletic styles,and consider attributes like color and texture when selecting outfits.To achieve personalized outfit recommendations in line with user style preferences,this paper proposes a personal style guided outfit recommendation with multi-modal fashion compatibility modeling,termed as PSGNet.Firstly,a style classifier is designed to categorize fashion images of various clothing types and attributes into distinct style categories.Secondly,a personal style prediction module extracts user style preferences by analyzing historical data.Then,to address the limitations of single-modal representations and enhance fashion compatibility,both fashion images and text data are leveraged to extract multi-modal features.Finally,PSGNet integrates these components through Bayesian personalized ranking(BPR)to unify the personal style and fashion compatibility,where the former is used as personal style features and guides the output of the personalized outfit recommendation tailored to the target user.Extensive experiments on large-scale datasets demonstrate that the proposed model is efficient on the personalized outfit recommendation.展开更多
Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation and sign language. With increasing motion sensor development, multiple data sources have become availa...Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation and sign language. With increasing motion sensor development, multiple data sources have become available, which leads to the rise of multi-modal gesture recognition. Since our previous approach to gesture recognition depends on a unimodal system, it is difficult to classify similar motion patterns. In order to solve this problem, a novel approach which integrates motion, audio and video models is proposed by using dataset captured by Kinect. The proposed system can recognize observed gestures by using three models. Recognition results of three models are integrated by using the proposed framework and the output becomes the final result. The motion and audio models are learned by using Hidden Markov Model. Random Forest which is the video classifier is used to learn the video model. In the experiments to test the performances of the proposed system, the motion and audio models most suitable for gesture recognition are chosen by varying feature vectors and learning methods. Additionally, the unimodal and multi-modal models are compared with respect to recognition accuracy. All the experiments are conducted on dataset provided by the competition organizer of MMGRC, which is a workshop for Multi-Modal Gesture Recognition Challenge. The comparison results show that the multi-modal model composed of three models scores the highest recognition rate. This improvement of recognition accuracy means that the complementary relationship among three models improves the accuracy of gesture recognition. The proposed system provides the application technology to understand human actions of daily life more precisely.展开更多
The improved version of Los Alamos model with the multi-modal fission approach is used to analyse the prompt fission neutron spectrum and multiplicity for the neutron-induced fission of 237Np. The spectra of neutrons ...The improved version of Los Alamos model with the multi-modal fission approach is used to analyse the prompt fission neutron spectrum and multiplicity for the neutron-induced fission of 237Np. The spectra of neutrons emitted from fragments for the three most dominant fission modes (standard Ⅰ, standard Ⅱ and superlong) are calculated separately and the total spectrum is synthesized. The multi-modal parameters contained in the spectrum model are determined on the basis of experimental data of fission fragment mass distributions. The calculated total prompt fission neutron spectrum and multiplicity are better agreement with the experimental data than those obtained from the conventional treatment of the Los Alamos model.展开更多
An attempt is made to improve the evaluation of the prompt fission neutron emis- sion from 233U(n, f) reaction for incident neutron energies below 6 MeV. The multi-modal fission approach is applied to the improved v...An attempt is made to improve the evaluation of the prompt fission neutron emis- sion from 233U(n, f) reaction for incident neutron energies below 6 MeV. The multi-modal fission approach is applied to the improved version of Los Alamos model and the point by point model. The prompt fission neutron spectra and the prompt fission neutron as a function of fragment mass (usually named "sawtooth" data) v(A) are calculated independently for the three most dominant fission modes (standard I, standard II and superlong), and the total spectra and v(A) are syn- thesized. The multi-modal parameters are determined on the basis of experimental data of fission fragment mass distributions. The present calculation results can describe the experimental data very well, and the proposed treatment is thus a useful tool for prompt fission neutron emission prediction.展开更多
Multi-modal Named Entity Recognition(MNER)aims to better identify meaningful textual entities by integrating information from images.Previous work has focused on extracting visual semantics at a fine-grained level,or ...Multi-modal Named Entity Recognition(MNER)aims to better identify meaningful textual entities by integrating information from images.Previous work has focused on extracting visual semantics at a fine-grained level,or obtaining entity related external knowledge from knowledge bases or Large Language Models(LLMs).However,these approaches ignore the poor semantic correlation between visual and textual modalities in MNER datasets and do not explore different multi-modal fusion approaches.In this paper,we present MMAVK,a multi-modal named entity recognition model with auxiliary visual knowledge and word-level fusion,which aims to leverage the Multi-modal Large Language Model(MLLM)as an implicit knowledge base.It also extracts vision-based auxiliary knowledge from the image formore accurate and effective recognition.Specifically,we propose vision-based auxiliary knowledge generation,which guides the MLLM to extract external knowledge exclusively derived from images to aid entity recognition by designing target-specific prompts,thus avoiding redundant recognition and cognitive confusion caused by the simultaneous processing of image-text pairs.Furthermore,we employ a word-level multi-modal fusion mechanism to fuse the extracted external knowledge with each word-embedding embedded from the transformerbased encoder.Extensive experimental results demonstrate that MMAVK outperforms or equals the state-of-the-art methods on the two classical MNER datasets,even when the largemodels employed have significantly fewer parameters than other baselines.展开更多
[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-base...[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-based models that utilize either images data or environmental data.These methods fail to fully leverage multi-modal data to capture the diverse aspects of plant growth comprehensively.[Methods]To address this limitation,a two-stage phenotypic feature extraction(PFE)model based on deep learning algorithm of recurrent neural network(RNN)and long short-term memory(LSTM)was developed.The model integrated environment and plant information to provide a holistic understanding of the growth process,emploied phenotypic and temporal feature extractors to comprehensively capture both types of features,enabled a deeper understanding of the interaction between tomato plants and their environment,ultimately leading to highly accurate predictions of growth height.[Results and Discussions]The experimental results showed the model's ef‐fectiveness:When predicting the next two days based on the past five days,the PFE-based RNN and LSTM models achieved mean absolute percentage error(MAPE)of 0.81%and 0.40%,respectively,which were significantly lower than the 8.00%MAPE of the large language model(LLM)and 6.72%MAPE of the Transformer-based model.In longer-term predictions,the 10-day prediction for 4 days ahead and the 30-day prediction for 12 days ahead,the PFE-RNN model continued to outperform the other two baseline models,with MAPE of 2.66%and 14.05%,respectively.[Conclusions]The proposed method,which leverages phenotypic-temporal collaboration,shows great potential for intelligent,data-driven management of tomato cultivation,making it a promising approach for enhancing the efficiency and precision of smart tomato planting management.展开更多
In this work, we propose a new variational model for multi-modal image registration and present an efficient numerical implementation. The model minimizes a new functional based on using reformulated normalized gradie...In this work, we propose a new variational model for multi-modal image registration and present an efficient numerical implementation. The model minimizes a new functional based on using reformulated normalized gradients of the images as the fidelity term and higher-order derivatives as the regularizer. A key feature of the model is its ability of guaranteeing a diffeomorphic transformation which is achieved by a control term motivated by the quasi-conformal map and Beltrami coefficient. The existence of the solution of this model is established. To solve the model numerically, we design a Gauss-Newton method to solve the resulting discrete optimization problem and prove its convergence;a multilevel technique is employed to speed up the initialization and avoid likely local minima of the underlying functional. Finally, numerical experiments demonstrate that this new model can deliver good performances for multi-modal image registration and simultaneously generate an accurate diffeomorphic transformation.展开更多
Modal choice models applied to interregional or international freight transportation network models are often based on rather coarse origin-destination matrices, containing annual transported tonnages between (sub)reg...Modal choice models applied to interregional or international freight transportation network models are often based on rather coarse origin-destination matrices, containing annual transported tonnages between (sub)regions, for instance. Generally, only basic (sometimes constructed) independent variables (transportation costs or transit times) are used because other variables such as shipment sizes, service frequencies, etc. are not available. Using origin-destination matrices and an assignment model, it is also possible to compute spatial accessibility measures that can further be used as additional explanatory variables. Indeed, several published studies have identified network accessibility as an important element in the mode-choice decision. This paper also shows that the inclusion of an accessibility measure in the utility functions of a logit model substantially improves the performance of a transportation network model, both in the modal choice and the assignment levels of the classical four-step model. Consequently, the assignment of the estimated modal demands results in more accurate estimated traffic on the networks. The model presented in this paper is to be considered as a proof of concept because its workflow should further be streamlined to make it easily useable by modelers.展开更多
In this paper, we develop a new path search algorithm which considers all the degrees of freedom and apply it on our calculated five-dimensional potential energy surface(PES) of^(236) U. Asymmetric and symmetric fissi...In this paper, we develop a new path search algorithm which considers all the degrees of freedom and apply it on our calculated five-dimensional potential energy surface(PES) of^(236) U. Asymmetric and symmetric fission paths and barriers are obtained.展开更多
In order to apply overbooking idea in Chinese railway freight industry to improve revenue, a Markov decision process(dynamic programming) model for railway freight reservation was formulated and the overbooking limit ...In order to apply overbooking idea in Chinese railway freight industry to improve revenue, a Markov decision process(dynamic programming) model for railway freight reservation was formulated and the overbooking limit level was proposed as a control policy. However, computing the dynamic programming treatment needs six nested loops and this will be burdensome for real-world problems. To break through the calculation limit, the properties of value function were analyzed and the overbooking protection level was proposed to reduce the calculating quantity. The simulation experiments show that the overbooking protection level for the lower-fare class is higher than that for the higher-fare class, so the overbooking strategy is nested by fare class. Besides, by analyzing the influence on the overbooking strategy of freight arrival probability and cancellation probability, the proposed approach is efficient and also has a good application prospect in reality. Also, compared with the existing reservation(FCFS), the overbooking strategy performs better in the fields of vacancy reduction and revenue improvement.展开更多
The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigat...The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigation scheme based on Interacting Multiple Nonlinear Fuzzy Adaptive H_∞ Models(IMM-NFAH_∞) filtering technique for UAV is presented. The proposed IMM-NFAH_∞ strategy switches between two different Nonlinear Fuzzy Adaptive H_∞(NFAH_∞) filters and each NFAH_∞ filter is based on different fuzzy logic inference systems. The newly proposed technique takes into consideration the high order Taylor series terms and adapts the nonlinear H_∞ filter based on different fuzzy inference systems via adaptive filter bounds(di),along with disturbance attenuation parameter c. Simulation analysis validates the performance of the proposed algorithm, and the comparison with nonlinear H_∞(NH_∞) filter and that with different NFAH_∞ filters demonstrate the effectiveness of UAV localization utilizing IMM-NFAH_∞ filter.展开更多
With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much att...With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much attention to heaRthcare robots and rehabilitation robots. To get natural and harmonious communication between the user and a service robot, the information perception/feedback ability, and interaction ability for service robots become more important in many key issues.展开更多
This paper proposes a modeling system developed in order to analyze the urban freight transport and logistics within urban and metropolitan areas. A review of models developed to simulate this segment of mobility is a...This paper proposes a modeling system developed in order to analyze the urban freight transport and logistics within urban and metropolitan areas. A review of models developed to simulate this segment of mobility is also reported. The review analysis highlights the limits of models for the ex-ante assessment of city logistics measures. For this reason this paper proposes a new modeling system approach for the assessment of city logistics measures. It is made of different steps approaching problems related to quantity OD (Origin-Destination) flows, restocking type OD flows, delivery OD flows, delivery OD flows for time slice and vehicle type, and vehicle OD flows. This modeling system has been specified and calibrated using some surveys carried out in the inner area of Rome where more than 500 truck drivers and more than 600 retailers have been interviewed.展开更多
This paper presents a general framework that can be used to estimate direct and cross elasticities for freight transport using a network model. This methodology combines operational research (network assignments in a ...This paper presents a general framework that can be used to estimate direct and cross elasticities for freight transport using a network model. This methodology combines operational research (network assignments in a geographical information system) with more classical econometrics (multinomial logit choice models). The application of the method to a real-world case is illustrated by a simple model that relies on the generalized cost of transport as the only explanatory variable in the utility function. The methodological framework allows, however, for the implementation of more complex functions. Beside the generalized cost functions for road, rail and inland waterways transport, the network model needs origin-destination matrixes and digitized networks. They are imported from ETIS Plus, a European transport policy information system. A set of direct and cross elasticities is presented. The estimated values are obtained using two methods: the first computes standard elasticities, while the second estimates arc elasticities. Figures are presented for Europe and for a large region around the Benelux countries, where more competition exists between the three modes of interest.展开更多
There is an explicit and implicit assumption in multimodal traffic equilibrium models, that is, if the equilibrium exists, then it will also occur. The assumption is very idealized; in fact, it may be shown that the q...There is an explicit and implicit assumption in multimodal traffic equilibrium models, that is, if the equilibrium exists, then it will also occur. The assumption is very idealized; in fact, it may be shown that the quite contrary could happen, because in multimodal traffic network, especially in mixed traffic conditions the interaction among traffic modes is asymmetric and the asymmetric interaction may result in the instability of traffic system. In this paper, to study the stability of multimodal traffic system, we respectively present the travel cost function in mixed traffic conditions and in traffic network with dedicated bus lanes. Based on a day-to-day dynamical model, we study the evolution of daily route choice of travelers in multimodal traffic network using 10000 random initial values for different cases. From the results of simulation, it can be concluded that the asymmetric interaction between the cars and buses in mixed traffic conditions can lead the traffic system to instability when traffic demand is larger. We also study the effect of travelers' perception error on the stability of multimodal traffic network. Although the larger perception error can alleviate the effect of interaction between cars and buses and improve the stability of traffic system in mixed traffic conditions, the traffic system also become instable when the traffic demand is larger than a number. For all cases simulated in this study, with the same parameters, traffic system with dedicated bus lane has better stability for traffic demand than that in mixed traffic conditions. We also find that the network with dedicated bus lane has higher portion of travelers by bus than it of mixed traffic network. So it can be concluded that building dedicated bus lane can improve the stability of traffic system and attract more travelers to choose bus reducing the traffic congestion.展开更多
Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event eleme...Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event elements from multi-modal data remains a challenging task due to the presence of a large number of images and overlapping event elements in the data.Although researchers have proposed various methods to accomplish this task,most existing event extraction models cannot address these challenges because they are only applicable to text scenarios.To solve the above issues,this paper proposes a multi-modal event extraction method based on knowledge fusion.Specifically,for event-type recognition,we use a meticulous pipeline approach that integrates multiple pre-trained models.This approach enables a more comprehensive capture of the multidimensional event semantic features present in military texts,thereby enhancing the interconnectedness of information between trigger words and events.For event element extraction,we propose a method for constructing a priori templates that combine event types with corresponding trigger words.This approach facilitates the acquisition of fine-grained input samples containing event trigger words,thus enabling the model to understand the semantic relationships between elements in greater depth.Furthermore,a fusion method for spatial mapping of textual event elements and image elements is proposed to reduce the category number overload and effectively achieve multi-modal knowledge fusion.The experimental results based on the CCKS 2022 dataset show that our method has achieved competitive results,with a comprehensive evaluation value F1-score of 53.4%for the model.These results validate the effectiveness of our method in extracting event elements from multi-modal data.展开更多
The development of experimental animal models for head and neck tumors generally rely on the biol uminescence imaging to achieve the dynamic monitoring of the tumor growth and metastasis due to the complicated anatomi...The development of experimental animal models for head and neck tumors generally rely on the biol uminescence imaging to achieve the dynamic monitoring of the tumor growth and metastasis due to the complicated anatomical structures.Since the bioluminescence imaging is largely affected by the intracellular luciferase expression level and external D-luciferin concentrations,its imaging accuracy requires further confirmation.Here,a new triple fusion reportelr gene,which consists of a herpes simplex virus type 1 thymidine kinase(TK)gene for radioactive imaging,a far-red fuorescent protein(mLumin)gene for fuorescent imaging,and a firefly luciferase gene for bioluminescence imaging,was introduced for in vrivo observation of the head and neck tumors through multi-modality imaging.Results show that fuorescence and bioluminescence signals from mLumin and luciferase,respectively,were clearly observed in tumor cells,and TK could activate suicide pathway of the cells in the presence of nucleotide analog-ganciclovir(GCV),demonstrating the effecti veness of individual functions of each gene.Moreover,subcutaneous and metastasis animal models for head and neck tumors using the fusion reporter gene-expressing cell lines were established,allowing multi-modality imaging in vio.Together,the established tumor models of head and neck cancer based on the newly developed triple fusion reporter gene are ideal for monitoring tumor growth,assessing the drug therapeutic efficacy and verifying the effec-tiveness of new treatments.展开更多
This research study explores the use of an innovative freight tour-based approach to model truck trips as an alternative to the conventional trip-based approach. The tour-based approach is more realistic as it capture...This research study explores the use of an innovative freight tour-based approach to model truck trips as an alternative to the conventional trip-based approach. The tour-based approach is more realistic as it captures the intermediate stops of each truck and reflects the implications of those stops on vehicle miles traveled (VMT). The paper describes the truck tour-based model concept, and presents the framework of a truck tour-based travel demand forecasting approach. As a case study, Global Positioning System (GPS) truck data are used to determine origin, destination, and truck stops for trucks moving within the Birmingham, Alabama region. Such information is then utilized to model truck movements within the study region as individual truck tours. The tour-based model is ran, and the resulting performance measures are contrasted to those obtained from the conventional trip-based planning model used by the Regional Planning Commission of Greater Birmingham (RPCGB). This case study demonstrates the feasibility of using a tour-based freight demand forecasting model as an alternative to the conventional 4-step process currently used to estimate truck trips in the Birmingham region. The results and lessons learned from the Birmingham case study are expected to improve truck movement modeling practices in the region and advance the accuracy of truck travel demand forecasting models at other locations in the future.展开更多
Purpose–This paper aims to provide a comprehensive analysis of the strategic adjustments in China’s transportation structure,with a particular focus on the pivotal role of railway freight and its integration into th...Purpose–This paper aims to provide a comprehensive analysis of the strategic adjustments in China’s transportation structure,with a particular focus on the pivotal role of railway freight and its integration into the modern logistics system.It seeks to address the need for a more nuanced understanding of the“road to rail”policy,emphasizing the importance of intermodal collaboration and service of fragmented market demands.Design/methodology/approach–The study employs a transport economics perspective to evaluate the achievements and shortcomings of China’s transportation structure optimization.It bases its assessment of the current state of railway freight logistics,multi-modal transportation and the broader implications for the transportation service market on data analysis.The methodology includes a review of existing policies,an examination of industry practices and a comparative analysis with global trends in railway logistics.Findings–The research underscores the importance of focusing on the development of non-bulk materials,noting the insufficiency in the development of China’s rail multi-modal transportation and highlighting the instructive value of successful cases in open-top container road-rail intermodal transportation.The study posits that the railway sector must enhance cooperation with other market entities,aligning with the lead enterprises in the logistics chain that are characterized by speed,high value and strong coordination capabilities,in order to better serve the transportation market.This approach moves away from a reliance on the railway’s own capabilities alone.Originality/value–This paper offers original insights into the transformation of railway freight in China,contributing to the body of knowledge on transportation economics and logistics.It provides valuable recommendations for policymakers and industry practitioners,emphasizing the strategic importance of railway logistics in the context of China’s economic development and intense competition in the supply chain.The value of the article lies in its comprehensive understanding of the complexities involved in the adjustment of transportation structures,providing direction for the market-oriented reform of China’s railway freight sector.展开更多
Industrial ebullated-bed is an important device for promoting the cleaning and upgrading of oil products. The lumped kinetic model is a powerful tool for predicting the product yield of the ebullated-bed residue hydro...Industrial ebullated-bed is an important device for promoting the cleaning and upgrading of oil products. The lumped kinetic model is a powerful tool for predicting the product yield of the ebullated-bed residue hydrogenation (EBRH) unit, However, during the long-term operation of the device, there are phenomena such as low frequency of material property analysis leading to limited operating data and diverse operating modes at the same time scale, which poses a huge challenge to building an accurate product yield prediction model. To address these challenges, a data augmentation-based eleven lumped reaction kinetics mechanism model was constructed. This model combines generative adversarial networks, outlier elimination, and L2 norm data filtering to expand the dataset and utilizes kernel principal component analysis-fuzzy C-means for operating condition partitioning. Based on the hydrogenation reaction mechanism, a single and sub operating condition eleven lumped reaction kinetics model of an ebullated-bed residue hydrogenation unit, comprising 55 reaction paths and 110 parameters, was constructed before and after data augmentation. Compared to the single model before data enhancement, the average absolute error of the sub-models under data enhancement division was reduced by 23%. Thus, these findings can help guide the operation and optimization of the production process.展开更多
基金Shanghai Frontier Science Research Center for Modern Textiles,Donghua University,ChinaOpen Project of Henan Key Laboratory of Intelligent Manufacturing of Mechanical Equipment,Zhengzhou University of Light Industry,China(No.IM202303)National Key Research and Development Program of China(No.2019YFB1706300)。
文摘A personalized outfit recommendation has emerged as a hot research topic in the fashion domain.However,existing recommendations do not fully exploit user style preferences.Typically,users prefer particular styles such as casual and athletic styles,and consider attributes like color and texture when selecting outfits.To achieve personalized outfit recommendations in line with user style preferences,this paper proposes a personal style guided outfit recommendation with multi-modal fashion compatibility modeling,termed as PSGNet.Firstly,a style classifier is designed to categorize fashion images of various clothing types and attributes into distinct style categories.Secondly,a personal style prediction module extracts user style preferences by analyzing historical data.Then,to address the limitations of single-modal representations and enhance fashion compatibility,both fashion images and text data are leveraged to extract multi-modal features.Finally,PSGNet integrates these components through Bayesian personalized ranking(BPR)to unify the personal style and fashion compatibility,where the former is used as personal style features and guides the output of the personalized outfit recommendation tailored to the target user.Extensive experiments on large-scale datasets demonstrate that the proposed model is efficient on the personalized outfit recommendation.
基金Supported by Grant-in-Aid for Young Scientists(A)(Grant No.26700021)Japan Society for the Promotion of Science and Strategic Information and Communications R&D Promotion Programme(Grant No.142103011)Ministry of Internal Affairs and Communications
文摘Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation and sign language. With increasing motion sensor development, multiple data sources have become available, which leads to the rise of multi-modal gesture recognition. Since our previous approach to gesture recognition depends on a unimodal system, it is difficult to classify similar motion patterns. In order to solve this problem, a novel approach which integrates motion, audio and video models is proposed by using dataset captured by Kinect. The proposed system can recognize observed gestures by using three models. Recognition results of three models are integrated by using the proposed framework and the output becomes the final result. The motion and audio models are learned by using Hidden Markov Model. Random Forest which is the video classifier is used to learn the video model. In the experiments to test the performances of the proposed system, the motion and audio models most suitable for gesture recognition are chosen by varying feature vectors and learning methods. Additionally, the unimodal and multi-modal models are compared with respect to recognition accuracy. All the experiments are conducted on dataset provided by the competition organizer of MMGRC, which is a workshop for Multi-Modal Gesture Recognition Challenge. The comparison results show that the multi-modal model composed of three models scores the highest recognition rate. This improvement of recognition accuracy means that the complementary relationship among three models improves the accuracy of gesture recognition. The proposed system provides the application technology to understand human actions of daily life more precisely.
基金Project supported by the State Key Development Program for Basic Research of China (Grant Nos 2008CB717803 and 2007ID103)the Research Fund for the Doctoral Program of Higher Education of China (Gant No 200610001023)
文摘The improved version of Los Alamos model with the multi-modal fission approach is used to analyse the prompt fission neutron spectrum and multiplicity for the neutron-induced fission of 237Np. The spectra of neutrons emitted from fragments for the three most dominant fission modes (standard Ⅰ, standard Ⅱ and superlong) are calculated separately and the total spectrum is synthesized. The multi-modal parameters contained in the spectrum model are determined on the basis of experimental data of fission fragment mass distributions. The calculated total prompt fission neutron spectrum and multiplicity are better agreement with the experimental data than those obtained from the conventional treatment of the Los Alamos model.
基金supported by the State Key Development Program for Basic Research of China (Nos. 2008CB717803, 2009GB107001, and2007CB209903)the Research Fund for the Doctoral Program of Higher Education of China (No. 200610011023)
文摘An attempt is made to improve the evaluation of the prompt fission neutron emis- sion from 233U(n, f) reaction for incident neutron energies below 6 MeV. The multi-modal fission approach is applied to the improved version of Los Alamos model and the point by point model. The prompt fission neutron spectra and the prompt fission neutron as a function of fragment mass (usually named "sawtooth" data) v(A) are calculated independently for the three most dominant fission modes (standard I, standard II and superlong), and the total spectra and v(A) are syn- thesized. The multi-modal parameters are determined on the basis of experimental data of fission fragment mass distributions. The present calculation results can describe the experimental data very well, and the proposed treatment is thus a useful tool for prompt fission neutron emission prediction.
基金funded by Research Project,grant number BHQ090003000X03.
文摘Multi-modal Named Entity Recognition(MNER)aims to better identify meaningful textual entities by integrating information from images.Previous work has focused on extracting visual semantics at a fine-grained level,or obtaining entity related external knowledge from knowledge bases or Large Language Models(LLMs).However,these approaches ignore the poor semantic correlation between visual and textual modalities in MNER datasets and do not explore different multi-modal fusion approaches.In this paper,we present MMAVK,a multi-modal named entity recognition model with auxiliary visual knowledge and word-level fusion,which aims to leverage the Multi-modal Large Language Model(MLLM)as an implicit knowledge base.It also extracts vision-based auxiliary knowledge from the image formore accurate and effective recognition.Specifically,we propose vision-based auxiliary knowledge generation,which guides the MLLM to extract external knowledge exclusively derived from images to aid entity recognition by designing target-specific prompts,thus avoiding redundant recognition and cognitive confusion caused by the simultaneous processing of image-text pairs.Furthermore,we employ a word-level multi-modal fusion mechanism to fuse the extracted external knowledge with each word-embedding embedded from the transformerbased encoder.Extensive experimental results demonstrate that MMAVK outperforms or equals the state-of-the-art methods on the two classical MNER datasets,even when the largemodels employed have significantly fewer parameters than other baselines.
文摘[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-based models that utilize either images data or environmental data.These methods fail to fully leverage multi-modal data to capture the diverse aspects of plant growth comprehensively.[Methods]To address this limitation,a two-stage phenotypic feature extraction(PFE)model based on deep learning algorithm of recurrent neural network(RNN)and long short-term memory(LSTM)was developed.The model integrated environment and plant information to provide a holistic understanding of the growth process,emploied phenotypic and temporal feature extractors to comprehensively capture both types of features,enabled a deeper understanding of the interaction between tomato plants and their environment,ultimately leading to highly accurate predictions of growth height.[Results and Discussions]The experimental results showed the model's ef‐fectiveness:When predicting the next two days based on the past five days,the PFE-based RNN and LSTM models achieved mean absolute percentage error(MAPE)of 0.81%and 0.40%,respectively,which were significantly lower than the 8.00%MAPE of the large language model(LLM)and 6.72%MAPE of the Transformer-based model.In longer-term predictions,the 10-day prediction for 4 days ahead and the 30-day prediction for 12 days ahead,the PFE-RNN model continued to outperform the other two baseline models,with MAPE of 2.66%and 14.05%,respectively.[Conclusions]The proposed method,which leverages phenotypic-temporal collaboration,shows great potential for intelligent,data-driven management of tomato cultivation,making it a promising approach for enhancing the efficiency and precision of smart tomato planting management.
文摘In this work, we propose a new variational model for multi-modal image registration and present an efficient numerical implementation. The model minimizes a new functional based on using reformulated normalized gradients of the images as the fidelity term and higher-order derivatives as the regularizer. A key feature of the model is its ability of guaranteeing a diffeomorphic transformation which is achieved by a control term motivated by the quasi-conformal map and Beltrami coefficient. The existence of the solution of this model is established. To solve the model numerically, we design a Gauss-Newton method to solve the resulting discrete optimization problem and prove its convergence;a multilevel technique is employed to speed up the initialization and avoid likely local minima of the underlying functional. Finally, numerical experiments demonstrate that this new model can deliver good performances for multi-modal image registration and simultaneously generate an accurate diffeomorphic transformation.
文摘Modal choice models applied to interregional or international freight transportation network models are often based on rather coarse origin-destination matrices, containing annual transported tonnages between (sub)regions, for instance. Generally, only basic (sometimes constructed) independent variables (transportation costs or transit times) are used because other variables such as shipment sizes, service frequencies, etc. are not available. Using origin-destination matrices and an assignment model, it is also possible to compute spatial accessibility measures that can further be used as additional explanatory variables. Indeed, several published studies have identified network accessibility as an important element in the mode-choice decision. This paper also shows that the inclusion of an accessibility measure in the utility functions of a logit model substantially improves the performance of a transportation network model, both in the modal choice and the assignment levels of the classical four-step model. Consequently, the assignment of the estimated modal demands results in more accurate estimated traffic on the networks. The model presented in this paper is to be considered as a proof of concept because its workflow should further be streamlined to make it easily useable by modelers.
基金Supported by the Major Research plan of National Natural Science of China under Grant No.11790324the National Key Research and Development Program of China under Grant Nos.2016YY0200804 and 2017YFF0206205+1 种基金the National Magnetic Confinement Fusion Science Program of China under Grant Nos.2013GB106004 and 2012GB-101003the National Natural Science Foundation of China under No.1179320013
文摘In this paper, we develop a new path search algorithm which considers all the degrees of freedom and apply it on our calculated five-dimensional potential energy surface(PES) of^(236) U. Asymmetric and symmetric fission paths and barriers are obtained.
基金Project(2010QZZD021)supported by the Fundamental Research Funds for the Central Universities,ChinaProject(2015F024)supported by China Railway Science and Technology Research Development Program
文摘In order to apply overbooking idea in Chinese railway freight industry to improve revenue, a Markov decision process(dynamic programming) model for railway freight reservation was formulated and the overbooking limit level was proposed as a control policy. However, computing the dynamic programming treatment needs six nested loops and this will be burdensome for real-world problems. To break through the calculation limit, the properties of value function were analyzed and the overbooking protection level was proposed to reduce the calculating quantity. The simulation experiments show that the overbooking protection level for the lower-fare class is higher than that for the higher-fare class, so the overbooking strategy is nested by fare class. Besides, by analyzing the influence on the overbooking strategy of freight arrival probability and cancellation probability, the proposed approach is efficient and also has a good application prospect in reality. Also, compared with the existing reservation(FCFS), the overbooking strategy performs better in the fields of vacancy reduction and revenue improvement.
基金supported by a grant from the National Natural Science Foundation of China(No.61375082)
文摘The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigation scheme based on Interacting Multiple Nonlinear Fuzzy Adaptive H_∞ Models(IMM-NFAH_∞) filtering technique for UAV is presented. The proposed IMM-NFAH_∞ strategy switches between two different Nonlinear Fuzzy Adaptive H_∞(NFAH_∞) filters and each NFAH_∞ filter is based on different fuzzy logic inference systems. The newly proposed technique takes into consideration the high order Taylor series terms and adapts the nonlinear H_∞ filter based on different fuzzy inference systems via adaptive filter bounds(di),along with disturbance attenuation parameter c. Simulation analysis validates the performance of the proposed algorithm, and the comparison with nonlinear H_∞(NH_∞) filter and that with different NFAH_∞ filters demonstrate the effectiveness of UAV localization utilizing IMM-NFAH_∞ filter.
文摘With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much attention to heaRthcare robots and rehabilitation robots. To get natural and harmonious communication between the user and a service robot, the information perception/feedback ability, and interaction ability for service robots become more important in many key issues.
文摘This paper proposes a modeling system developed in order to analyze the urban freight transport and logistics within urban and metropolitan areas. A review of models developed to simulate this segment of mobility is also reported. The review analysis highlights the limits of models for the ex-ante assessment of city logistics measures. For this reason this paper proposes a new modeling system approach for the assessment of city logistics measures. It is made of different steps approaching problems related to quantity OD (Origin-Destination) flows, restocking type OD flows, delivery OD flows, delivery OD flows for time slice and vehicle type, and vehicle OD flows. This modeling system has been specified and calibrated using some surveys carried out in the inner area of Rome where more than 500 truck drivers and more than 600 retailers have been interviewed.
文摘This paper presents a general framework that can be used to estimate direct and cross elasticities for freight transport using a network model. This methodology combines operational research (network assignments in a geographical information system) with more classical econometrics (multinomial logit choice models). The application of the method to a real-world case is illustrated by a simple model that relies on the generalized cost of transport as the only explanatory variable in the utility function. The methodological framework allows, however, for the implementation of more complex functions. Beside the generalized cost functions for road, rail and inland waterways transport, the network model needs origin-destination matrixes and digitized networks. They are imported from ETIS Plus, a European transport policy information system. A set of direct and cross elasticities is presented. The estimated values are obtained using two methods: the first computes standard elasticities, while the second estimates arc elasticities. Figures are presented for Europe and for a large region around the Benelux countries, where more competition exists between the three modes of interest.
基金Supported by the National Basic Research Development Program of China under Grant No. 2012CB725401, Fundamental Research Funds for the Central Universities under Grant No. 2012JBZ 005, Funds for International Cooperation and Exchange of the National Natural Science Foundation of China under Grant No. 71210001, National Natural Science Foundation of China under Grant No. 71271023, Foundation for the Author of National Excellent Doctoral Dissertation of China under Grant No. 201170, and Beijing Nova Program under Grant No. 2009A15
文摘There is an explicit and implicit assumption in multimodal traffic equilibrium models, that is, if the equilibrium exists, then it will also occur. The assumption is very idealized; in fact, it may be shown that the quite contrary could happen, because in multimodal traffic network, especially in mixed traffic conditions the interaction among traffic modes is asymmetric and the asymmetric interaction may result in the instability of traffic system. In this paper, to study the stability of multimodal traffic system, we respectively present the travel cost function in mixed traffic conditions and in traffic network with dedicated bus lanes. Based on a day-to-day dynamical model, we study the evolution of daily route choice of travelers in multimodal traffic network using 10000 random initial values for different cases. From the results of simulation, it can be concluded that the asymmetric interaction between the cars and buses in mixed traffic conditions can lead the traffic system to instability when traffic demand is larger. We also study the effect of travelers' perception error on the stability of multimodal traffic network. Although the larger perception error can alleviate the effect of interaction between cars and buses and improve the stability of traffic system in mixed traffic conditions, the traffic system also become instable when the traffic demand is larger than a number. For all cases simulated in this study, with the same parameters, traffic system with dedicated bus lane has better stability for traffic demand than that in mixed traffic conditions. We also find that the network with dedicated bus lane has higher portion of travelers by bus than it of mixed traffic network. So it can be concluded that building dedicated bus lane can improve the stability of traffic system and attract more travelers to choose bus reducing the traffic congestion.
基金supported by the National Natural Science Foundation of China(Grant No.81973695)Discipline with Strong Characteristics of Liaocheng University-Intelligent Science and Technology(Grant No.319462208).
文摘Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event elements from multi-modal data remains a challenging task due to the presence of a large number of images and overlapping event elements in the data.Although researchers have proposed various methods to accomplish this task,most existing event extraction models cannot address these challenges because they are only applicable to text scenarios.To solve the above issues,this paper proposes a multi-modal event extraction method based on knowledge fusion.Specifically,for event-type recognition,we use a meticulous pipeline approach that integrates multiple pre-trained models.This approach enables a more comprehensive capture of the multidimensional event semantic features present in military texts,thereby enhancing the interconnectedness of information between trigger words and events.For event element extraction,we propose a method for constructing a priori templates that combine event types with corresponding trigger words.This approach facilitates the acquisition of fine-grained input samples containing event trigger words,thus enabling the model to understand the semantic relationships between elements in greater depth.Furthermore,a fusion method for spatial mapping of textual event elements and image elements is proposed to reduce the category number overload and effectively achieve multi-modal knowledge fusion.The experimental results based on the CCKS 2022 dataset show that our method has achieved competitive results,with a comprehensive evaluation value F1-score of 53.4%for the model.These results validate the effectiveness of our method in extracting event elements from multi-modal data.
基金supported by the National Science and Technology Support Program of China(Grant No.2012BAI23B02)the China-Canada Joint Health Research Initiative(NSFC-30911120489,CIHR CCI-102936)111 Project of China(B07038).
文摘The development of experimental animal models for head and neck tumors generally rely on the biol uminescence imaging to achieve the dynamic monitoring of the tumor growth and metastasis due to the complicated anatomical structures.Since the bioluminescence imaging is largely affected by the intracellular luciferase expression level and external D-luciferin concentrations,its imaging accuracy requires further confirmation.Here,a new triple fusion reportelr gene,which consists of a herpes simplex virus type 1 thymidine kinase(TK)gene for radioactive imaging,a far-red fuorescent protein(mLumin)gene for fuorescent imaging,and a firefly luciferase gene for bioluminescence imaging,was introduced for in vrivo observation of the head and neck tumors through multi-modality imaging.Results show that fuorescence and bioluminescence signals from mLumin and luciferase,respectively,were clearly observed in tumor cells,and TK could activate suicide pathway of the cells in the presence of nucleotide analog-ganciclovir(GCV),demonstrating the effecti veness of individual functions of each gene.Moreover,subcutaneous and metastasis animal models for head and neck tumors using the fusion reporter gene-expressing cell lines were established,allowing multi-modality imaging in vio.Together,the established tumor models of head and neck cancer based on the newly developed triple fusion reporter gene are ideal for monitoring tumor growth,assessing the drug therapeutic efficacy and verifying the effec-tiveness of new treatments.
文摘This research study explores the use of an innovative freight tour-based approach to model truck trips as an alternative to the conventional trip-based approach. The tour-based approach is more realistic as it captures the intermediate stops of each truck and reflects the implications of those stops on vehicle miles traveled (VMT). The paper describes the truck tour-based model concept, and presents the framework of a truck tour-based travel demand forecasting approach. As a case study, Global Positioning System (GPS) truck data are used to determine origin, destination, and truck stops for trucks moving within the Birmingham, Alabama region. Such information is then utilized to model truck movements within the study region as individual truck tours. The tour-based model is ran, and the resulting performance measures are contrasted to those obtained from the conventional trip-based planning model used by the Regional Planning Commission of Greater Birmingham (RPCGB). This case study demonstrates the feasibility of using a tour-based freight demand forecasting model as an alternative to the conventional 4-step process currently used to estimate truck trips in the Birmingham region. The results and lessons learned from the Birmingham case study are expected to improve truck movement modeling practices in the region and advance the accuracy of truck travel demand forecasting models at other locations in the future.
基金supported by the Yuxiu Innovation Project of NCUT(Grant No.2024NCUTYXCX211).
文摘Purpose–This paper aims to provide a comprehensive analysis of the strategic adjustments in China’s transportation structure,with a particular focus on the pivotal role of railway freight and its integration into the modern logistics system.It seeks to address the need for a more nuanced understanding of the“road to rail”policy,emphasizing the importance of intermodal collaboration and service of fragmented market demands.Design/methodology/approach–The study employs a transport economics perspective to evaluate the achievements and shortcomings of China’s transportation structure optimization.It bases its assessment of the current state of railway freight logistics,multi-modal transportation and the broader implications for the transportation service market on data analysis.The methodology includes a review of existing policies,an examination of industry practices and a comparative analysis with global trends in railway logistics.Findings–The research underscores the importance of focusing on the development of non-bulk materials,noting the insufficiency in the development of China’s rail multi-modal transportation and highlighting the instructive value of successful cases in open-top container road-rail intermodal transportation.The study posits that the railway sector must enhance cooperation with other market entities,aligning with the lead enterprises in the logistics chain that are characterized by speed,high value and strong coordination capabilities,in order to better serve the transportation market.This approach moves away from a reliance on the railway’s own capabilities alone.Originality/value–This paper offers original insights into the transformation of railway freight in China,contributing to the body of knowledge on transportation economics and logistics.It provides valuable recommendations for policymakers and industry practitioners,emphasizing the strategic importance of railway logistics in the context of China’s economic development and intense competition in the supply chain.The value of the article lies in its comprehensive understanding of the complexities involved in the adjustment of transportation structures,providing direction for the market-oriented reform of China’s railway freight sector.
基金supported by National Natural Science Foundation of China(Basic Science Center Program:61988101)National Natural Science Foundation of China(62394345,62373155,62173147)the Major Science and Technology Project of Xinjiang(No.2022A01006-4).
文摘Industrial ebullated-bed is an important device for promoting the cleaning and upgrading of oil products. The lumped kinetic model is a powerful tool for predicting the product yield of the ebullated-bed residue hydrogenation (EBRH) unit, However, during the long-term operation of the device, there are phenomena such as low frequency of material property analysis leading to limited operating data and diverse operating modes at the same time scale, which poses a huge challenge to building an accurate product yield prediction model. To address these challenges, a data augmentation-based eleven lumped reaction kinetics mechanism model was constructed. This model combines generative adversarial networks, outlier elimination, and L2 norm data filtering to expand the dataset and utilizes kernel principal component analysis-fuzzy C-means for operating condition partitioning. Based on the hydrogenation reaction mechanism, a single and sub operating condition eleven lumped reaction kinetics model of an ebullated-bed residue hydrogenation unit, comprising 55 reaction paths and 110 parameters, was constructed before and after data augmentation. Compared to the single model before data enhancement, the average absolute error of the sub-models under data enhancement division was reduced by 23%. Thus, these findings can help guide the operation and optimization of the production process.