Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word a...Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word aligned bilingual corpus,while ignoring the effect of the number of adjacent bilingual phrases.In this paper,we propose a method to take the number of adjacent phrases into account for better estimation of reordering models.Instead of just checking whether there is one phrase adjacent to a given phrase,our method firstly uses a compact structure named reordering graph to represent all phrase segmentations of a parallel sentence,then the effect of the adjacent phrase number can be quantified in a forward-backward fashion,and finally incorporated into the estimation of reordering models.Experimental results on the NIST Chinese-English and WMT French-Spanish data sets show that our approach significantly outperforms the baseline method.展开更多
The importance of Open Source Software(OSS)has increased in recent years.OSS is software that is jointly developed and maintained globally through open collaboration and knowledge sharing.OSS plays an important role,e...The importance of Open Source Software(OSS)has increased in recent years.OSS is software that is jointly developed and maintained globally through open collaboration and knowledge sharing.OSS plays an important role,especially in the Information Technology(IT)field,by increasing the efficiency of software development and reducing costs.However,licensing issues,security issues,etc.,may arise when using OSS.Some services analyze source code and provide OSS-related data to solve these problems,a representative example being Blackduck.Blackduck inspects the entiresource code within the project and provides OSS information and related data included in the whole project.Therefore,there are problems such as inefficiency due to full inspection of the source code and difficulty in determining the exact location where OSS is identified.This paper proposes a scheme to intuitively analyze source code through Graph Modelling Language(GML)conversion to solve these problems.Additionally,encryption is applied to GML to performsecure GML-based OSS inspection.The study explains the process of converting source code to GML and performing OSS inspection.Afterward,we compare the capacity and accuracy of text-based OSS inspection and GML-based OSS inspection.Signcryption is applied to performsafe,GML-based,efficient OSS inspection.展开更多
基金supported by the National Natural Science Foundation of China(No.61303082) the Research Fund for the Doctoral Program of Higher Education of China(No.20120121120046)
文摘Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word aligned bilingual corpus,while ignoring the effect of the number of adjacent bilingual phrases.In this paper,we propose a method to take the number of adjacent phrases into account for better estimation of reordering models.Instead of just checking whether there is one phrase adjacent to a given phrase,our method firstly uses a compact structure named reordering graph to represent all phrase segmentations of a parallel sentence,then the effect of the adjacent phrase number can be quantified in a forward-backward fashion,and finally incorporated into the estimation of reordering models.Experimental results on the NIST Chinese-English and WMT French-Spanish data sets show that our approach significantly outperforms the baseline method.
基金supported by SW Copyright Ecosystem R&D Program through the Korea Creative Content Agency grant funded by the Ministry of Culture,Sports and Tourism in 2024(RS-2023-00224818)The project is titled“Development of Large-Scale Software License Verification Technology Using Cloud Services and Construction Types”,with a contribution rate of 100%.
文摘The importance of Open Source Software(OSS)has increased in recent years.OSS is software that is jointly developed and maintained globally through open collaboration and knowledge sharing.OSS plays an important role,especially in the Information Technology(IT)field,by increasing the efficiency of software development and reducing costs.However,licensing issues,security issues,etc.,may arise when using OSS.Some services analyze source code and provide OSS-related data to solve these problems,a representative example being Blackduck.Blackduck inspects the entiresource code within the project and provides OSS information and related data included in the whole project.Therefore,there are problems such as inefficiency due to full inspection of the source code and difficulty in determining the exact location where OSS is identified.This paper proposes a scheme to intuitively analyze source code through Graph Modelling Language(GML)conversion to solve these problems.Additionally,encryption is applied to GML to performsecure GML-based OSS inspection.The study explains the process of converting source code to GML and performing OSS inspection.Afterward,we compare the capacity and accuracy of text-based OSS inspection and GML-based OSS inspection.Signcryption is applied to performsafe,GML-based,efficient OSS inspection.