Funding: Supported by research grants from Xi’an Jiaotong-Liverpool University Research Development Funding (RDF-22-01-053); the Open Project of the Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University (MMC202414); and Duke Kunshan University.
Abstract: Legal argumentation analysis represents a critical yet underexplored domain in computational linguistics, particularly for cross-lingual contexts where high-quality parallel corpora with argumentative annotations remain scarce. This paper introduces the Hong Kong Legal Judgments Argumentative Corpus (HKLJ-Arg), a novel bilingual Chinese-English parallel corpus comprising 557 professionally translated paragraph pairs extracted from 15 legal judgments spanning 2012-2025. Our corpus leverages Hong Kong’s bilingual legal framework to provide high-fidelity parallel translations with comprehensive argumentative structure annotations, including claims, premises, examples, and their logical relationships. The annotation methodology employs a prompt-engineered approach using large language models, validated through expert evaluation achieving Cohen’s κ of 0.79 for component identification and 0.72 for relationship classification. We further present the Argumentation Preservation Assessment Framework (APAF), a systematic methodology for evaluating how effectively machine translation systems preserve argumentative structures across linguistic boundaries. Statistical analysis reveals diverse argumentative patterns across constitutional, criminal, and civil law domains, with systematic preservation of logical structures across languages. This resource addresses a critical gap in legal natural language processing by providing the first large-scale bilingual corpus specifically designed for cross-lingual legal argumentation mining, translation quality assessment, and comparative legal reasoning analysis.
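For readers unfamiliar with the agreement statistic reported in the abstract, the following is a minimal sketch of how Cohen’s κ can be computed for two annotators labelling the same items. It is not the authors’ evaluation code: the function name `cohen_kappa` and the toy label lists are assumptions made purely for illustration, and the resulting value has no connection to the 0.79 and 0.72 figures reported for the corpus.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of categorical labels.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: share of items on which both annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each annotator's marginal label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


# Toy data (invented): model output versus an expert's labels for six spans.
model = ["claim", "premise", "premise", "example", "claim", "premise"]
expert = ["claim", "premise", "claim", "example", "claim", "premise"]
kappa = cohen_kappa(model, expert)
print(round(kappa, 4))
```

In this toy case the annotators agree on 5 of 6 items (observed agreement 5/6), chance agreement is 13/36, and κ works out to 17/23. The statistic discounts agreement that would be expected by chance alone, which is why it is lower than the raw agreement rate.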
Funding: This paper is a partial result of the research project “On the interaction between Q&A in Russian wh-questions with interrogative pronouns” (No. 2020SJA1361), sponsored by the Philosophical and Social Science Research Projects in Jiangsu Universities.
Abstract: Language is a symbolic system consisting of the signifier and the signified. There is an asymmetrical binary correspondence between them, which is also widespread in Russian linguistics. There are two main types of interrogative sentence formed by the interrogative pronoun какой. Traditional lexicography and grammar have limitations in their understanding of какой and of the interrogative sentences it forms. The analysis of such syntactic phenomena should be included in the Q–A unity, which can be described in terms of argument structures, generic relations, deixis, and the relevant characteristics, properties, or attributes of objects.