
WORD ALIGNMENT METHOD AND SYSTEM FOR IMPROVED VOCABULARY COVERAGE IN STATISTICAL AUTOMATI

A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus of text strings provides pairs of text strings, primarily sentences, in source and target languages. A first alignment between a text string pair creates links therebetween. Each link links a single token of the first text string to a single token of the second text string. A second alignment also creates links between the text string pair. In some cases, these links may correspond to bi-phrases. A modified first alignment is generated by selectively modifying links in the first alignment which include a word which is infrequent in the corpus, based on links generated in the second alignment. This results in removing at least some of the links for the infrequent words, allowing more compact and better quality bi-phrases, with higher vocabulary coverage, to be extracted for use in a machine translation system.
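The link-modification step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method as claimed: the abstract does not specify the modification rule or how "infrequent" is measured, so the rule used here (keep a link that touches an infrequent word only if the second alignment also contains it) and the names `count_tokens`, `modify_first_alignment`, and `min_count` are hypothetical. Alignments are represented as lists of `(source_index, target_index)` pairs.

```python
from collections import Counter


def count_tokens(sentences):
    """Count how often each token occurs in a list of tokenized sentences."""
    counts = Counter()
    for tokens in sentences:
        counts.update(tokens)
    return counts


def modify_first_alignment(src, tgt, first_links, second_links,
                           src_counts, tgt_counts, min_count=2):
    """Return a modified copy of first_links.

    Assumed rule (not taken from the source): a link that involves a word
    seen fewer than min_count times in the corpus is kept only if the
    second alignment also contains that link. Links between frequent
    words are left untouched.
    """
    second = set(second_links)
    kept = []
    for i, j in first_links:
        infrequent = (src_counts[src[i]] < min_count
                      or tgt_counts[tgt[j]] < min_count)
        if infrequent and (i, j) not in second:
            continue  # drop the unconfirmed link for an infrequent word
        kept.append((i, j))
    return kept


# Toy corpus, invented for illustration only.
src_corpus = [["the", "cat", "sleeps"],
              ["the", "dog", "sleeps"],
              ["the", "axolotl", "sleeps"]]
tgt_corpus = [["le", "chat", "dort"],
              ["le", "chien", "dort"],
              ["l'", "axolotl", "dort"]]
src_counts = count_tokens(src_corpus)
tgt_counts = count_tokens(tgt_corpus)

# In the first alignment the rare word "axolotl" also links to "l'".
first = [(0, 0), (1, 0), (1, 1), (2, 2)]
second = [(0, 0), (1, 1), (2, 2)]
modified = modify_first_alignment(src_corpus[2], tgt_corpus[2],
                                  first, second, src_counts, tgt_counts)
print(modified)
```

In this toy pair the spurious link from "axolotl" to "l'" is removed, so a bi-phrase extractor could then pull out the compact pair ("axolotl", "axolotl") rather than a larger phrase that also swallows the article.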

