
WORD ALIGNMENT METHOD AND SYSTEM FOR IMPROVED VOCABULARY COVERAGE IN STATISTICAL MACHINE TRANSLATION (Invention application)


Patent information

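Application date: 2025-07-07 | Application number: FR11055175
Publication number: FR2961325A1 | Publication date: 2011-12-16
Publication country: FR | Applicant province/city code: National
Applicant: XEROX CORP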
Abstract: A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus supplies pairs of text strings, primarily sentences, in a source and a target language. A first alignment of a text string pair creates links between the two strings, each link connecting a single token of the first text string to a single token of the second. A second alignment also creates links between the same pair; in some cases these links may correspond to bi-phrases. A modified first alignment is generated by selectively modifying those links in the first alignment that include a word which is infrequent in the corpus, based on the links generated in the second alignment. This removes at least some of the links for the infrequent words, allowing more compact, better-quality bi-phrases with higher vocabulary coverage to be extracted for use in a machine translation system.
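The modification step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact rule, so the following is an assumption for illustration only, not the patented method: a link in the first alignment that touches an infrequent word is kept only if the second alignment contains the same link. The function names (`count_tokens`, `prune_rare_links`), the `max_rare` threshold, and the representation of both alignments as sets of (source index, target index) pairs are all hypothetical; a bi-phrase in the second alignment is assumed to have been expanded into such pairs.

```python
from collections import Counter


def count_tokens(corpus):
    """Count how often each token occurs in a list of tokenized sentences."""
    return Counter(tok for sent in corpus for tok in sent)


def prune_rare_links(src, tgt, first, second, src_freq, tgt_freq, max_rare=1):
    """Return a modified copy of the first alignment.

    Assumed rule (not taken from the patent text): a link (i, j) that
    involves a word occurring at most `max_rare` times in the corpus is
    kept only if the second alignment also contains (i, j). Links between
    frequent words are left untouched.
    """
    kept = set()
    for i, j in first:
        rare = src_freq[src[i]] <= max_rare or tgt_freq[tgt[j]] <= max_rare
        if rare and (i, j) not in second:
            continue  # drop the rare-word link that the second alignment does not support
        kept.add((i, j))
    return kept


# Toy parallel corpus (invented for this example).
src_corpus = [["the", "cat", "sleeps"],
              ["the", "dog", "sleeps"],
              ["the", "ocelot", "sleeps"]]
tgt_corpus = [["le", "chat", "dort"],
              ["le", "chien", "dort"],
              ["l'", "ocelot", "dort"]]
src_freq = count_tokens(src_corpus)
tgt_freq = count_tokens(tgt_corpus)

src, tgt = src_corpus[2], tgt_corpus[2]
# First alignment: the rare word "ocelot" is also linked to "dort".
first = {(0, 0), (1, 1), (1, 2), (2, 2)}
# Second alignment: only the one-to-one links.
second = {(0, 0), (1, 1), (2, 2)}

pruned = prune_rare_links(src, tgt, first, second, src_freq, tgt_freq)
print(sorted(pruned))
```

In this toy pair, the spurious link from the rare word "ocelot" to "dort" is removed, while the links supported by the second alignment remain. Fewer stray links on rare words leave smaller consistent spans around them, which is the effect the abstract attributes to the modification: more compact bi-phrases can then be extracted.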

