Jump to content

Gale–Church alignment algorithm: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m Changed a hyphen to semicolon
Adding short description: "Parallel text alignment algorithm"
 
Line 1: Line 1:
{{Short description|Parallel text alignment algorithm}}
In [[computational linguistics]], the '''Gale–Church algorithm''' is a method for aligning corresponding sentences in a [[parallel corpus]]. It works on the principle that equivalent sentences should roughly correspond in length; that is, longer sentences in one language should correspond to longer sentences in the other language. The [[algorithm]] was described in a [https://web.archive.org/web/20061026051708/http://acl.ldc.upenn.edu/J/J93/J93-1004.pdf 1993 paper] by [[William A. Gale]] and Kenneth W. Church of [[Bell Labs|AT&T Bell Laboratories]].
In [[computational linguistics]], the '''Gale–Church algorithm''' is a method for aligning corresponding sentences in a [[parallel corpus]]. It works on the principle that equivalent sentences should roughly correspond in length; that is, longer sentences in one language should correspond to longer sentences in the other language. The [[algorithm]] was described in a [https://web.archive.org/web/20061026051708/http://acl.ldc.upenn.edu/J/J93/J93-1004.pdf 1993 paper] by [[William A. Gale]] and Kenneth W. Church of [[Bell Labs|AT&T Bell Laboratories]].



Latest revision as of 23:35, 14 September 2024

In computational linguistics, the Gale–Church algorithm is a method for aligning corresponding sentences in a parallel corpus. It works on the principle that equivalent sentences should roughly correspond in length; that is, longer sentences in one language should correspond to longer sentences in the other language. The algorithm was described in a 1993 paper by William A. Gale and Kenneth W. Church of AT&T Bell Laboratories.

References

[edit]
[edit]
  • Gale, William A.; Church, Kenneth W. (1993), "A Program for Aligning Sentences in Bilingual Corpora" (PDF), Computational Linguistics, 19 (1): 75–102