PDF] Arabic to French Sentence Alignment: Exploration of A Cross
Por um escritor misterioso
Last updated 22 outubro 2024
A new approach to aligning sentences from a parallel corpus based on a cross-language information retrieval system is presented and it is shown that alignment has correct precision and recall even when the corpus is not completely parallel. Sentence alignment consists in estimating which sentence or sentences in the source language correspond with which sentence or sentences in a target language. We present in this paper a new approach to aligning sentences from a parallel corpus based on a cross-language information retrieval system. This approach consists in building a database of sentences of the target text and considering each sentence of the source text as a "query" to that database. The cross-language information retrieval system is a weighted Boolean search engine based on a deep linguistic analysis of the query and the documents to be indexed. This system is composed of a multilingual linguistic analyzer, a statistical analyzer, a reformulator, a comparator and a search engine. The multilingual linguistic analyzer includes a morphological analyzer, a part-of-speech tagger and a syntactic analyzer. The linguistic analyzer processes both documents to be indexed and queries to produce a set of normalized lemmas, a set of named entities and a set of nominal compounds with their morpho-syntactic tags. The statistical analyzer computes for documents to be indexed concept weights based on concept database frequencies. The comparator computes intersections between queries and documents and provides a relevance weight for each intersection. Before this comparison, the reformulator expands queries during the search. The expansion is used to infer from the original query words other words expressing the same concepts. The search engine retrieves the ranked, relevant documents from the indexes according to the corresponding reformulated query and then merges the results obtained for each language, taking into account the original words of the query and their weights in order to score the documents. The sentence aligner has been evaluated on the MD corpus of the ARCADE II project which is composed of news articles from the French newspaper "Le Monde Diplomatique". The part of the corpus used in evaluation consists of the same subset of sentences in Arabic and French. Arabic sentences are aligned to their French counterparts. Results showed that alignment has correct precision and recall even when the corpus is not completely parallel (changes in sentence order or missing sentences).
Evaluation of Tau Radiotracers in Chronic Traumatic Encephalopathy
Respect Life Denver - Catholic Charities of Denver
Entropy, Free Full-Text
PDF) Arabic to French Sentence Alignment: Exploration of a Cross-language Information Retrieval Approach
All Eyes on Egypt: Islam and the Medical Use of Dead Bodies Amidst Cairo's Political Unrest: Medical Anthropology: Vol 35, No 3
New and enhanced features Latest release of InDesign
PDF] Arabic to French Sentence Alignment: Exploration of A Cross-language Information Retrieval Approach
Kafalah by ISS/IRC - Issuu
Planet Mozilla
Transition Metal Product Market in 2031: Exploring Growth Avenues with Top Key Players
Recomendado para você
-
First Grade Wow: Cross Checking22 outubro 2024
-
Cross-check - Definition, Meaning & Synonyms22 outubro 2024
-
Perception Checking: 15 Examples and Definition (2023)22 outubro 2024
-
Read each sentence. put a check in the box if the sentence is TRUE22 outubro 2024
-
Essential Elements of Technical Writing: A Guide for Technical22 outubro 2024
-
Mad Libs Criss Cross A Silly Sentence Game Part Games Family Fun22 outubro 2024
-
Decodable Readers Multisyllables Open Syllables Books and Lesson22 outubro 2024
-
Scrabble Quip Qubes Word cross sentence Board Game 198122 outubro 2024
-
Metacommentary: Definition and Examples (2023)22 outubro 2024
-
Animal Adaptations Modified Assignment (Project-based Learning Accompanimen22 outubro 2024
você pode gostar
-
The Beloved Country - South African Stories22 outubro 2024
-
Every Connection Between Hideo Kojima's OD & Silent Hill (So Far)22 outubro 2024
-
Tengoku Daimakyou - Dublado - Heavenly Delusion, Tengoku Daimakyou: Ilusão Celestial22 outubro 2024
-
Memphis Depay da Holanda, comemora o seu gol durante a partida22 outubro 2024
-
Pokemon Ultra Arceus X GX22 outubro 2024
-
The 11 Best Jack Black Movies of All Time - IGN22 outubro 2024
-
Fundy Avatar' Sticker22 outubro 2024
-
Pesque pague e club Raio de Sol - Águas Lindas de Goiás - GO22 outubro 2024
-
SPEEDLINK SL-650212-BKRD Competition PRO EXTRA USB Joystick - Anniversary Edition, Retro-Arcade-Stick, schwarz-rot22 outubro 2024
-
Quanto vale o Campeonato Paulista: Descubra os valores que cada22 outubro 2024