WO2017159906A1 - Data structure for determining translation order of words included in source language text, program for generating data structure, and computer-readable storage medium storing same - Google Patents

Data structure for determining translation order of words included in source language text, program for generating data structure, and computer-readable storage medium storing same Download PDF

Info

Publication number
WO2017159906A1
WO2017159906A1 PCT/KR2016/002909 KR2016002909W WO2017159906A1 WO 2017159906 A1 WO2017159906 A1 WO 2017159906A1 KR 2016002909 W KR2016002909 W KR 2016002909W WO 2017159906 A1 WO2017159906 A1 WO 2017159906A1
Authority
WO
WIPO (PCT)
Prior art keywords
translation
order
small
units
unit
Prior art date
Application number
PCT/KR2016/002909
Other languages
French (fr)
Korean (ko)
Inventor
이시용
Original Assignee
이시용
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 이시용 filed Critical 이시용
Publication of WO2017159906A1 publication Critical patent/WO2017159906A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Definitions

  • the present invention relates to determining a translation word order between an original sentence and a translated sentence when the original language sentence is translated into a target language sentence.
  • the present invention relates to translation word order pattern data that is divided into small translation units that are translation subunits that match the translation, and determines the translation word order between the small translation units.
  • a translation order pattern data structure for determining a translation order in small translation units constituting translation subunits of an original sentence, a computer readable storage medium storing computer executable instructions for generating the translation sequence, and a translation program using the instructions It is about.
  • the translator can translate the original text into multiple translations that convey the same meaning, whereas the existing automatic translation outputs only one universal translation result that can understand the original text for the same sentence.
  • a professional translator has a problem that cannot be automatically translated while reflecting his / her translation personality in the process of automatic translation.
  • the present invention prior to translation by analyzing the original language stemming from the original sentence in the original language to the translation sentence in the target language and by dividing the original sentence into small translation units which are translation subunits to determine the translation word order in advance.
  • the purpose is to provide a predetermined translation order.
  • the purpose of the present invention is to determine the translation order in order to increase the accuracy of automatic translation up to the level of automatic translation between languages of the same system such as Japanese and Korean, in order to increase the accuracy of automatic translation between languages with different translation order. have.
  • a translation word order pattern data structure stored in a computer-readable storage medium which is used for translation in a translation apparatus for translating an original sentence into a translated sentence according to an aspect of the present invention, includes a plurality of small translation units from the first sentence to the end of the original sentence.
  • Sub-translational unit division pattern data for dividing into sub-translation units wherein the sub-translational unit division pattern data includes one or more parts of morphemes obtained by morphological analysis of the original sentence, and each of the plurality of sub-translation unit division pattern data. Contains the specified translation sequence number data.
  • a translation order pattern data structure stored in a computer-readable storage medium translates small translation units divided from original text by small translation unit division pattern data, and the small translation units are translated according to translation order sequence data. It further includes output data outputting the translated text of the original text by sorting the small translations of each translation unit.
  • each of the small translation unit division pattern data includes a division pattern instructing the start of division into each small translation unit, a division part indicating the beginning of the division, and an end of division into each small translation unit.
  • a small translation unit segmentation pattern beginning portion comprising one or more parts of speech located at the beginning of the morphological sequences of the small translation unit segmentation pattern data, and a segmentation pattern ending portion being positioned last in the morphological sequence of the small translation unit segmentation pattern data It may include one or more parts of speech, and the middle portion may be omitted.
  • the small translation unit division pattern start part is composed of parts of speech located at the front of the morphological sequences of the small translation unit division pattern data
  • the division pattern end part is the last of the morphological sequences of the small translation unit division pattern data. Consists of positioned parts of speech, and the intermediate portion may be omitted.
  • the beginning of the sub-translational unit division pattern consists of one or two of the morphological sequences of the sub-translational unit division pattern data
  • the end of the division pattern is the last of the morphological sequences of the sub-translation unit division pattern data. It consists of parts of speech from the noun to the end of the morphological sequence of the sub-translational unit division pattern data, and the middle part may be omitted.
  • a sentence translation including a survey, ending, or prepositional band word to be positioned at the end of a small translation unit of a small translation unit to be divided by the small translation unit division pattern data includes the plurality of small translation unit division pattern data. Specified in one or more of these.
  • the original text sentence (1) is used for translation in a translation apparatus for translating the translated text sentence (2), and the original text sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units
  • a computer readable storage medium having stored thereon computer executable instructions for generating a translation order pattern data structure for determining a structure, the computer executable instructions comprising dividing an original sentence into a plurality of small translation units from the beginning to the end and Generating small translation unit division pattern data (300) from translation units, wherein the small translation unit division pattern data (300) includes at least one part-of-speech (part of speech) morphologically analyzed from the original text; And generating translation word order data (5) designated in each of the plurality of small translation unit division pattern data (300).
  • a computer-readable storage medium including instructions for generating a translation order pattern data structure, the instructions translate small translation units divided from original text by small translation unit division pattern data, And sorting the small translations, which are translations of each of the small translation units, according to the translation order number data, and outputting a translation sentence of the original sentence.
  • Computer readable storage of computer executable instructions for generating a translation order pattern data structure stored in a computer readable storage medium used for translation in a translation apparatus for translating original text into translated text according to an aspect of the present invention Generating, by the computer executable instructions, small translation unit division pattern data for dividing the original sentence into a plurality of small translation units from the beginning to the end; the small translation unit division pattern data stemming the original sentence. Generating at least one part-of-speech of the analyzed morpheme sequences; and generating translation order sequence data specified in each of the plurality of sub-translational unit division pattern data.
  • a computer readable storage medium having stored thereon computer executable instructions for generating a translation pattern data structure, the computer executable instructions comprising: a) retrieving an original text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code.
  • Computer executable instructions for dividing the original text into small translation units which are translation subunits, used in a translation apparatus for translating the original text sentence into a translation sentence, and for determining the translation order of the small translation units according to an aspect of the present invention.
  • Computer readable storage medium having stored thereon the computer executable instructions comprising: a) retrieving a text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code.
  • Small translation unit segmentation pattern data A translation order sequence data structure that generates a plurality of small translation unit division pattern data including one or more parts of speech, and a translation sequence sequence data structure including a translation sequence number assigned to each of the plurality of small translation unit division pattern data. To generate;
  • the original text is divided into small translation units which are translation subunits, and the translation order of the small translation units is used for translation in the translation apparatus for translating the original text sentence into a translation sentence according to an aspect of the present invention.
  • a computer readable storage medium having stored computer executable instructions for determining, the computer executable instructions comprising: a) retrieving a text sentence and displaying it to a user; b) stemming the original text and tagging parts of speech into the analyzed morphemes, wherein stemming the original text and tagging the parts of speech comprises stemming the original code and tagging the original code; c) a first signal including the specific position information in the original sentence and indicating a sequential translation order in ascending order to the front and rear sub-translation units divided based on the specific position, and the specific in the original sentence or sub-translation unit Receiving a second signal including position information and indicating a reverse translation order in descending order to front and rear sub-translation units divided based on the specific location;
  • a translation word order pattern data structure for generating a plurality of small translation unit division pattern data including at least one part of speech, and a translation order sequence data structure including a translation order number assigned to each of the plurality of small translation unit division pattern data Generating a structure.
  • the step of dividing the original text into small translation units and determining the translation order of the small translation units may include: a small translation unit to which a specific position indicated by the first signal belongs according to the first signal; Alternatively, the original sentence is divided into two front and rear sub-translation units based on the specific position, and the sequential order of ascending order is determined in the two sub-translation units divided, and the sequential order between the divided front and rear sub-translation units is divided. It is determined by the translation order number originally possessed by the small translation unit and the translation order number by which 1 is increased to the translation language sequence, and other small translation unit other than the small translation unit to which the specific position indicated by the first signal belongs.
  • Another translation unit having the same or greater order if the translation order is the same or greater than the increased order Increase the number of translation order specified in the field by 1, and convert the sub-translation unit or original sentence belonging to a specific position indicated by the second signal according to the second signal into two front and rear sub-translation units based on the specific position.
  • the translation order number of the small translation unit other than the small translation unit to which the specific position indicated by the first signal belongs is determined by the word order number and the translation order number originally possessed, and is equal to or greater than the increased order number. Increasing the translation order assigned to other small translation units having the same or larger order by 1 It should.
  • a computer-readable storage medium storing computer executable instructions for dividing an original text into sub-translation units that are translation sub-units and determining translation order of the sub-translation units, according to an aspect of the present invention.
  • Heard a) retrieving the original text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code.
  • Hamm And c) comparing the morphological sequences of the sub-translational unit division pattern data of the translated word-order pattern data stored in the translated word-order pattern DB including the translated word-order pattern data structure to retrieve matching word-order pattern data.
  • a translation program according to an aspect of the present invention is stored in a computer-readable storage medium that divides the above original text into small translation units which are translation subunits and stores computer executable instructions for determining the translation order of the small translation units. Perform the translation using the commands.
  • the original text sentence (1) is used for translation in a translation apparatus for translating the translated text sentence (2), and the original text sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units
  • the translation order pattern data may determine the translation order between heterogeneous languages, such as refractive art, interlocking words, and independent languages, as well as translation order between homogeneous languages (eg, Korean and Japanese). Since the translation order pattern data of the present invention plays a role of treating the languages of heterogeneous lines like the word order of the same language, such as English and Korean, English and Japanese, the accuracy of automatic translation between heterogeneous line languages is the same. It can increase the accuracy of automatic translation.
  • heterogeneous languages such as refractive art, interlocking words, and independent languages
  • homogeneous languages eg, Korean and Japanese
  • the translator when the translator divides the original text into sub-translational units which are translation subunits and determines the translation sequence order in these sub-translation units, the translator stores the translation sequence stored by the user in a later translation. It is available.
  • a translation beginner may complete a translation sentence according to a translation word order divided into small translation units before translation using a translation word order DB accumulated by another person such as a translation expert.
  • the translation order pattern data structure includes the small translation unit division pattern data and the translation order sequence data, and the small translation unit division pattern data and the translation order sequence data are information on each small translation unit division pattern in the translation order structure.
  • the term refers to data having data and data having information on a sequence number of a translation.
  • “translation order sequence data”, “small translation unit division pattern data” and “translation order sequence data” are referred to herein as “translation order sequence pattern”, “small translation unit division pattern”, and "word order”. Used in combination with
  • the translation word order pattern of the present invention can be used regardless of the target language because the translation order is assigned to the division patterns for dividing the original text into small translation units which are translation subunits. For example, if the original language is English and the target language is Korean, the original English sentence is analyzed and the Korean translation order for the original English is specified. Applicable That is, the translation order pattern obtained when translating English into Korean can be used as it is in translation from English to Japanese.
  • FIG. 1A is a diagram illustrating an example of dividing an original text included in a translation apparatus for translating an original text into a translation into small translation units, which are translation subunits, and storing computer executable instructions for determining the translation order of the small translation units, according to an embodiment of the present invention.
  • FIG. 1B is an enlarged view of a small translation unit and a translation word order determining unit of FIG. 1A.
  • FIG. 1C is an enlarged view of a small translation and a translation generation unit of FIG. 1A.
  • FIG. 1D is an enlarged view of the user interface unit of FIG. 1A.
  • FIG 2A illustrates the original and translated sentences used for embodiments of the present invention.
  • 2B illustrates one embodiment of a compound sentence division word order according to the present invention.
  • 2C illustrates one embodiment of a short division word order rule in accordance with the present invention.
  • FIG. 2D illustrates an embodiment morphologically analyzing the original text of FIG. 2A and tagging the part-of-speech.
  • 3A to 3H illustrate a process of dividing the original text of FIG. 2A into a plurality of small translation units according to a division word order rule, determining a translation order number for each small translation unit, and then generating a small translation.
  • a division word order rule determining a translation order number for each small translation unit, and then generating a small translation.
  • An example is shown.
  • FIG. 3I illustrates an embodiment of dividing an original sentence into three small translation units in FIG. 3C, determining a translation order, and translating a small translation for each small translation unit to complete a translation.
  • FIG 4A illustrates an embodiment of translation word pattern data according to the present invention.
  • 4B illustrates exemplary embodiments of expanded word order pattern data in which a middle portion of a split pattern is omitted based on the word order pattern data and the word order pattern data according to an embodiment of the present invention.
  • FIG. 4C illustrates an embodiment in which the translation order pattern data including the split pattern data and the order data specified therein further includes a sentence.
  • FIG. 4D illustrates embodiments of translation word pattern data generated after division into small translation units of FIGS. 3B to 3F according to an embodiment of the present invention.
  • FIG. 4E illustrates embodiments of translation order pattern data extendable from the translation order pattern of FIG. 4A in accordance with an embodiment of the present invention.
  • 5A to 5M illustrate that the translation order is divided into small translation units for the original text in the small translation unit and the translation order determination unit, and the translation order is determined for each small translation unit according to an embodiment of the present invention.
  • Figure 5n is divided into small translation units for the original text according to an embodiment of the present invention after determining the translation order number for each small translation unit after the translation is generated by the automatic translation in the small translation and translation generation unit Illustrated.
  • 6A to 6H illustrate a process of generating a translation sentence based on the stored translation order pattern data after the original text of FIG. 2A is stored in the translation order pattern DB as the translation order patterns of FIGS. 4A to 4D.
  • FIG. 7A illustrates an embodiment of a method for dividing an original text into small translation units, which are translation subunits, and determining a translation word order of the small translation units, according to an exemplary embodiment of the present invention.
  • FIG. 7B illustrates an embodiment of a method and a translation apparatus using the same for dividing an original text into small translation units, which are translation subunits, and determining the translation word order of the small translation units, according to an embodiment of the present invention.
  • 7C is another embodiment of a method for dividing an original text into small translation units, which are translation subunits, and determining a translation word order of the small translation units, according to an embodiment of the present invention, and a translation apparatus using the same.
  • the translation order pattern data structure stored in a computer-readable storage medium used for translation in a translation apparatus for translating an original sentence (1) into a translation sentence (2) is composed of a plurality of small translation units from the beginning to the end of the original sentence.
  • the small translation unit division pattern data 300 for division, and the translation order sequence data 5 specified in each of the plurality of small translation unit division patterns 300 are included.
  • it translates the small translation units divided from the original sentence sentence 1 by the small translation unit division pattern data 300, and the small translation sentences (4) which are the translations of each of the small translation units according to the translation order.
  • the output data outputting the translated sentence 2 of the original sentence may be stored in a computer readable storage medium, for example, a memory, a hard disk, etc., and displayed on the display unit 141 for display to the user, or in the order of translation. In order to generate the pattern data, it may be transmitted to the translation word order pattern generation unit 150 or the small translation memory and the translation memory DB.
  • 4A is a translation word order pattern (401, 405) specifying a translation order in order for a morphological analysis of original texts and small translation unit division patterns obtained by converting the small translation units divided in FIG. 3F into morpheme sequences according to an embodiment of the present invention; Exemplary embodiments of the expanded translation order patterns 402-404 and 406-408, which omit the split pattern middle part based on the translation order patterns, are illustrated.
  • FIG. 4A AR AANN 301, VV AR AN 321, R AR DAN PP N 361, PL PP VAN PR ( 371), V PV SY 381 and D PP N RV SY 391 are the six small translation unit division patterns for the small translation units 30, 32, 36, 37, 38, 39 of FIG. 300, translation word sequence numbers 5 are assigned to each of the small translation unit division patterns 300.
  • FIG. The small translation unit (3) divided by the small translation unit division pattern (300) in the translation apparatus or translation program of the present invention is translated, and the small translation unit (3) according to the translation order (5) When the small translations 4 which are the translations of each of the two are arranged, the translated sentence 2 of the original sentence 1 is obtained.
  • Each of the sub-translational units (3) includes a sub-translational unit division pattern that includes one or more parts of speech in the morphological sequence of the sub-translational unit through morphological analysis and part-of-speech tagging for use in dividing other original text into sub-translational units. 300).
  • each of the small translation units 3 may be represented by a small translation unit division pattern 300 including one or more of a source word, a code, and a part-of-speech that parsed them (including 'for'). 4b second row).
  • Each of the small translation unit division patterns 300 is applied as a division rule for dividing the small translation unit into small translation units when there are morpheme sequences that match each of the small translation unit division patterns among the morpheme sequences obtained by morphological analysis of another sentence.
  • the translation word order of the divided sub-translation units is determined according to the relations with the parts of speech of the front-rear sub-translation units forming a boundary with the sub-translation unit division pattern.
  • the translation word order may be determined by designating the word order between the small translation units that are divided while the user divides one original text into a plurality of small translation units.
  • the user when the sub-translation units divided by the original text and the translation order assigned thereto are displayed to the user, and the user corrects an incorrect word order and inputs the translation-translation order assigned to the small translation units and each small translation unit, the user The translation order sequence is determined from the sub-translation units entered by the user and the translation order assigned to each sub-translation unit.
  • the original text is divided into sub-translational units (3), which are translation sub-units, and each division pattern 300 which is morphologically analyzed as / 1, / 6, / 3, / 2, / 5, and / 4.
  • a translation word order (5) is specified.
  • These translation order patterns are stored in the translation order pattern DB 160, and when the same small translation unit division pattern appears when translating another sentence, the other sentences are divided into small translation units by the same small translation unit division pattern. If all six small translation unit division patterns are matched, the same original translation order pattern is applied, and the original sentence is divided into small translation units only by the translation word order without the division word order decision rule that can be mechanically divided.
  • the order of translation can be determined between units.
  • the translation order patterns 402 to 404 and 406 to 408 derived based on the translation order patterns 401 and 405 may be generated in the translation order pattern generating unit 150.
  • the small translation unit division pattern included in the translation word order pattern of FIG. 4A includes a division pattern start part including one or more parts of the division pattern morpheme sequences, a division pattern middle part, and an end division pattern including one or more parts of the division pattern morpheme sequences. It shows what consists of wealth.
  • the division pattern start part and the division pattern end part may be composed of one or more parts of speech included in the small translation unit division pattern, and the middle part of the division pattern may be omitted.
  • the translation word order pattern generating unit 150 omits the middle part of the split pattern, and includes small translation unit split patterns including a split pattern start part and a split pattern end part, and a translation word sequence number specified in the small translation unit split patterns.
  • a translation word order pattern can be generated.
  • the beginning of the division pattern includes one or more parts of speech located at the beginning of the morphological sequence of the sub-translational unit, from which the division into the sub-translational units from the beginning to the end of the original sentence begins.
  • the segmentation pattern end part includes one or more parts of speech located last in the morphological sequence of the small translation unit, in which the division into the small translation units is terminated.
  • the segmentation part is between the pre-positional part and the post-partial part-of-speech.
  • the original text can be divided into sub-translational units by the sub-translational unit division pattern.
  • the beginning of the sub-translational unit division pattern corresponds to the beginning of the division pattern of the sub-translational unit including post-parts of the front and rear sub-translation units divided by the divisional word order division pair.
  • the end portion of the sub-translational unit division pattern corresponds to the end portion of the sub-translational unit division pattern including prepositions among the front and rear sub-translation units divided by the divisional word order division pair.
  • the first small translation unit split pattern AR *** N has one split pattern start part AR and one split pattern end part N, respectively. It consists of parts of speech, and the middle part of the division pattern (***) indicates that three parts of speech are omitted.
  • Split pattern middle part skipped In the first row 403 of the extended translation order pattern 2 the first sub-translational unit split pattern AR A *** N has two parts-of-speech at the beginning of the split pattern AR A and the split pattern end part ( N) is a part-of-speech part N to the end of the morpheme sequence of the split pattern, and the middle part (**) of the split pattern indicates that two parts of speech are omitted.
  • the split pattern end part N PR of the fourth split pattern is represented by a part-of-speech PR (bracket) from the noun N to the end. Omitting the split pattern middle part
  • the first row 404 of the translation word order pattern 3 includes the split pattern start part (AR, VR, PL, V, D) 302 and the split pattern end part (N, N). , N, PR, SY, RV) 303, and the middle part of the division pattern is represented by a form (-) in which all the division patterns are omitted.
  • each division pattern is represented by a number counting from the beginning to the part-of-speech position 395 where the division unit ends.
  • the translation word order pattern may be represented by the division unit 395 and the order number 5 assigned to the division unit. This notation is useful for retrieving translation order patterns with morphological sequences that match the morphological sequences of the original text.
  • the original text can be divided into sub-translational units.
  • FIG. 4B is derived based on a translation word order pattern and a translation word order pattern including a translation word order in a small translation unit division pattern divided into small translation units of FIG. 3F according to an embodiment of the present invention. Embodiments of the expanded translation order pattern omitting the middle part of the division pattern.
  • FIG. 4B is a translation order in which the last small translation unit division pattern D PP N RV SY 391 of the translation word order pattern of FIG. 4A is changed to a small translation unit division pattern including a preposition for and one or more parts of speech. Patterns 409-416 are shown.
  • the translation pattern including one or more original words and parts of speech may be a small translation unit division pattern because the translation subunit of the original sentence may be a translation unit.
  • FIG. 4C illustrates embodiments of a segmented word order pattern further including a lateral translation in a translation word order pattern generated after translation of small translations of each of the small translation units in FIG. 3H according to an exemplary embodiment of the present invention.
  • Translations of the terminology in Korean are investigated such as 'silver', 'a', 'this', 'a', 'a', and 'a', a ending such as 'a', 'a', and 'being', or May include prepositional translations such as for and about.
  • the literary translation means the translation of the small translations, which are the translations of each of the small translation units.
  • the translated word order is defined by the translation order, and the translation order of the small translation units is determined.
  • the sentence translation 396 is not specified for each division pattern like the word order, and may be omitted as necessary if the translation sentence 396 is the same as the sentence translation automatically translated. Through the sentence translation, it is possible to grasp the sentence component of the small translation unit division pattern. For example, if the grammar translation 396 in Korean is' silver 'and' is', it can be seen that the sub-translational unit is the main subject, and if 'ul' ''' 'is the object.
  • FIG. 4E illustrates embodiments of the derived translation word order pattern, such as the split pattern middle portion omitted extended word order patterns 402-404, 406-408 of FIG. 4A.
  • the extended translation word order pattern generating unit 150 may generate the extended translation word order pattern.
  • These extended translation order patterns are stored in the translation order patterns DB 160.
  • the translation order pattern DB 160 may be analyzed to find a regularity in the morpheme sequences of the division pattern and the translation order, thereby expanding the translation order pattern.
  • the generation of the extended translation order pattern can find a regularity by analyzing the translation order patterns of the translation order pattern DB 160, merging the split patterns from the already determined translation order pattern, and then assigning the missing number to the next largest order. It can be generated by decreasing one by one so that the order of the small translation unit division pattern having.
  • the arithmetically divided translation sequence pattern divided into six sub-translational units is merged into two sub-translational units and merged into five, four, three, and two sub-translational unit division patterns.
  • 4D illustrates translation word order patterns 430-441 having two to six division patterns from the original sentence of FIG. 2A, and an abbreviated translation translation order pattern 442-453 in the middle of the division pattern.
  • FIG. 4E illustrates that the translation order is again determined by omitting the parentheses in FIG. 4A that do not affect the translation order.
  • PL PP V A N PR / 2 is omitted, translation order numbers larger than 2 are reduced by one, thereby determining a new translation order number.
  • the fifth row (480-484) of FIG. 4E shows the AR AN / 1, V AR AN / by merging the relative rounds of the translation order sequential / 2, / 3 and / 4 from the translation order sequence pattern of FIG. 4A into one small translation unit.
  • An extended translation order pattern consisting of 3 and R Clause / 3 is shown.
  • the extended translation order pattern can be extended not only in relation clauses but also in rules of translation order patterns specified in sub-translational unit division patterns including specific words such as specific that clauses, which clauses, noun clauses, adverb phrases, prepositional phrases, and verb phrases. have.
  • FIG. 1A is a diagram illustrating a computer executable instruction for dividing a source sentence into sub-translation units, which are translation sub-units, and for determining translation order of the sub-translation units in an apparatus or program for translating the original sentence into a translation sentence, according to an embodiment of the present invention.
  • One embodiment is a storage medium, program, or device.
  • a storage medium, a translation program, a method and an apparatus for dividing the original text into sub-translation units, which are translation sub-units, and storing computer executable instructions for determining the translation order of the sub-translation units may be a personal computer, a mobile device, a server, or Two or more devices of may be used on a combined network.
  • the translation order pattern data structure of the present invention, a storage medium storing the computer executable instructions, or a translation program may be distributed through a server in a downloadable form.
  • a computer readable storage medium refers to a computer readable physical component or material in which data is stored optically, magnetically, or by other means, such as a computer hard disk, memory, SSD, USB, or the like.
  • FIG. 2A shows the original sentence 1 used in the present invention and the translated sentence 1 for the original.
  • the original sentence receiving unit 100 calls the original sentence 1 to be translated.
  • the morpheme analysis and part-of-speech tagging unit 110 receives the original sentence from the original sentence receiving unit 100 to perform morphological analysis and tag the part-of-speech.
  • Morphological analysis is the first step in the analysis of natural language.
  • the morphological analysis is the process of converting the original text into a morphological sequence.
  • a morpheme is the smallest unit of meaning, the smallest semantic element that can no longer be analyzed.
  • morphemes are words or parts of words that represent grammatical or relational meanings, as well as roots, primitives, surveys, prefixes, and suffixes of simple words.
  • the text symbols that make up the original sentence such as punctuation marks (',') and semicolons (';'), are also tagged with punctuation and semicolons and are treated like parts of speech.
  • Korean when Korean is the original language, parts of Korean morphemes are divided into 'nouns, pronouns, verbs, adjectives, adjectives, adverbs, interjections, investigations, endings, affixes, roots, signs, and other Korean parts of speech.' Can be tagged.
  • the parts of Chinese morphemes can be divided into 'nouns, pronouns, rhetoric, quantums, verbs, adjectives, adverbs, prepositions, investigations, signs, other than Chinese', and tag each part of speech.
  • the morpheme analysis and part-of-speech tagging unit 110 processes multiple nouns such as nouns and morphological sequences of nouns into one noun, or chunking a plurality of parts-of-speech into a part-of-speech by tying the morphological sequences of prepositions and related parties into related parts. ) May include a unit.
  • the morpheme sequence obtained by morphological analysis of the original sentence and part-of-speech tagging is inputted into the small translation unit and the translation word order determination unit 120, and is divided into the small translation units, and the translation order is determined in each small translation unit.
  • the small translation unit and the translation word order determination unit 120 include a translation word order pattern matching unit 121 and a mechanical segmentation and word order determination unit 122.
  • the translation word order matching unit 121 checks whether there is a translation word order pattern (401-484) including a small translation unit segmentation pattern that matches the morpheme sequence of the original sentence.
  • a translation word order pattern (401-484) including a small translation unit segmentation pattern that matches the morpheme sequence of the original sentence.
  • the pattern DB 160 is searched and there is a translation word order pattern 401-484 that is matched in whole or in part with the translation word order pattern DB 160, the small translation included in the matched translation word order pattern 401-484
  • the original text is divided into small translation units according to the unit division pattern, and the translation order is determined in each small translation unit.
  • the translation order pattern DB 160 is one of computer-readable storage media recording the translation order patterns 401-484.
  • the morphological sequence of the original sentence is matched globally, it means that the morphological sequence of all the sub-translational unit division patterns of the translated word sequence pattern matches the morphological sequence of the original text.
  • it means a translation word order pattern in which the morphological sequences of the remaining small translation unit division patterns except one of all the small translation unit division patterns are matched. Even if one of all the sub-translational unit division patterns is different, other matching division patterns except the other one and their sequence numbers are matched, so that the one that does not match from the matching division patterns and sequence numbers is matched. This is because the division pattern and the sequence number can be inferred.
  • the partially matched translation word order pattern may include a small translation unit division pattern beginning and a small translation unit composed of one or more of the morphological sequences of each of the small translation units when the original text is divided into small translation units.
  • the translation word order pattern is matched with the end of the small translation unit division pattern consisting of one or more positions positioned at the end of each morpheme sequence.
  • the small translation unit division patterns composed of the small translation unit division pattern start part and the small translation unit division pattern end part, and the translation word order pattern including the sequence number assigned to each of these division patterns are not matched with the morphological sequence of the original sentence as a whole. If only the start and end portions of each of the small translation unit division patterns are matched, the original text can be divided into small translation units, and the small translations are divided according to the translation word order assigned to the division patterns including the beginning and end sections.
  • the order of translation can be determined in units. Therefore, when the beginning and end of the morphological sequence of each sub-translation unit of the original sentence match all the division patterns including the division pattern start part and the division pattern end part included in the matching translation order pattern, the original sentence is divided. It can be used as the word order determination pattern to determine the order of division.
  • the mechanical division and word order determination unit 122 divides the original text into a plurality of small translation units according to the division word order rules as shown in FIGS. 2B and 2C and determines the translation word order number in the divided small translation units (FIG. 3F). Reference). Mechanical partitioning and translation ordering are further described in the section describing FIGS. 3A-3I.
  • the translation word order pattern matching unit 121 and the mechanical segmentation and word order determination unit 122 may be applied to each other. That is, when the translation order pattern matching unit 121 searches the translation order pattern DB 160 and there is no matched translation order pattern, the original sentence is mechanically translated into small translation units through the mechanical segmentation and word order determination unit 122. Can be divided and the translation order of each small translation unit can be determined. Further, after mechanically dividing the original text into small translation units through the mechanical segmentation and word order determination unit 122 and determining the translation order of each small translation unit, the translation order pattern DB is translated into the translation order pattern matching unit 121.
  • the small translation units determined through the mechanical segmentation and word order determination unit 122 and their respective designated order numbers are reset and retrieved from the translation word order matching unit 121.
  • Translation order pattern can be applied.
  • the original text is mechanically divided into small translation units through the mechanical segmentation and word order determination unit 122, and divided into the small translation units divided through the translation word pattern matching unit 121. The order of translation can be determined.
  • the translation sequence pattern matching unit 121 searches for the translation sequence pattern DB 160, and if there is a translation sequence pattern including the matching morpheme sequences, the small translation units
  • the division into a row is performed through the mechanical division and word order determination unit 122, and the translation word order of the divided small translation units may be determined as the translation word order of the translation word order pattern imported from the translation word order pattern matching unit 121. .
  • the small translation unit and the translation word order determination unit 120 output the original text sentence into a plurality of small translation units for the original text and a designated translation word sequence number for the divided small translation unit.
  • the plurality of small translation units outputted from the small translation unit and the translation word order determining unit 120 and the translation word order designated for the divided small translation unit are input to the user interface unit 140 to display the display unit 141 to the user.
  • the translation word order determining unit 120 or the translation word order pattern generating unit 150 may receive a small translation unit and a translation word order as the input items, respectively, or generate a translation word order pattern.
  • the translated word order pattern generating unit 150 may store the translated word order pattern modified by the user in the translated word order pattern DB 160.
  • the small translation unit and the translation order determination unit 120 may mechanically divide one original sentence or the small translation unit into two small translation units in the mechanical division and word order determination unit 121.
  • the display unit 141 may display the determined small translation units and the order assigned to each of them.
  • the small translation units determined mechanically in the mechanical division and word order determination unit 121.
  • a small translation unit and a translation language sequence number input through the input unit 142 in preference to the sequence number assigned to each of them, as the small translation units for the original text and their respective translation language sequence.
  • the user can directly input the signal for determining the order number for each of the small translation units and the divided small translation units through the input unit 142, as well as the location information of the divided small translation units and the divided small translations.
  • the small translation unit and the translation word order determining unit 120 are determined in the display unit 141 whenever the small translation unit and the small translation unit are divided into two small translation units by the first signal or the second signal, and these.
  • the order assigned to each can be displayed to the user.
  • the first signal is mouse left click and the second signal is consecutive mouse left click and right click.
  • the plurality of small translation units output from the small translation unit and the translation word order determining unit 120 and the translation word order designated for each small translation unit are input to the user interface unit 140.
  • the small translation and the translation generation unit 130 may include a translation memory matching unit 131 and an automatic translation unit 132, and further include a sentence translation processing unit 133.
  • the translation memory matching unit 131 may search for the translation memory DB 170 and retrieve a translation memory matching the small translation units to generate a small translation.
  • the small translation units which failed to load the translation memory because there is no matching translation memory are automatically translated through the automatic translation unit 132 (see FIG. 3G).
  • the automatic translation unit 132 can use a commercially available automatic translation engine, such as an automatic translation engine of Google Inc.
  • the small translation and the translation generation unit 130 may apply the translation memory matching unit 131 and the automatic translation unit 132 overlapping.
  • the small translation and the translation generation unit 130 generate the small translations for all the small translation units through the automatic translation in the automatic translation unit 132 and then search for the translation memory matching unit 131 and match the translation memory or the small. If there is a translation memory, the small translations may be generated for the small translation unit in which the small translation memory exists in preference to the automatically translated small translations.
  • the small translation unit and the translation order determination, and the small translation and the translation sentence generation is divided into small translation units and the original text sentence once in the small translation unit and translation language order determination unit 120 Whenever the decision is made, small translations may be generated for the plurality of small translation units divided by the small translation and the translation generation unit 130, and the total translation sentences may be generated by sorting according to the translation order assigned to the small translation units. have.
  • the small translation unit and the translated sentence generation unit 130 After generating the small translation and generating the translated sentence, transmit the small translation units, the order of translation, the generated small translations, and the translated sentence to the user interface unit 140.
  • the display unit 141 of the interface unit 140 determines the small translation units and the order designated to each of them, and the small translations for each of the small translation units and the translation order assigned to each of the small translation units. Sort them as they are, and display the completed translated sentence to the user.
  • the user modifies and inputs the divided translation order and the translation order specified for the divided small translation unit into the input unit 142, the small translation unit and the translation order, the small translation unit. You can save the translations in descending order. Whenever the user divides the original text into small translation units and determines the translation order, the user can determine the further division of the small translations, checking the translated small translations and the translated sentences for each small translation unit.
  • the translation order matching unit 121 may search the translation order pattern DB 160 for each small translation unit, and the user may match the translated translation. According to the order of the translations, the small translations and the translations generated through the translation unit 130 and the translation sentences in which the small translations are sorted, are further divided, or no longer divided, the translated small translations and translations You can decide whether to use.
  • the sentence translation unit 133 may include the small translation sentences, which are translations of the small translation units, wherein the small translation units of the original text matching the translation sequence pattern of the translation sequence pattern DB 160 are included in the translation sequence pattern.
  • the translation sentence is processed by processing a search, a mother word, or a preposition translation in accordance with the translation order.
  • the sentence translation processing unit 133 may generate a survey, a mother and a preposition translation, etc. assigned to the divided sub-translation units when determining the translation word order according to the relationship between the pre-part and the post-part of the division pair of the translation order rule. have.
  • a main search may be created in the first sub-translation unit belonging to the noun that is a prepositional part, and a verb ending ending may be created in the later sub-translation unit.
  • a verb ending ending may be created in the later sub-translation unit.
  • it may be predetermined that adjectives or affiliates modify the nouns in reverse sequential order. You can create and specify a formula ending such as'.
  • the grammar translation unit 131 also automatically translates each of the small translation units, sorts them according to the translation order, and then converts the necessary qualifiers between the small translations generated by the automatic translation for each small translation unit. And modify and add spelling to complete the translation sentence.
  • the small translation and translation generation unit 130 outputs the small translations and the translated sentences for the small translation units.
  • the small translations and the translated sentences outputted from the small translation and the translation generation unit 120 are input to the user interface unit 140 and displayed to the user through the display unit 141, and the user is provided with respect to the plurality of small translation units.
  • the small translation units and the small translation sentences for the small translation units are input through the small translation memory generation unit 130. It includes and stores the translation memory including the translation order number between each small translation unit in the translation memory DB (170).
  • the correspondence of the small translations for each of the small translation units is indicated by the subunit correspondence indicators.
  • the sub-translation unit position information of the sub-unit correspondence indicators may indicate the location subdivided into sub-units in the original text of each of the sub-translation unit, and the sub-translational position information of the sub-unit correspondence indicators Each of the translations may indicate a location within the translation sentence.
  • the translation word order pattern generating unit 150 may also generate an extended translation word order pattern that may be extended based on the translation word order patterns stored in the translation word order pattern DB 160 and store the translation word order pattern DB 160.
  • the extended translation order pattern can be used to specify the division pattern and the translation order by matching the morphological sequences of the stemmed parts of the original sentence.
  • the expanded translation order pattern will be described with reference to FIGS. 4A to 4E.
  • the small translation and the translation generation unit 130 analyze the small translation memories that are pairs of the small translation unit and the small translation from the translation memory DB 170 to generate an extended translation memory and store the translation memory in the translation memory DB 170.
  • FIG. 2B shows one embodiment of a compound sentence division word order 6 according to the invention
  • FIG. 2C shows an embodiment of a sentence division word order rule 6 according to the present invention.
  • the original sentence is first divided into small translation units by first applying the compound sentence division word order first, and then the fragment division word order rule. It takes place in the order of applying. It is necessary to first divide the clauses and clauses connecting the clauses to divide the sentence into short sentences, and then split them for each of the short sentences. Because it is word order. Also, the division pairs of nouns and verbs are determined to be divided first among the short division word order rules. This is because the predicate in the short sentence has a translation order that is translated before the predicate.
  • the divisional word order rule includes a list of split pairs of pre-parts and post-parts that are divided into two sub-translational units in the morphological sequence of the original text.
  • the pre-partial part of speech is divided between the pre-partial and post-partial parts specified in the division rule, and the pre-partial part is the last part of the previous sub-translational unit among the two sub-translational units that are divided, and the post-partial unit is the second sub-translational unit of the divided sub-translational units.
  • the division word order rule is divided into two sub-translational units according to the pre-part and post-part-of-speech of division pairs. After the part-of-speech belongs to the information on the translation order between the small translation unit and may include the priority information between the pairs of division.
  • FIG. 2D shows the result 20 of stemming and tagging the part-of-speech for the original text of FIG. 2A.
  • the symbols included in the original sentence (left brackets, brackets, commas, and periods) are treated the same as parts of speech in morphological analysis and tagged as a part of speech.
  • '(' And ')' are part-of-speech tags in left and right parentheses and are marked with PL and PR, and commas and periods are tagged with punctuation and marked with SY.
  • the division word order decision rule includes a list for the partitioning pairs of the two specific parts of speech that are to be divided between the pre-part of speech and the post-part of speech.
  • Dividing the original text into a plurality of small translation units according to the division word order determination rule and determining the translation word order in each sub-translation unit may include morphological sequence of the original sentence if there is an arrangement of parts of speech matching the division pair of the division word order determination rule.
  • the morphological sequence of the parts of speech and the code constituting the original sentence morphological analysis and part-of-speech tagging through the morphological analysis and the part-of-speech tagging unit 110 are converted into a plurality of small translation units from the beginning to the end.
  • the divided two sub-translation groups according to the sequential translation order or the reverse translation order specified in the division pair of the division word order determination rule. Determine the order of translation of the above.
  • the morphological sequence of the original text matching the pairs of pre-part and post-partial parts included in the split-pair list is performed on the entire text until the text is split from the front to the end.
  • FIG. 2B includes a list of five partition pairs
  • FIG. 2C includes a list of fourteen partition pairs.
  • the division pairs not checked in the sequential order column indicate the reverse translation order.
  • the sequential order the sequential order between the divided back and forth sub-translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation order, before being divided. If the translation order number of a small translation unit other than the small translation unit has a sequence number equal to or greater than the increased sequence number, the translation sequence number assigned to the other translation units having the same or larger sequence number is 1; Determine the reverse order in descending order of the two sub-translational units that are divided, according to the reverse translation order.
  • 1 is determined by the incremented translation order and the original translation order, and are not divided subtotal translation units. If the translation sequence number of another small translation unit has a sequence equal to or greater than the increased sequence number, the translation sequence number assigned to the other translation units having the same or greater sequence number is increased by one.
  • 2B is an example of a compound sentence segmentation rule, and illustrates a translation order sequence number for a segmentation pair.
  • the conjunctions (AND / OR) can be internally programmed to represent only concatenations connecting clauses and clauses, rather than representing all equivalence conjunctions. That is, if the first AND / OR in the morphological sequence of the original sentence is found in the parts of speech before and after the AND / OR equivalence conjunction, the verb exists before the first AND / OR, and the next conjunction including the second AND / OR after the first AND / OR. Or it can be programmed to identify the conjunction 'AND / OR' only if a verb exists between the end of the original sentence.
  • the verb does not exist before AND / OR, or after the AND / OR until the next conjunction containing the next AND / OR, or until the end of the original sentence, it is not a conjunction (AND / OR) in the division word order rule.
  • the segmentation rule includes a segmentation pair of all parts of a part-and-or (AND / OR) and all parts of a part of a conjunction (AND / OR) and all parts of a part, the AND / OR linking clauses to clauses is split back and forth to form a subtranslation unit alone. Can be configured.
  • FIG. 2C is an example of a short division word order rule, and illustrates a translation word order number for a division pair.
  • the split word order rule of FIG. 2C is an embodiment of the short word order rule, and since the short word split rule may have a difference in translation word order according to the order of word order between them when there are split pairs for sequential translation and reverse translation, Priority can be set between the pairs of partitions. For example, noun-verb, past participle-verb, present participle-verb, and noun-pronoun give priority to division pairs when dividing the morphological sequence of the original sentence that has been parsed to prioritize the subject part and the narrative part of the original sentence. You can decide.
  • the unpaired partitioned pairs are sequentially matched from the beginning to the end of the morphological sequences of the stemmed parts of the original sentence, and divided according to the partitioned pairs in the partitioned word order, and the translation order is determined. If there is a partitioning pair with priority, the partitioning word order rule takes precedence over the sequential partitioning from the beginning to the end. For example, if priorities 1 and 2 are assigned to a conjunction-all part-noun and noun-verb, even if the division pair of the conjunction-all part-of-speech is located after the divisional pair of the noun-verb, the noun-all part-of-speech must first be divided and the nouns first.
  • the verb is split into the next, and the noun-verb split pair is split before the other split pairs in the split pair list.
  • the priority among divisional pairs of noun-verb, past participle-verb, present participle-verb, and noun-pronoun can be set to 1, 2, 3, or 4, and the rest of the other division pairs Partitioning is done according to the rules of division word order from beginning to end.
  • the pre- and post-part-of-speech parts of the divisional word order rule may vary according to the language, and the translator may adjust the sub-translational units that are split by adding or deleting the division pair to be divided into the division pair list.
  • the order of translations of the divided subtranslation units is determined to be sequentially increased, in which case the segmentation order is determined so that the conjunctive conjunction is a prepositional or postpartum.
  • the small translation of the small translation unit located at the rear of the small translation unit may have a translation order of the original translation of the small translation unit located in front of the divided translation unit. It means that it is determined to have an increased translation order by 1 in the translation order.
  • the divisional word order rule may be determined such that the post-partial speech is a dependent conjunction, and the order of translation word order of the divided sub-translation units may be determined to be reduced in reverse order. It is determined that the order of decreasing in reverse order is that the small translation unit located at the rear of the divided small translation unit has the translation order number originally possessed by the small translation unit, and the small translation unit located at the front of the small translation unit before the division It means that it is determined to have the translation order number increased by 1 more than the translation order number that the small translation unit has.
  • the nouns are, for example, prepositional parts, and are divided so that the latter part of the noun is included in the suffix of the preceding sub-translational unit. This is because by dividing on the basis of the noun, the translation order can be mechanically determined after the mechanical division by the division order rule.
  • the split word order rule can increment or decrement the list of split pairs as needed by the translator. As the list of split pairs in the split word order rule increases, the number of sub-translational units constituting the original sentence increases, so that the number of translations that need to be sorted in the order of translation increases, and the number of words of the small-translation units that need to be automatically translated decreases. This can be done, but the inspection time can be increased when the translator checks that the translation is done correctly. As the list of split pairs decreases, the number of small translation units decreases and the number of translation order assigned to it decreases.
  • 3A to 3H show the original text 1 of FIG. 2A in the mechanical segmentation and translation word order determining unit 122 in accordance with the division order rules of FIGS. 2B and 2C.
  • An embodiment of a series of processes of mechanically dividing by a program and then determining the order of translation between the divided small translation units and translating each small translation unit is shown.
  • a second operating BZ2 state can be a operating state at which a very small amount of fuel (in contrast to operating state BZ1) is metered, especiallly for catalyst heating.
  • the predetermined division word order rule between specific parts-of-speech may further include subordination order information specifying whether the front and rear sub-translation units to be divided for the divisional pair of the pre-partial and post-partial parts are in ascending translation order or descending translation order.
  • subordination order information specifying whether the front and rear sub-translation units to be divided for the divisional pair of the pre-partial and post-partial parts are in ascending translation order or descending translation order.
  • the translation order number between two divided sub-translation units is determined by the translation order number originally possessed by the original text or the sub-translation unit and the translation order number increased by 1 to the translation order.
  • the original sentence 1 before splitting in FIG. 3A splits between nouns-verbs according to the short segmentation ordering rule 6 of FIG. 3B to FIG.
  • the translation order number between (30, 31) is determined by the translation order number 1 that the original text originally had and the translation order number 2 increased by 1 to the translation order number.
  • the second small translation unit 31 is the translation order sequential number 2, but when the second small translation unit 31 is divided between all parts of speech and related companies according to the short sentence division order rule (6) (at which Is pre-determined to be analyzed as a related company in morphological analysis), and the translation order between the two small translation units 32 and 33 divided in FIG. 3C is the translation order originally possessed by the small translation unit 31 before the division. It is determined by the translation order number 3, which is increased by 1 to the translation number order 2 and the translation word order 2, and the translation word order of all the parts-related part division pairs in the division word order rule of FIG. 2C is determined to be the reverse order.
  • the later small translation unit 33 of the translation units 32, 33 is determined to be the translation order sequential 2, and the translation order of the preceding small translation unit 32 is 3.
  • the translation order number of the other small translation units other than the two sub-translation units that are divided has the same order as the increased translation order or the number greater than the increased translation order Increment the translation sequence number assigned to the other sub-translation units.
  • the third small translation unit 34 of FIG. 3D is divided by all the parts-sequence pairs of parentheses in the division word order rule, the third small translation unit 36 and the fourth small translation unit of FIG.
  • the translation order number of the small translation unit (36) and the fourth small translation unit (37) has (3) and the order number 2 and 1, which had the small translation unit (34) before the division, increased to 3 Since the order of division of all parts-left brackets in reverse division order is defined in reverse order, the translation order of the smallest translation unit (36) and the fourth small translation unit (37) is sequence number 3 and sequence number 2, respectively. Becomes At this time, the translation order number of the small translation unit 32 and the fourth small translation unit 35 rather than the small translation unit 34 divided in FIG. 3D before the division was 4 and 3, respectively. 3 small and 4 small translation units (36) and 4 small translation units (37) have the same or greater order than 3 of the small translation unit (36). (36, 37) becomes 4 and 5, respectively.
  • small translations 42 are generated by automatic translation.
  • the small translation 42 may be generated by searching for a translation memory matching the small translation units.
  • the small translation and the translation sentence generation are generated in the small translation and translation generation unit 130. Can be performed. The user may determine whether to divide additionally by looking at the translation sentences whenever the division into the small translation units of FIGS. 3B to 3H and the translation order are determined in each of the small translation units.
  • FIG. 3h shows the results of processing the translations of the translations of each of the small translations in the translation order through the translation translation processing unit 133.
  • the literary translation processing unit 133 adds the translated sentence specified in the small translation unit division pattern to the automatically translated small translation or the imported small translation when the translation is divided into small translation units by the small translation unit division pattern included in the translation order sequence pattern. Delete or modify duplicate search endings, prepositional translations, etc. in order of translation order.
  • 3i shows that after the original sentence (1) is applied to only the noun-verb and all parts-related verbs in the short segmentation order, the translation order is assigned to the small translations and the respective small translations, and then for each small translation unit.
  • 5A to 5N show a small sentence unit for the original sentence, while receiving a signal from the input unit with respect to the original sentence of FIG. 2A in the small translation unit and the translation word order determination unit 120, and dividing the original sentence into small translation units which are translation subunits.
  • One embodiment for determining the translation order of translation units is shown.
  • the input unit 142 included in the user interface unit 140 of FIG. 1 includes specific location information in the original sentence or sub-translational unit and is arranged in descending order before and after the sub-translational units divided based on the specific position.
  • a first signal indicating a sequential translation order, and a first order indicating a reverse translation order in descending order to the front and rear sub-translation units including specific position information in the original sentence or sub-translation unit and divided based on the specific position. 2 Receive a signal.
  • the input unit 142 receives a first signal specifying a specific position 51 and a sequential word order in the original sentence of FIG. 5A. Accordingly, as shown in FIG.
  • the first small translation unit 30 and the second small translation unit 31 are divided based on the specific position 51, and the translation order of 1 and 2 is determined by the sequential word order, respectively. do.
  • the input unit 142 may generate the information about the specific position 51 and the sequential order as, for example, 29-1, and transmit the information to the small translation unit and the translation word order determination unit 120.
  • 29 represents the 29th counted specific position 51 including a blank at the beginning of the original sentence
  • 1 represents a sequential word order. If '29 -2 ', 2 represents the reverse order.
  • the small translation unit and the translation order determination unit 120 of FIG. 1 receive the first signal or the second signal from the input unit, and divide the original text into a plurality of small translation units according to the first signal or the second signal. Then, the translation order of the divided small translation units is determined.
  • the small translation unit and the translation order determination unit 120 receive the first signal or the second signal from the input unit, the small translation unit and the translation word order determination unit 120 are already in the translation word order pattern matching unit 121 or the mechanical segmentation and translation order determination unit 122. If there are a plurality of small translation unit division patterns 300 and each of the plurality of small translation unit division patterns, the translation sequence number 5 is reset, and the original text is reset according to the first signal or the second signal.
  • the small translation unit and the translation word order determining unit 120 refer to the small translation unit or the original sentence to which the specific position 51, 52, 53, 54, 55 indicated by the first signal belongs according to the first signal based on the specific position.
  • two sub-translational units are divided and two sub-translational units are divided.
  • the sequential order between the divided front and rear sub-translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation language order.
  • the other having the same or larger order Increment the translation order assigned to the small translation units by one.
  • the sequential order (sequence 2 and 3) between the divided front and rear sub translation units 34 and 35 is the translation order originally possessed by the small translation unit 33 (see FIG. 5E) before being divided. It is determined by the order number (number 2) and the order number of translations (number 3) which is increased by 1 in the order of translation.
  • the translation order sequential number (sequence 3) of the small translation unit 32 other than the small translation unit 33 to which the specific position 53 indicated by the first signal belongs is the same as the increased order (sequence 3) ( 5F)
  • the translation sequence number (order 3) assigned to the other small translation units 32 having the same sequence number is increased by one (see sequence 4, FIG. 5G).
  • the small translation unit and the translation word order determination unit 120 transmit the front and rear small translation units divided into the display unit 141 and the order number specified in the front and rear small translation units.
  • the small translation unit or original sentence to which the specific position indicated by the second signal belongs is divided into two front and rear small translation units based on the specific position, and the reverse order of descending two divided translation units is divided. Determine the order.
  • the reverse order between the divided front and rear sub-translation units is determined by the translation order number which is increased by 1 to the translation order sequence originally possessed by the small translation unit and the translation order sequence originally possessed by the second signal, and indicated by the second signal.
  • Translation order assigned to other small translation units having the same or larger order if the translation sequence number of a small translation unit other than the small translation unit to which a specific position belongs is the same as or greater than the increased sequence number. Increases the sequence number by 1.
  • the reverse sequence number (sequence 3 and sequence 2) between the front and rear sub-translation units 32 and 37 divided in FIG. 5J is the translation sequence sequence number (sequence 2, 2) originally possessed by the small translation unit 34 before being split. 5H), the small translation unit (34) which is determined by the translation order number (number 3) and the original translation order number (number 2) having 1 increased, and to which the specific position 54 indicated by the second signal belongs.
  • the translation order sequence (order number 4, sequence number 3) of the other small translation units 32, 35 other than) is the same as the increased sequence number (order 3) (see FIG. 5I) or greater than the increased sequence number (order 3).
  • sequence number (4) increases the translation sequence number assigned to the other small translation units 32, 35 having the same or larger sequence number by one.
  • the reverse order number (order 5 and number 4) between the front and rear sub translation units 38, 39 divided in FIG. It is determined by the translation order number (number 5) and the original number of translation word number (number 4) that 1 is increased to the translation word sequence number (number 4, see FIG. 5K), and the specific position indicated by the second signal ( 55) if the translation sequence number (sequence 5) of the small translation unit 32 other than the small translation unit 35 to which it belongs is equal to the increased sequence number (sequence 5) (see FIG.
  • the same sequence (sequence 5) Increase the translation sequence number specified in the other small translation unit (32) having 1) by 1.
  • the small translation unit and the translation word order determination unit 120 transmit the front and rear small translation units divided into the display unit 141 and the order number specified in the front and rear small translation units.
  • the small translation and the translation generation unit 130 receive the small translation units divided by the small translation unit and the translation word order determination unit 120 and the translation order assigned to them to generate the small translations for each of the small translation units.
  • the small translation sentences are arranged to generate a translation sentence, and are transmitted to the display unit 141.
  • the sentence translation processing unit 133 included in the small translation and the translation generation unit 130 receives information on the morphological sequence of the split patterns divided by the small translation unit and the translation word order determination unit 120 and the translation word order. The transliteration of the small translations of the small translation units can be processed to complete the translation.
  • the user sees the order of the small translation units and the small translation units divided from the first signal or the second signal for the original text from the display unit 141 and through the input unit 142 the new first signal.
  • the second signal may be input. After inputting the first signal or the second signal for a specific position, the user determines the translation order and accordingly displays the result of the automatic translation of the displayed original text.
  • the first signal or the second signal may be additionally input, or a portion of a small translation or a translation sentence may be input through the direct input unit 141.
  • the display of the sub-translation units and the translation order assigned to each of them may be displayed in the original text as shown in Figs. 5A to 5N, and as shown in Figs. 3A to 3I, each sub-translation unit
  • the order numbers 5 assigned to each sub translation unit may be displayed in the display area of the sub translations.
  • 5B corresponds to FIG. 3B
  • FIG. 5D corresponds to FIG. 3C
  • FIG. 5G corresponds to FIG. 3D
  • FIG. 5J corresponds to FIG. 3E
  • FIG. 5M corresponds to FIG. 3F.
  • FIG. 5N shows the small translation unit and the translation generation unit 130 after the division into the small translation units and the translation order are determined in each of the small translation units in the small translation unit and the translation word order determination unit 120 in FIGS. 5A to 5M.
  • the translation memory DB 170 search or automatic translation, small translations for each small translation unit are generated, and small translations are sorted according to the order of translation.
  • Translation pattern Stored in DB The translation order pattern Used In small translation units Determination of segmentation and translation order and generation of translation order
  • 6A to 6H illustrate a series of processes for generating a translation sentence based on the stored translation order pattern after the original text of FIG. 2A is stored in the translation order pattern DB 160 as the translation order patterns of FIGS. 4A to 4D. .
  • FIG. 6A is a text sentence 601 to be newly translated
  • FIG. 6B is a result 650 of stemming and tagging the text sentence 601.
  • 6C illustrates a translation word order pattern 462 matching the morpheme sequence 651 which is a result of morphological analysis and tagging of the original sentence 601 by searching the translation word order pattern DB 160.
  • the morpheme sequence is the result of the morphological analysis of the original sentence 651 by morphological analysis and tagging.
  • FIG. 6D shows that one or more of the morphological sequences of the original sentence 601 match the beginning and end of each of the small translation unit division patterns of the searched translation order pattern 464 in the translation order pattern DB 160. .
  • FIG. 6E is a result of dividing the small translation unit 530 by the small translation unit division patterns of the matched translation word order pattern 462 or the translation word order pattern 464 and displaying the translation order order 5.
  • 6F is a result 642 of automatic translation of each of the small translation units 530 through the small translation and the translation generation unit 130.
  • FIG. 6G shows a result 643 obtained by adding a translation of the translation of the translation order-order patterns 462 and 464 by the translation-processing unit 133.
  • FIG. 6H illustrates the slang translations in consideration of a suffix translation such as a search, a ending or a prepositional translation, which is determined from a division pattern end of each sub-translation unit division patterns and a division pattern pair at the beginning of the division pattern by the sentence translation unit 133.
  • the processing result is shown. After this processing, the translated text of the original text is completed.
  • FIG. 7A illustrates a translation word order pattern data structure for dividing an original text into small translation units that are translation subunits and determining translation order of the small translation units used in the translation apparatus or program according to the embodiment of FIG. 1A.
  • 7B is an example of a method for generating or functions for generating the same, and FIG. 7B divides an original text into a small translation unit which is a translation subunit and is used in a translation apparatus or program according to the embodiment of the present invention of FIG. 1A.
  • An embodiment of a method of determining the translation order of translation units and generating a translation is provided. These methods are performed by computer executable instructions and the instructions are stored on a computer readable recording medium.
  • the method for generating a translation order pattern data structure used for translation in a translation apparatus for translating an original sentence sentence (1) into a translation sentence sentence (2) according to the present invention includes a plurality of small translation units from the first sentence to the end of the original sentence.
  • Tagging parts of speech (1010 and 2010), dividing the original text into small translation units and determining translation order of the small translation units (1020 and 2020), the divided small translation units and the determined small translation units Displaying the translation order to the user and receiving input from the user (1030 and 2030), the translation order determined by the small translation units and the small translation units displayed to the user, and the small translation units input by the user, and Comparing the translation order number assigned to the small translation units and generating a translation order pattern including the small translation units and the translation order number assigned to the small translation units. And a step (1050 and 2050) for storing a phase (1040 and 2040), and the translated word order pattern on a translated word order pattern DB (160).
  • step 1050 of storing the translation order pattern in the translation order pattern DB 160 when another original sentence is retrieved after the step 1050 of storing the translation order pattern in the translation order pattern DB 160, the series of steps are performed again from step 1000.
  • the translation order pattern stored in the translation order pattern DB 160 in step 1050 is then retrieved in step 1022 to determine the translation order pattern when the original text is translated.
  • the method or function of generating the translated sentence of FIG. 7B includes the small translation and the translated sentence generation step 2050 and the small translation memory and the translation memory DB 170 after determining and storing the translation word order pattern of FIG. 7A.
  • the method may further include storing 2060.
  • FIG. 7B when the original text is loaded after the small translation and the translation in the small translation memory and the translation memory DB 170 (2060), the series of steps are performed again.
  • the step 1020 and 2020 of dividing the original sentence into small translation units and determining the translation word order of the small translation units may be based on a morpheme sequence obtained by morphological analysis of the original sentence. (1022 and 2022) dividing the original text into small translation units and determining the translation language order of the small translation units according to a translation word order pattern that is matched in whole or in part with the morphological sequences of translation unit division patterns. Contains a predetermined division pair for dividing between specific parts-of-speech in a morphological sequence of one original sentence tagged, and the preceding and subsequent sub-translation units to be divided by the division pair are in ascending translation order or in descending order.
  • steps 1022 and 1024 may be performed by either of the steps or both.
  • steps 2022 and 2024 may be performed by either step or both.
  • FIG. 7C illustrates a method or function for dividing an original text into small translation units that are translation subunits according to signals input by a user, and determining the order of translation of the small translation units, and a method or function for translating using the same. to be.
  • the method or function for determining the translation order of the sub-translation units for the original text by dividing the original text into the sub-translational units which are the translation sub-units is to retrieve the original text and display it to the user (3000), and to stem the original text. And tagging the parts of speech in the analyzed morphemes (3010), including a specific position information in the original sentence, and indicating a sequential translation order in ascending order to the front and rear small translation units divided based on the specific position.
  • Receiving a first signal and a second signal indicating a reverse translation order in descending order to front and rear sub translation units including specific position information in the original sentence or sub translation unit and divided based on the specific position ( 3020) Receiving the first signal or the second signal from the input unit, and recovers the original text according to the first signal or the second signal. Division of the cows into the translation unit, and a step 3030 to determine the translation word order sequence number of the divided said small translation unit.
  • a method or function for generating a translation sequence pattern comprising a designated translation sequence number (5) is a method for determining the translation sequence of the small translation units for the original sentence while dividing the above original sentence into small translation units which are translation subunits.
  • the method may further include generating a translation word order pattern including a plurality of small translation unit division patterns 300 and a translation word sequence number 5 assigned to each of the plurality of small translation unit division patterns 300.
  • the plurality of small translation unit division patterns 300 and the plurality of small translation unit division patterns for dividing an original sentence into a plurality of small translation units from sentence to sentence through morphological analysis and part-of-speech tagging 300
  • Generating small translations and translations for each of the small translation units by generating the translation sentences by sorting the small translation sentences according to the translation order assigned to the small translation units (3040), original sentence, Displaying (3050) the small translation units, and a translation sentence sequence number assigned to each of the small translation units, a translation sentence to a user, and receiving an input from the user; And receiving the input from the user, and if the input is the first signal or the second signal, perform steps 3030, 3040, and 3050, and if the input is not the first signal or the second signal, the divided small translation units And a morpheme sequence obtained by stemming the original sentence into a plurality of small translation units from the beginning to the end of the original sentence converted through morphological analysis and part-of-speech tagging from the translation order assigned to the small translation units.
  • Generating a translation word order pattern including a plurality of small translation unit division patterns 300 including a part-of-speech and a translation word sequence number 5 assigned to each of the plurality
  • the original sentence in the translation apparatus for translating the original sentence 1 into the translation sentence 2, the original sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units is determined.
  • the method or function includes the step 3012 of dividing the original text into small translation units and determining the translation order of the small translation units between steps 3010 and 3020 of FIG. 7C, and displaying the determined translation order to the user.
  • Step 3018 may further include.
  • the step 3012 of dividing the original sentence into small translation units and determining the translation order of the small translation units may be performed by retrieving the translation order pattern DB (3014) and dividing the original sentence. And determining 3016 in accordance with the word order determination rule.
  • steps 3014 and 3016 may be performed in either or both of the steps. If the first signal or the second signal is received from an input unit after displaying the determined translation order to the user (3018), the original text is divided into small translation units and the translation order of the small translation units is determined. A plurality of small translation unit division patterns 300 determined in operation 3012 and a translation word order 5 assigned to each of the plurality of small translation unit division patterns are reset.
  • the small translations and the translation generation steps 2050 and 3040 may be omitted according to an embodiment.
  • the translation order pattern according to the present invention has been described using English as the original language and Korean as the target language.
  • the present invention is not limited to English and Korean, and may be utilized, for example, between other original languages and target languages between deadlocks, refractive words, and isolated words.
  • the translation word order pattern of the present invention can be applied to translation not only between Japanese and Chinese but also other languages such as German and Spanish.

Abstract

A translation order pattern data structure stored in a computer-readable storage medium is used for translation in a translation apparatus for translating a source language sentence into a target language sentence, and comprises: small translation unit segmentation pattern data for segmenting an entire source language sentence from beginning to end into a plurality of small translation units - wherein the small translation unit segmentation pattern data include one or more parts of speech in a morpheme string obtained by morpheme parsing of the source language sentence -; and translation order number data each being specified for each of the plurality of small translation unit segmentation pattern data.

Description

[규칙 제37.2조에 의해 ISA가 부여한 발명의 명칭] 원문의 번역어순을 결정하는 데이터 구조, 상기 구조를 생성하는 프로그램 및 이를 저장하는 컴퓨터 판독가능 저장매체[Name of Invention Granted by ISA pursuant to Article 37.2 of the Rule] A data structure for determining the order of translation of the original text, a program for generating the structure, and a computer-readable storage medium storing the same.
본 발명은 원문 언어 문장을 대상 언어 문장으로 번역할 때, 원문문장과 번역문장 사이의 번역 어순을 결정하는 것에 관한 것이다. 구체적으로, 원문문장을 분할하여 번역할 경우 번역이 일치하는 번역 소단위들인 소번역단위들로 분할하고, 소번역단위들 간의 번역 어순을 결정하는 번역어순패턴 데이터에 관한 것이다. 특히, 원문문장의 번역 소단위들을 구성하는 소번역단위들에 번역어순을 결정하는 번역어순패턴 데이터 구조, 및 이를 생성하는 컴퓨터 실행가능 명령어를 저장한 컴퓨터 판독가능 저장매체, 및 이러한 명령어들을 이용한 번역 프로그램에 관한 것이다. The present invention relates to determining a translation word order between an original sentence and a translated sentence when the original language sentence is translated into a target language sentence. Specifically, when the original text is divided and translated, the present invention relates to translation word order pattern data that is divided into small translation units that are translation subunits that match the translation, and determines the translation word order between the small translation units. In particular, a translation order pattern data structure for determining a translation order in small translation units constituting translation subunits of an original sentence, a computer readable storage medium storing computer executable instructions for generating the translation sequence, and a translation program using the instructions It is about.
근래에 통계 번역 및 머신 러닝을 이용하여 원문언어를 대상언어로 자동번역하는 기술이 크게 각광받고 있다. 그러나, 교착어와 고립어, 또는 교착어와 굴절어처럼 원문 언어와 대상 언어가 번역 어순이 상이한 언어 간의 자동번역은 단문과 짧은 문장일 때 잘되지만, 복문과 같이 긴 문장일 때 번역 정확도가 떨어지며, 특히 and/or 접속사가 문장에 존재하면 번역 정확도가 더 떨어진다. Recently, the technology of automatically translating the original language into the target language by using statistical translation and machine learning has gained much attention. However, automatic translation between a source language and a target language where the source language and the target language are different in translation order, such as deadlock and isolation, or deadlock and inflection, works well in short and short sentences, but in the case of long sentences such as compound sentences, the translation accuracy is poor, and // If the or conjunction is present in a sentence, the translation accuracy is lower.
또한, 번역가는 원문문장을 동일한 의미를 전달하는 여러 개의 번역문장으로 번역할 수 있는 반면, 기존의 자동번역은 동일한 문장에 대해 원문문장을 이해할 수 있는 보편적인 하나의 번역 결과만을 출력하기 때문에, 번역가, 특히 전문적인 번역가의 입장에서 자동번역 과정에서 자신의 번역 개성을 반영시키면서 자동번역할 수 없는 문제점을 가지고 있다. In addition, the translator can translate the original text into multiple translations that convey the same meaning, whereas the existing automatic translation outputs only one universal translation result that can understand the original text for the same sentence. In particular, a professional translator has a problem that cannot be automatically translated while reflecting his / her translation personality in the process of automatic translation.
본 발명은 원문 언어로 된 원문문장에서 대상 언어로 된 번역문장으로 번역하기 전에 원문 언어를 형태소 분석하고 원문문장을 번역 소단위인 소번역단위들로 분할하여 번역 어순을 미리 결정해 줌으로써 번역 전에 사용자에게 미리 정해진 번역어순을 제공하는데 목적이 있다. The present invention prior to translation by analyzing the original language stemming from the original sentence in the original language to the translation sentence in the target language and by dividing the original sentence into small translation units which are translation subunits to determine the translation word order in advance. The purpose is to provide a predetermined translation order.
일본어와 한국어와 동일한 계통의 언어 간의 번역은 번역 어순이 유사하기 때문에, 자동번역 정확도가 매우 높다. 그러나, 영어와 한국어, 중국어와 한국어와 같이 이종 계통의 언어 간의 번역은 번역 어순이 상이하기 때문에, 자동번역의 정확도가 상대적으로 낮다. 본 발명은 번역어순이 상이한 언어 간의 자동번역의 정확도를 높이기 위해, 일본어와 한국어와 같은 동종 계통의 언어 간에 자동번역할 경우의 수준까지 자동번역의 정확도를 높일 수 있도록 번역 어순을 결정해 주는데 목적이 있다. Since the translation order is similar between Japanese and Korean, the automatic translation accuracy is very high. However, the translation between heterogeneous languages such as English and Korean, Chinese and Korean differs in the order of translation, so that the accuracy of automatic translation is relatively low. The purpose of the present invention is to determine the translation order in order to increase the accuracy of automatic translation up to the level of automatic translation between languages of the same system such as Japanese and Korean, in order to increase the accuracy of automatic translation between languages with different translation order. have.
본 발명의 일 양태에 따른 원문문장을 번역문장으로 번역하기 위한 번역 장치에서 번역을 위해 이용되는, 컴퓨터 판독 가능 저장 매체에 저장된 번역어순패턴 데이터 구조는 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 소번역단위 분할패턴 데이터들 - 상기 소번역단위 분할패턴 데이터들은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -, 및 복수의 소번역단위 분할패턴 데이터들 각각에 지정된(specify) 번역어순 순번 데이터들을 포함한다. A translation word order pattern data structure stored in a computer-readable storage medium, which is used for translation in a translation apparatus for translating an original sentence into a translated sentence according to an aspect of the present invention, includes a plurality of small translation units from the first sentence to the end of the original sentence. Sub-translational unit division pattern data for dividing into sub-translation units, wherein the sub-translational unit division pattern data includes one or more parts of morphemes obtained by morphological analysis of the original sentence, and each of the plurality of sub-translation unit division pattern data. Contains the specified translation sequence number data.
본 발명의 일 양태에서, 컴퓨터 판독 가능 저장 매체에 저장된 번역어순패턴 데이터 구조는 소번역단위 분할패턴 데이터들에 의해 원문문장으로부터 분할된 소번역단위들을 번역하고, 번역어순 순번 데이터들에 따라 상기 소번역단위들 각각의 번역문인 소번역문들을 정렬하여서 원문문장의 번역문장을 출력한 출력 데이터를 더 포함한다. In one aspect of the present invention, a translation order pattern data structure stored in a computer-readable storage medium translates small translation units divided from original text by small translation unit division pattern data, and the small translation units are translated according to translation order sequence data. It further includes output data outputting the translated text of the original text by sorting the small translations of each translation unit.
본 발명의 일 양태에서, 소번역단위 분할패턴 데이터들 각각은 각 소번역단위로의 분할의 시작을 지시하는 분할패턴 시작부, 중간부 및 각 소번역단위로의 분할의 종료를 지시하는 분할패턴 종료부로 구성되고, 소번역단위 분할패턴 시작부는 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 하나 이상의 품사를 포함하고, 분할패턴 종료부는 소번역단위 분할패턴 데이터의 형태소열 중 마지막에 위치된 하나 이상의 품사를 포함하고, 상기 중간부가 생략될 수 있다.In one aspect of the present invention, each of the small translation unit division pattern data includes a division pattern instructing the start of division into each small translation unit, a division part indicating the beginning of the division, and an end of division into each small translation unit. A small translation unit segmentation pattern beginning portion comprising one or more parts of speech located at the beginning of the morphological sequences of the small translation unit segmentation pattern data, and a segmentation pattern ending portion being positioned last in the morphological sequence of the small translation unit segmentation pattern data It may include one or more parts of speech, and the middle portion may be omitted.
본 발명의 일 양태에서, 소번역단위 분할패턴 시작부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 품사로 구성되고, 분할패턴 종료부는 소번역단위 분할패턴 데이터의 형태소열 중 마지막에 위치된 품사로 구성되고, 상기 중간부가 생략될 수 있다.In one aspect of the present invention, the small translation unit division pattern start part is composed of parts of speech located at the front of the morphological sequences of the small translation unit division pattern data, and the division pattern end part is the last of the morphological sequences of the small translation unit division pattern data. Consists of positioned parts of speech, and the intermediate portion may be omitted.
본 발명의 일 양태에서, 소번역단위 분할패턴 시작부는 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 하나 또는 두개로 구성되고, 분할패턴 종료부는 소번역단위 분할패턴 데이터의 형태소열 중 마지막 명사로부터 소번역단위 분할패턴 데이터의 형태소열의 끝까지의 품사로 구성되고, 중간부가 생략될 수 있다. In one aspect of the present invention, the beginning of the sub-translational unit division pattern consists of one or two of the morphological sequences of the sub-translational unit division pattern data, and the end of the division pattern is the last of the morphological sequences of the sub-translation unit division pattern data. It consists of parts of speech from the noun to the end of the morphological sequence of the sub-translational unit division pattern data, and the middle part may be omitted.
본 발명의 일 양태에서, 소번역단위 분할패턴 데이터들에 의해 분할될 소번역단위의 소번역문의 문미에 위치될 조사, 어미 또는 전치사 대역어를 포함하는 문미 번역어가 상기 복수의 소번역단위 분할패턴 데이터들 중 하나 이상에 지정된다. In an aspect of the present invention, a sentence translation including a survey, ending, or prepositional band word to be positioned at the end of a small translation unit of a small translation unit to be divided by the small translation unit division pattern data includes the plurality of small translation unit division pattern data. Specified in one or more of these.
본 발명의 일 양태에 따른 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하고 상기 소번역단위들로부터 소번역단위 분할패턴 데이터들(300)을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 생성하는 단계;를 포함한다. The original text sentence (1) according to an aspect of the present invention is used for translation in a translation apparatus for translating the translated text sentence (2), and the original text sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units A computer readable storage medium having stored thereon computer executable instructions for generating a translation order pattern data structure for determining a structure, the computer executable instructions comprising dividing an original sentence into a plurality of small translation units from the beginning to the end and Generating small translation unit division pattern data (300) from translation units, wherein the small translation unit division pattern data (300) includes at least one part-of-speech (part of speech) morphologically analyzed from the original text; And generating translation word order data (5) designated in each of the plurality of small translation unit division pattern data (300).
본 발명의 일 양태에서, 번역어순패턴 데이터 구조를 생성하기 위한 명령어들을 포함하는 컴퓨터 판독 가능한 저장 매체로서 상기 명령어들은 소번역단위 분할패턴 데이터들에 의해 원문문장으로부터 분할된 소번역단위들을 번역하고, 번역어순 순번 데이터들에 따라 상기 소번역단위들 각각의 번역문인 소번역문들을 정렬하여서 상기 원문문장의 번역문장을 출력하는 단계를 더 포함한다. In an aspect of the present invention, a computer-readable storage medium including instructions for generating a translation order pattern data structure, the instructions translate small translation units divided from original text by small translation unit division pattern data, And sorting the small translations, which are translations of each of the small translation units, according to the translation order number data, and outputting a translation sentence of the original sentence.
본 발명의 일 양태에 따른 원문문장을 번역문장으로 번역하기 위한 번역 장치에서 번역을 위해 이용되는, 컴퓨터 판독 가능 저장 매체에 저장된 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 소번역단위 분할패턴 데이터들을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -, 및 복수의 소번역단위 분할패턴 데이터들 각각에 지정된 번역어순 순번 데이터들를 생성하는 단계를 수행한다. Computer readable storage of computer executable instructions for generating a translation order pattern data structure stored in a computer readable storage medium used for translation in a translation apparatus for translating original text into translated text according to an aspect of the present invention. Generating, by the computer executable instructions, small translation unit division pattern data for dividing the original sentence into a plurality of small translation units from the beginning to the end; the small translation unit division pattern data stemming the original sentence. Generating at least one part-of-speech of the analyzed morpheme sequences; and generating translation order sequence data specified in each of the plurality of sub-translational unit division pattern data.
본 발명의 일 양태에 따른 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며, 원문문장을 번역 소단위들로 분할하고 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 a) 원문문장을 불러오는 단계; b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; 및 c) 상기 번역어순패턴 데이터 구조를 포함하는 번역어순패턴 DB(160)에 저장된 번역어순패턴의 소번역단위 분할패턴들의 형태소열들과 원문문장의 형태소열을 비교하여 매칭되는 번역어순패턴을 불러오고 상기 불러온 번역어순패턴의 소번역단위 분할패턴에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계;를 수행한다. Used for translation in a translation apparatus for translating an original sentence (1) into a translation sentence (2) according to an aspect of the present invention, for dividing the original sentence into translation subunits and for determining the translation order of the small translation units A computer readable storage medium having stored thereon computer executable instructions for generating a translation pattern data structure, the computer executable instructions comprising: a) retrieving an original text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm; And c) comparing the morphological sequences of the small translation unit segmentation patterns of the translated word order pattern stored in the translated word order pattern DB 160 including the translated word order pattern data structure and the morphological sequences of the original sentence to retrieve a matching word order pattern. And dividing the original text into small translation units according to the small translation unit division pattern of the imported translation word order pattern and determining the translation word order of the small translation units.
본 발명의 일 양태에 따른 원문문장을 번역문장으로 번역하기 위한 번역 장치에서 이용되는, 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 a) 원문문장을 불러오는 단계; b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; 및 c) 미리 결정된 특정 품사들 사이를 분할하는 분할 쌍을 포함하는 분할 어순 결정 규칙으로서, 상기 분할 쌍에 의해 분할될 앞뒤의 소번역단위들이 올림 차순의 순차 번역어순인지 또는 내림 차순의 역순 번역어순인지를 지정한, 분할 어순 결정 규칙에 따라 상기 형태소 분석된 원문문장을 복수의 소번역단위들로 분할하고 상기 소번역단위들에 번역어순을 결정하는 단계; d) 상기 결정된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 사용자에게 표시하고 사용자로부터 입력을 수신하는 단계; 및 e) 사용자로부터 입력을 수신하여 사용자에게 표시된 소번역단위들에 지정된 번역어순 순번과 비교하고, 사용자에게 표시한 소번역단위들에 지정된 번역어순 순번이 수정되었다면 사용자가 입력한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴 데이터들 및 상기 복수의 소번역단위 분할패턴 데이터들 각각에 지정된 번역어순 순번을 포함하는 번역어순패턴 데이터 구조를 생성하고 번역어순패턴 DB에 저장하고, 수정되지 않았다면 사용자에게 표시한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 각각 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴 데이터들으로서, 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴 데이터들, 및 상기 복수의 소번역단위 분할패턴 데이터들 각각에 지정된 번역어순 순번을 포함하는 번역어순패턴 데이터 구조를 생성하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 수행한다. Computer executable instructions for dividing the original text into small translation units, which are translation subunits, used in a translation apparatus for translating the original text sentence into a translation sentence, and for determining the translation order of the small translation units according to an aspect of the present invention. Computer readable storage medium having stored thereon the computer executable instructions comprising: a) retrieving a text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm; And c) a division word order division rule for dividing between predetermined predetermined parts-of-speech, wherein the preceding and subsequent sub-translation units to be divided by the division pair are in ascending or descending translation order. Dividing the morphologically analyzed original sentence into a plurality of small translation units and determining a translation word order in the small translation units according to a division word order determination rule specifying recognition; d) displaying the determined small translation units and the translation order assigned to the small translation units to the user and receiving an input from the user; And e) receive input from the user and compare it with the translation order assigned to the small translation units displayed to the user, and if the translation order assigned to the small translation units displayed to the user has been modified, A translation including a plurality of small translation unit division pattern data converted through morphological analysis and part-of-speech tagging from the translation language sequence number assigned to the small translation units, and a translation language sequence number assigned to each of the plurality of small translation unit division pattern data A plurality of transformed words are generated through morphological analysis and part-of-speech tagging by generating word order data structures and storing them in a translation word order pattern DB, and if they have not been modified, from small translation units displayed to the user and translation word sequence numbers assigned to the small translation units. Small translation unit segmentation pattern data. A translation order sequence data structure that generates a plurality of small translation unit division pattern data including one or more parts of speech, and a translation sequence sequence data structure including a translation sequence number assigned to each of the plurality of small translation unit division pattern data. To generate;
본 발명의 일 양태에 따른 원문문장을 번역문장으로 번역하기 위한 번역 장치에서 번역을 위해 이용되는, 사용자 신호에 따라 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 a) 원문문장을 불러오고 사용자에게 표시하는 단계; b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; c) 상기 원문문장 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는 단계; d) 상기 제 1 신호 또는 상기 제 2 신호에 따라, 상기 원문문장을 복수의 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계; 및 e) 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 각각 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴 데이터들로서, 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴 데이터들, 및 상기 복수의 소번역단위 분할패턴 데이터들 각각에 지정된 번역어순 순번을 포함하는 번역어순패턴 데이터 구조를 생성하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 포함한다. According to a user signal, the original text is divided into small translation units which are translation subunits, and the translation order of the small translation units is used for translation in the translation apparatus for translating the original text sentence into a translation sentence according to an aspect of the present invention. A computer readable storage medium having stored computer executable instructions for determining, the computer executable instructions comprising: a) retrieving a text sentence and displaying it to a user; b) stemming the original text and tagging parts of speech into the analyzed morphemes, wherein stemming the original text and tagging the parts of speech comprises stemming the original code and tagging the original code; c) a first signal including the specific position information in the original sentence and indicating a sequential translation order in ascending order to the front and rear sub-translation units divided based on the specific position, and the specific in the original sentence or sub-translation unit Receiving a second signal including position information and indicating a reverse translation order in descending order to front and rear sub-translation units divided based on the specific location; d) dividing the original text into a plurality of small translation units according to the first signal or the second signal and determining a translation word order of the small translation units; And e) a plurality of sub-translational unit division pattern data converted through the morphological analysis and the part-of-speech tagging from the divided sub-translation units and the translation word order assigned to the sub-translation units, respectively, wherein the morphological sequence of the original sentence is morphologically analyzed. A translation word order pattern data structure for generating a plurality of small translation unit division pattern data including at least one part of speech, and a translation order sequence data structure including a translation order number assigned to each of the plurality of small translation unit division pattern data Generating a structure.
본 발명의 일 양태에서, 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계는 상기 제 1 신호에 따라 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 올림차순의 순차 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키고, 상기 제 2 신호에 따라 상기 제 2 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키는 단계를 포함한다. In an aspect of the present disclosure, the step of dividing the original text into small translation units and determining the translation order of the small translation units may include: a small translation unit to which a specific position indicated by the first signal belongs according to the first signal; Alternatively, the original sentence is divided into two front and rear sub-translation units based on the specific position, and the sequential order of ascending order is determined in the two sub-translation units divided, and the sequential order between the divided front and rear sub-translation units is divided. It is determined by the translation order number originally possessed by the small translation unit and the translation order number by which 1 is increased to the translation language sequence, and other small translation unit other than the small translation unit to which the specific position indicated by the first signal belongs. Another translation unit having the same or greater order if the translation order is the same or greater than the increased order Increase the number of translation order specified in the field by 1, and convert the sub-translation unit or original sentence belonging to a specific position indicated by the second signal according to the second signal into two front and rear sub-translation units based on the specific position. Determine the reverse order of the descending order of the two sub-translational units that are divided, and the reverse order between the divided front and rear sub-translational units is increased by 1 to the translation order originally possessed by the small translation unit before the division. The translation order number of the small translation unit other than the small translation unit to which the specific position indicated by the first signal belongs is determined by the word order number and the translation order number originally possessed, and is equal to or greater than the increased order number. Increasing the translation order assigned to other small translation units having the same or larger order by 1 It should.
본 발명의 일 양태에 따른 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 a) 원문문장을 불러오는 단계; b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; 및 c) 상기 번역어순패턴 데이터 구조를 포함하는 번역어순패턴 DB에 저장된 번역어순패턴 데이터의 소번역단위 분할패턴 데이터들의 형태소열들과 원문문장의 형태소열을 비교하여 매칭되는 번역어순패턴 데이터를 불러오고 상기 불러온 번역어순패턴 데이터의 소번역단위 분할패턴 데이터에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계; d) 상기 원문문장, 상기 소번역단위들, 및 상기 소번역단위들의 번역어순 순번을 사용자에게 표시하는 단계; e) 상기 원문문장 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는 단계; 및 f) 상기 원문문장 또는 상기 c)에 의해 결정된 소번역단위들을 상기 제 1 신호 또는 상기 제 2 신호에 따라 복수의 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계;를 포함한다. A computer-readable storage medium storing computer executable instructions for dividing an original text into sub-translation units that are translation sub-units and determining translation order of the sub-translation units, according to an aspect of the present invention. Heard a) retrieving the original text; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm; And c) comparing the morphological sequences of the sub-translational unit division pattern data of the translated word-order pattern data stored in the translated word-order pattern DB including the translated word-order pattern data structure to retrieve matching word-order pattern data. Dividing the original text into small translation units according to the small translation unit division pattern data of the imported translation word order pattern data and determining the translation word order of the small translation units; d) displaying the original sentence, the small translation units, and the translation order of the small translation units to a user; e) a first signal which includes the specific position information in the original text and indicates the sequential translation order in ascending order to the front and rear sub-translation units divided based on the specific position, and the specific in the original sentence or sub-translation unit Receiving a second signal including position information and indicating a reverse translation order in descending order to front and rear sub-translation units divided based on the specific location; And f) dividing the small translation units determined by the original sentence or c) into a plurality of small translation units according to the first signal or the second signal and determining a translation word order of the small translation units; It includes.
본 발명의 일 양태에 따른 번역 프로그램은 전술된 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체에 저장된 명령어들을 이용하여 번역을 수행한다. A translation program according to an aspect of the present invention is stored in a computer-readable storage medium that divides the above original text into small translation units which are translation subunits and stores computer executable instructions for determining the translation order of the small translation units. Perform the translation using the commands.
본 발명의 일 양태에 따른 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체를 다운로드 가능한 형태로 배포하는 서버로서, 상기 컴퓨터 실행가능 명령어들은, 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하고 상기 소번역단위들로부터 소번역단위 분할패턴 데이터들(300)을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및 상기 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 생성하는 단계;를 수행한다. The original text sentence (1) according to an aspect of the present invention is used for translation in a translation apparatus for translating the translated text sentence (2), and the original text sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units A server for distributing a computer readable storage medium storing computer executable instructions for generating a translation pattern pattern data structure for download in a downloadable form, wherein the computer executable instructions include: Dividing into small translation units and generating small translation unit split pattern data 300 from the small translation units-wherein the small translation unit split pattern data 300 is one of the morphemes obtained by stemming the original sentence. Includes more than part of speech; And generating translation word order data (5) designated in each of the plurality of small translation unit division pattern data (300).
번역어순패턴 데이터는 동종 계통언어들(예, 한국어 및 일본어) 간의 번역어순 뿐만 아니라, 굴절어, 교착어 및 독립어와 같이 서로 어순이 다른 이종 계통언어 간에 번역어순을 결정해 줄 수 있다. 본 발명의 번역어순패턴 데이터는 영어와 한국어, 영어와 일본어와 같이, 이종 계통의 언어에 대해 마치 동일언어의 어순처럼 처리해주는 역활을 하기 때문에, 이종 계통 언어들 간의 자동 번역의 정확성을 동일 계통 언어의 자동 번역 정확도까지 끌어 올릴 수 있는 효과가 있다. The translation order pattern data may determine the translation order between heterogeneous languages, such as refractive art, interlocking words, and independent languages, as well as translation order between homogeneous languages (eg, Korean and Japanese). Since the translation order pattern data of the present invention plays a role of treating the languages of heterogeneous lines like the word order of the same language, such as English and Korean, English and Japanese, the accuracy of automatic translation between heterogeneous line languages is the same. It can increase the accuracy of automatic translation.
본 발명은 또한 번역가가 원문문장을 번역 소단위인 소번역단위들로 분할하고 이들 소번역단위들에 번역어순 순번을 결정한 경우 이를 번역어순패턴 데이터 구조로 저장하여 사용자가 저장한 번역어순을 차후 번역에서 이용할 수 있다. 또한, 번역 숙련자 등 다른 사람이 축적한 번역어순 DB를 이용하여 번역 초보자가 번역 전에 소번역단위로 분할된 번역어순에 따라 번역문장을 완성할 수 있다.According to the present invention, when the translator divides the original text into sub-translational units which are translation subunits and determines the translation sequence order in these sub-translation units, the translator stores the translation sequence stored by the user in a later translation. It is available. In addition, a translation beginner may complete a translation sentence according to a translation word order divided into small translation units before translation using a translation word order DB accumulated by another person such as a translation expert.
본 발명에서 번역어순패턴 데이터 구조는 소번역단위 분할패턴 데이터 및 번역어순 순번 데이터를 포함하고, 소번역단위 분할패턴 데이터 및 번역어순 순번 데이터는 번역어순 데이터 구조에서 각각 소번역단위 분할패턴에 대한 정보를 갖는 데이터 및 번역어순 순번에 대한 정보를 갖는 데이터를 의미하는 용어이다. 이하, "번역어순패턴 데이터", "소번역단위 분할패턴 데이터" 및 "번역어순 순번 데이터"는 본 명세서에서 문맥에 따라, "번역어순패턴", "소번역단위 분할패턴", 및 "어순"과 혼용하여 사용된다. In the present invention, the translation order pattern data structure includes the small translation unit division pattern data and the translation order sequence data, and the small translation unit division pattern data and the translation order sequence data are information on each small translation unit division pattern in the translation order structure. The term refers to data having data and data having information on a sequence number of a translation. Hereinafter, "translation order sequence data", "small translation unit division pattern data" and "translation order sequence data" are referred to herein as "translation order sequence pattern", "small translation unit division pattern", and "word order". Used in combination with
또한, 본 발명의 번역어순패턴은 원문문장을 번역 소단위인 소번역단위들로 분할하는 분할패턴들에 번역어순 순번을 지정한 것이기 때문에, 대상언어에 상관없이 이용가능하다. 예를 들어, 원문 언어가 영어이고 대상 언어가 한국어인 경우 영어 원문문장을 분석하여 원문인 영어에 대한 한국어 번역어순을 지정한 것이므로, 한국어와 번역 어순이 동일한 일본어 등 동종 계통 언어에 번역문이 없는 상태에서도 적용할 수 있다. 즉, 영어를 한국어로 번역할 때 얻어진 번역어순패턴을 영어로부터 일본어로의 번역에서 그대로 이용할 수 있다.In addition, the translation word order pattern of the present invention can be used regardless of the target language because the translation order is assigned to the division patterns for dividing the original text into small translation units which are translation subunits. For example, if the original language is English and the target language is Korean, the original English sentence is analyzed and the Korean translation order for the original English is specified. Applicable That is, the translation order pattern obtained when translating English into Korean can be used as it is in translation from English to Japanese.
다수의 다른 양태는 이들 및 본 발명의 다른 실시예들에 따라 제공된다. 본 발명의 실시예들의 다른 특징 및 양태는 다음의 상세한 설명, 첨부된 청구범위, 및 첨부 도면으로부터 더욱 완전히 명백해질 것이다. Many other aspects are provided in accordance with these and other embodiments of the present invention. Other features and aspects of embodiments of the present invention will become more fully apparent from the following detailed description, the appended claims, and the accompanying drawings.
도 1a는 본 발명의 실시예에 따라 원문을 번역문으로 번역하는 번역 장치에서 포함된 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체, 프로그램, 방법 또는 장치에 대한 블록도이다. FIG. 1A is a diagram illustrating an example of dividing an original text included in a translation apparatus for translating an original text into a translation into small translation units, which are translation subunits, and storing computer executable instructions for determining the translation order of the small translation units, according to an embodiment of the present invention. A block diagram of a computer readable storage medium, program, method or apparatus.
도 1b는 도 1a의 소번역단위 및 번역어순 결정유닛에 대한 확대도이다. FIG. 1B is an enlarged view of a small translation unit and a translation word order determining unit of FIG. 1A.
도 1c는 도 1a의 소번역문 및 번역문 생성유닛에 대한 확대도이다. 1C is an enlarged view of a small translation and a translation generation unit of FIG. 1A.
도 1d는 도 1a의 사용자 인터페이스유닛에 대한 확대도이다. FIG. 1D is an enlarged view of the user interface unit of FIG. 1A.
도 2a는 본 발명의 실시예들을 위해 사용되는 원문문장과 번역문장을 도시한다. 2A illustrates the original and translated sentences used for embodiments of the present invention.
도 2b는 본 발명에 따른 복문 분할 어순 규칙의 일 실시예를 도시한다. 2B illustrates one embodiment of a compound sentence division word order according to the present invention.
도 2c는 본 발명에 따른 단문 분할 어순 규칙의 일 실시예를 도시한다. 2C illustrates one embodiment of a short division word order rule in accordance with the present invention.
도 2d는 도 2a의 원문을 형태소 분석하고 품사를 태깅한 실시예를 도시한다. FIG. 2D illustrates an embodiment morphologically analyzing the original text of FIG. 2A and tagging the part-of-speech.
도 3a 내지 3h는 분할 어순 규칙에 따라 도 2a의 원문문장을 복수의 소번역단위들로 분할하고 각 소번역단위들에 번역어순 순번을 결정한 후 소번역문을 생성하여 번역문을 생성하는 일련의 과정에 대한 실시예를 도시한다. 3A to 3H illustrate a process of dividing the original text of FIG. 2A into a plurality of small translation units according to a division word order rule, determining a translation order number for each small translation unit, and then generating a small translation. An example is shown.
도 3i는 도 3c에서 원문문장을 3개의 소번역단위들로 분할하고 번역어순 순번을 결정한 후 각 소번역단위들에 대한 소번역문을 번역하여 번역문을 완성한 실시예를 도시한다. FIG. 3I illustrates an embodiment of dividing an original sentence into three small translation units in FIG. 3C, determining a translation order, and translating a small translation for each small translation unit to complete a translation.
도 4a은 본 발명에 따른 번역어순패턴 데이터의 일 실시예을 도시한다. 4A illustrates an embodiment of translation word pattern data according to the present invention.
도 4b는 본 발명의 실시예에 따른 번역어순패턴 데이터 및 번역어순패턴 데이터를 기초로 분할패턴 중간부를 생략한 확장된 번역어순패턴 데이터의 실시예들이다. 4B illustrates exemplary embodiments of expanded word order pattern data in which a middle portion of a split pattern is omitted based on the word order pattern data and the word order pattern data according to an embodiment of the present invention.
도 4c는 분할패턴 데이터들 및 이에 지정된 순번 데이터들을 포함하는 번역어순패턴 데이터가 문미번역어를 더 포함하는 실시예을 도시한다. FIG. 4C illustrates an embodiment in which the translation order pattern data including the split pattern data and the order data specified therein further includes a sentence.
도 4d는 본 발명의 실시예에 따라 도 3b 내지 도 3f의 소번역단위들로의 분할 후 생성되는 번역어순패턴 데이터들의 실시예들을 도시한다. FIG. 4D illustrates embodiments of translation word pattern data generated after division into small translation units of FIGS. 3B to 3F according to an embodiment of the present invention.
도 4e는 본 발명의 실시예에 따라 도 4a의 번역어순패턴으로부터 확장가능한 번역어순패턴 데이터들의 실시예들을 도시한다. FIG. 4E illustrates embodiments of translation order pattern data extendable from the translation order pattern of FIG. 4A in accordance with an embodiment of the present invention.
도 5a 내지 도 5m은 본 발명의 실시예에 따라 소번역단위 및 번역어순 결정 유닛에서 원문에 대해 소번역단위들로 분할하고 각 소번역단위들에 번역어순 순번을 결정한 것을 도시한다. 5A to 5M illustrate that the translation order is divided into small translation units for the original text in the small translation unit and the translation order determination unit, and the translation order is determined for each small translation unit according to an embodiment of the present invention.
도 5n은 본 발명의 일 실시예에 따라 원문에 대해 소번역단위들로 분할하고 각 소번역단위들에 번역어순 순번을 결정한 후 소번역문 및 번역문 생성 유닛에서 자동번역을 통해 번역문장이 생성한 것을 도시한다. Figure 5n is divided into small translation units for the original text according to an embodiment of the present invention after determining the translation order number for each small translation unit after the translation is generated by the automatic translation in the small translation and translation generation unit Illustrated.
도 6a 내지 도 6h는 도 2a의 원문이 도 4a 내지 도 4d의 번역어순패턴들로 번역어순패턴 DB에 저장된 후 저장된 번역어순패턴 데이터를 기초로 번역문장을 생성하는 일련의 과정을 도시한다. 6A to 6H illustrate a process of generating a translation sentence based on the stored translation order pattern data after the original text of FIG. 2A is stored in the translation order pattern DB as the translation order patterns of FIGS. 4A to 4D.
도 7a는 본 발명의 실시예에 따라 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법의 일 실시예이다. FIG. 7A illustrates an embodiment of a method for dividing an original text into small translation units, which are translation subunits, and determining a translation word order of the small translation units, according to an exemplary embodiment of the present invention.
도 7b는 본 발명의 실시예에 따라 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법 및 이를 이용한 번역 장치의 일 실시예이다. FIG. 7B illustrates an embodiment of a method and a translation apparatus using the same for dividing an original text into small translation units, which are translation subunits, and determining the translation word order of the small translation units, according to an embodiment of the present invention.
도 7c는 본 발명의 실시예에 따라 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법 및 이를 이용한 번역 장치의 다른 실시예이다. 7C is another embodiment of a method for dividing an original text into small translation units, which are translation subunits, and determining a translation word order of the small translation units, according to an embodiment of the present invention, and a translation apparatus using the same.
1. 번역어순패턴1. Translation Patterns
원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되는 컴퓨터 판독 가능 저장 매체에 저장된 번역어순패턴 데이터 구조는 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 소번역단위 분할패턴 데이터들(300), 및 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 포함한다. 또한, 이것은 소번역단위 분할패턴 데이터들(300)에 의해 원문문장(1)으로부터 분할된 소번역단위들을 번역하고, 상기 번역어순 순번에 따라 상기 소번역단위들 각각의 번역문인 소번역문들(4)을 정렬하여서 원문문장(2)의 번역문장(2)을 출력한 출력 데이터를 포함한다. '원문문장의 번역문장(2)을 출력한 출력 데이터'는 컴퓨터 판독가능한 저장매체, 예를 들어 메모리, 하드 디스크 등에 저장될 수 있고, 사용자에게 표시하기 위해 표시 유닛(141)으로, 또는 번역어순패턴 데이터를 생성하기 위해 번역어순패턴 생성 유닛(150), 또는 소번역메모리 및 번역메모리 DB로 송신될 수 있다. The translation order pattern data structure stored in a computer-readable storage medium used for translation in a translation apparatus for translating an original sentence (1) into a translation sentence (2) is composed of a plurality of small translation units from the beginning to the end of the original sentence. The small translation unit division pattern data 300 for division, and the translation order sequence data 5 specified in each of the plurality of small translation unit division patterns 300 are included. In addition, it translates the small translation units divided from the original sentence sentence 1 by the small translation unit division pattern data 300, and the small translation sentences (4) which are the translations of each of the small translation units according to the translation order. ) And the output data obtained by outputting the translated sentence 2 of the original sentence 2. The output data outputting the translated sentence 2 of the original sentence may be stored in a computer readable storage medium, for example, a memory, a hard disk, etc., and displayed on the display unit 141 for display to the user, or in the order of translation. In order to generate the pattern data, it may be transmitted to the translation word order pattern generation unit 150 or the small translation memory and the translation memory DB.
도 4a는 본 발명의 실시예에 따라 원문을 형태소 분석하고 도 3f에서 분할된 소번역단위들을 형태소열로 변환한 소번역단위 분할패턴들에 번역어순 순번을 지정한 번역어순패턴(401, 405) 및 번역어순패턴을 기초로 분할패턴 중간부를 생략한 확장된 번역어순패턴(402-404, 406-408)의 실시예들을 도시한다. 도 4a의 번역어순패턴의 첫번째 행(401)의 소번역단위 분할패턴들(300)에서 AR A A N N(301), V V AR A N(321), R AR D A N PP N(361), PL PP V A N PR(371), V PV SY(381) 및 D PP N RV SY(391)은 도 3f의 소번역단위들(30, 32, 36, 37, 38, 39)에 대한 6개의 소번역단위 분할패턴들(300)을 나타내며, 각 소번역단위 분할패턴들(300)에 번역어순 순번들(5)이 지정되어 있다. 본 발명의 번역 장치 또는 번역 프로그램에서 상기 소번역단위 분할패턴(300)들에 의해 분할된 소번역단위(3)들을 번역하고, 상기 번역어순 순번(5)들에 따라 상기 소번역단위(3)들 각각의 번역문인 소번역문들(4)을 정렬하면 상기 원문문장(1)의 번역문장(2)이 된다. 소번역단위들(3) 각각은 다른 원문문장을 소번역단위들로 분할할 때 사용하기 위해 형태소 분석 및 품사 태킹을 통해 소번역단위의 형태소열 중 하나 이상의 품사를 포함하는 소번역단위 분할패턴(300)으로 변환될 수 있다. 다른 실시예에서, 소번역단위들(3) 각각은 원문언어 단어, 부호 및 이들을 형태소 분석한 품사 중 하나 이상을 포함하는 소번역단위 분할패턴(300)으로 표시될 수 있다('for'를 포함하는 도 4b 두번째 행 참조). 4A is a translation word order pattern (401, 405) specifying a translation order in order for a morphological analysis of original texts and small translation unit division patterns obtained by converting the small translation units divided in FIG. 3F into morpheme sequences according to an embodiment of the present invention; Exemplary embodiments of the expanded translation order patterns 402-404 and 406-408, which omit the split pattern middle part based on the translation order patterns, are illustrated. In the small translation unit division patterns 300 of the first row 401 of the translation word order pattern of FIG. 4A, AR AANN 301, VV AR AN 321, R AR DAN PP N 361, PL PP VAN PR ( 371), V PV SY 381 and D PP N RV SY 391 are the six small translation unit division patterns for the small translation units 30, 32, 36, 37, 38, 39 of FIG. 300, translation word sequence numbers 5 are assigned to each of the small translation unit division patterns 300. FIG. The small translation unit (3) divided by the small translation unit division pattern (300) in the translation apparatus or translation program of the present invention is translated, and the small translation unit (3) according to the translation order (5) When the small translations 4 which are the translations of each of the two are arranged, the translated sentence 2 of the original sentence 1 is obtained. Each of the sub-translational units (3) includes a sub-translational unit division pattern that includes one or more parts of speech in the morphological sequence of the sub-translational unit through morphological analysis and part-of-speech tagging for use in dividing other original text into sub-translational units. 300). In another embodiment, each of the small translation units 3 may be represented by a small translation unit division pattern 300 including one or more of a source word, a code, and a part-of-speech that parsed them (including 'for'). 4b second row).
각각의 소번역단위 분할패턴(300)은 다른 문장을 형태소 분석한 형태소열 중 상기 소번역단위 분할패턴들 각각과 일치하는 형태소열이 존재할 경우 소번역단위로 분할하기 위한 분할 규칙으로 적용된다. 이 경우, 분할된 소번역단위들의 번역 어순은 상기 소번역단위 분할패턴과 경계를 이루는 앞뒤 소번역단위의 품사들과의 관계에 따라 결정된다. 또한, 번역 어순은 사용자가 하나의 원문문장을 복수의 소번역단위들로 분할하면서 분할되는 소번역단위들간의 어순을 지정하여 결정될 수 있다. 또한, 원문문장에 대해 분할된 소번역단위들 및 이에 지정된 번역어순 순번을 사용자에게 디스플레이하고 사용자가 잘못된 어순을 수정하여 소번역단위들 및 각 소번역단위들에 지정한 번역어순 순번을 입력하면, 사용자가 입력한 소번역단위들 및 각 소번역단위들에 지정한 번역어순 순번으로부터 번역어순패턴을 결정한다. Each of the small translation unit division patterns 300 is applied as a division rule for dividing the small translation unit into small translation units when there are morpheme sequences that match each of the small translation unit division patterns among the morpheme sequences obtained by morphological analysis of another sentence. In this case, the translation word order of the divided sub-translation units is determined according to the relations with the parts of speech of the front-rear sub-translation units forming a boundary with the sub-translation unit division pattern. In addition, the translation word order may be determined by designating the word order between the small translation units that are divided while the user divides one original text into a plurality of small translation units. In addition, when the sub-translation units divided by the original text and the translation order assigned thereto are displayed to the user, and the user corrects an incorrect word order and inputs the translation-translation order assigned to the small translation units and each small translation unit, the user The translation order sequence is determined from the sub-translation units entered by the user and the translation order assigned to each sub-translation unit.
도 4a에서 원문문장을 번역 소단위들인 소번역단위(3)들로 분할하고 형태소 분석한 각각의 분할패턴들(300)에 /1, /6, /3, /2, /5, /4와 같이 번역어순 순번(5)이 지정되어 있다. 이러한 번역어순패턴들은 번역어순패턴 DB(160)에 저장되고, 이후 다른 문장을 번역할 때 동일한 소번역단위 분할패턴이 나타나면, 그 다른 문장을 상기 동일한 소번역단위 분할패턴에 의해 소번역단위로 분할할 수 있고, 6개의 소번역단위 분할패턴이 모두 매칭된다면 동일한 번역어순패턴을 적용하여서, 기계적으로 분할할 수 있는 분할 어순결정 규칙 없이 번역어순패턴만으로 원문문장을 소번역단위로 분할하고 각 소번역단위들 간에 번역어순을 결정할 수 있다. In FIG. 4A, the original text is divided into sub-translational units (3), which are translation sub-units, and each division pattern 300 which is morphologically analyzed as / 1, / 6, / 3, / 2, / 5, and / 4. A translation word order (5) is specified. These translation order patterns are stored in the translation order pattern DB 160, and when the same small translation unit division pattern appears when translating another sentence, the other sentences are divided into small translation units by the same small translation unit division pattern. If all six small translation unit division patterns are matched, the same original translation order pattern is applied, and the original sentence is divided into small translation units only by the translation word order without the division word order decision rule that can be mechanically divided. The order of translation can be determined between units.
도 4a에서와 같이, 번역어순패턴 생성유닛(150)에서 번역어순패턴(401, 405)에 기초하여 파생된 번역어순패턴(402 - 404, 406 - 408)이 생성될 수 있다. 도 4a의 번역어순패턴에 포함된 소번역단위 분할패턴은 분할패턴 형태소열 중 하나 이상의 품사를 포함하는 분할패턴 시작부, 분할패턴 중간부 및 분할패턴 형태소열 중 하나 이상의 품사를 포함하는 분할패턴 종료부로 구성된 것을 도시한다. 일 실시예에서, 분할패턴 시작부 및 분할패턴 종료부는 각각 소번역단위 분할패턴을 구성하는 형태소열 중 하나 이상의 품사로 구성될 수 있고, 분할패턴 중간부는 생략될 수 있다. 번역어순패턴 생성유닛(150)은 분할패턴 중간부를 생략하고, 분할패턴 시작부 및 분할패턴 종료부를 포함하는 소번역단위 분할패턴들, 및 상기 소번역단위 분할패턴들에 지정된 번역어순 순번을 포함하는 번역어순패턴을 생성할 수 있다. 분할패턴 시작부는 원문문장의 문두부터 문미까지 소번역단위들로의 분할이 시작되는, 소번역단위의 형태소열 중 맨 앞에 위치하는 하나 이상의 품사를 포함한다. 또한, 분할패턴 종료부는 소번역단위들로의 분할이 종료되는, 소번역단위의 형태소열 중 마지막에 위치하는 하나 이상의 품사를 포함한다. 원문문장을 소번역단위들로 분할할 때 분할되는 부분이 전치 품사와 후치 품사의 사이이기 때문에, 후치 품사와 전치 품사에 상응하는 부분인 소번역단위 분할패턴의 분할패턴 시작부와 분할패턴 종료부만 갖추어지면, 소번역단위 분할 패턴에 의해 원문문장을 소번역단위들로 분할할 수 있다. 소번역단위 분할패턴의 시작부는 분할어순규칙 분할쌍에 의해 분할되는 앞뒤 소번역단위 중 후치품사를 포함하는 소번역단위의 분할패턴의 시작부에 대응한다. 또한 소번역단위 분할패턴의 종료부는 분할어순규칙 분할쌍에 의해 분할되는 앞뒤 소번역단위 중 전치품사를 포함하는 소번역단위 분할패턴의 종료부에 대응한다. As shown in FIG. 4A, the translation order patterns 402 to 404 and 406 to 408 derived based on the translation order patterns 401 and 405 may be generated in the translation order pattern generating unit 150. The small translation unit division pattern included in the translation word order pattern of FIG. 4A includes a division pattern start part including one or more parts of the division pattern morpheme sequences, a division pattern middle part, and an end division pattern including one or more parts of the division pattern morpheme sequences. It shows what consists of wealth. According to an embodiment, the division pattern start part and the division pattern end part may be composed of one or more parts of speech included in the small translation unit division pattern, and the middle part of the division pattern may be omitted. The translation word order pattern generating unit 150 omits the middle part of the split pattern, and includes small translation unit split patterns including a split pattern start part and a split pattern end part, and a translation word sequence number specified in the small translation unit split patterns. A translation word order pattern can be generated. The beginning of the division pattern includes one or more parts of speech located at the beginning of the morphological sequence of the sub-translational unit, from which the division into the sub-translational units from the beginning to the end of the original sentence begins. In addition, the segmentation pattern end part includes one or more parts of speech located last in the morphological sequence of the small translation unit, in which the division into the small translation units is terminated. When the original sentence is divided into sub-translational units, the segmentation part is between the pre-positional part and the post-partial part-of-speech. If provided, the original text can be divided into sub-translational units by the sub-translational unit division pattern. The beginning of the sub-translational unit division pattern corresponds to the beginning of the division pattern of the sub-translational unit including post-parts of the front and rear sub-translation units divided by the divisional word order division pair. The end portion of the sub-translational unit division pattern corresponds to the end portion of the sub-translational unit division pattern including prepositions among the front and rear sub-translation units divided by the divisional word order division pair.
분할패턴 중간부 생략 확장 번역어순패턴1의 첫번째 행(402)에서 첫번째 소번역단위 분할패턴(AR *** N)은 분할패턴 시작부(AR) 및 분할패턴 종료부(N)가 각각 하나의 품사로 이루어져 있고, 분할패턴 중간부(***)는 3개의 품사가 생략되었음을 표시한다. 분할패턴 중간부 생략 확장 번역어순패턴2의 첫번째 행(403)에서 첫번째 소번역단위 분할패턴(AR A*** N)은 분할패턴 시작부(AR A)가 2개의 품사이며 분할패턴 종료부(N)는 분할 패턴의 형태소열 중 마지막 명사를 나타내는 품사 N부터 끝까지의 품사로 표시되고, 분할패턴 중간부(**)는 2개의 품사가 생략되었음을 표시한다. 분할패턴 중간부 생략 확장 번역어순패턴2의 첫번째 행(403)에서, 제 4 분할패턴의 분할패턴 종료부(N PR)는 명사 N부터 끝까지의 품사 PR(우괄호)로 표시된다. 분할패턴 중간부 생략 번역어순패턴3의 첫번째 행(404)은 첫번째 소번역단위 분할패턴들이 분할패턴 시작부(AR, V R, PL, V, D)(302) 및 분할패턴 종료부(N, N, N, PR, SY, RV)(303)를 갖고, 분할패턴 중간부가 모두 생략된 형태(-)로 표시된다. 도 4a에서 번역어순패턴 및 분할패턴 중간부 생략 확장 번역어순패턴의 두번째 행(405-408)은 첫번째 행(401-404)의 번역어순패턴을 원문문장의 형태소열과 매칭시키기 용이하도록 형태소열로 표시한 실시예다. 각각의 분할패턴은 문두부터 해당 분할단위가 종료되는 품사위치(395)까지 카운팅한 숫자로 표시된다. 번역어순패턴은 이러한 분할단위(395)와 해당 분할단위에 지정되는 순번(5)으로 표시될 수 있다. 이러한 표시법은 원문문장의 형태소열과 매칭되는 형태소열을 갖는 번역어순패턴을 검색할 때 유용하며, 매칭되는 형태소열을 갖는 번역어순패턴을 가져온 뒤 가져온 번역어순패턴에 지정된 분할패턴들 및 번역어순 순번에 의해 원문문장을 소번역단위들로 분할할 수 있다. Omitting the split pattern middle part In the first row 402 of the extended translation order pattern 1, the first small translation unit split pattern AR *** N has one split pattern start part AR and one split pattern end part N, respectively. It consists of parts of speech, and the middle part of the division pattern (***) indicates that three parts of speech are omitted. Split pattern middle part skipped In the first row 403 of the extended translation order pattern 2, the first sub-translational unit split pattern AR A *** N has two parts-of-speech at the beginning of the split pattern AR A and the split pattern end part ( N) is a part-of-speech part N to the end of the morpheme sequence of the split pattern, and the middle part (**) of the split pattern indicates that two parts of speech are omitted. In the first row 403 of the split pattern middle part skipped extended word order pattern 2, the split pattern end part N PR of the fourth split pattern is represented by a part-of-speech PR (bracket) from the noun N to the end. Omitting the split pattern middle part The first row 404 of the translation word order pattern 3 includes the split pattern start part (AR, VR, PL, V, D) 302 and the split pattern end part (N, N). , N, PR, SY, RV) 303, and the middle part of the division pattern is represented by a form (-) in which all the division patterns are omitted. In FIG. 4A, the second row 405-408 of the translation word order pattern and the split pattern middle part omitted extended translation word order pattern are displayed in morphological order so as to easily match the translation word order pattern of the first line 401-404 with the morphological sequence of the original sentence. One embodiment. Each division pattern is represented by a number counting from the beginning to the part-of-speech position 395 where the division unit ends. The translation word order pattern may be represented by the division unit 395 and the order number 5 assigned to the division unit. This notation is useful for retrieving translation order patterns with morphological sequences that match the morphological sequences of the original text. The original text can be divided into sub-translational units.
도 4b는 본 발명의 실시예에 따라 원문을 형태소 분석하고 도 3f의 소번역단위들로 분할한 소번역단위 분할패턴들에 번역어순 순번을 포함하는 번역어순패턴 및 번역어순패턴을 기초로 파생된 분할패턴 중간부를 생략한 확장된 번역어순패턴의 실시예들이다. 도 4b는 도 4a의 번역어순패턴의 마지막 소번역단위 분할패턴 D PP N RV SY(391)이 전치사 for 및 하나 이상의 품사를 포함하는 소번역단위 분할패턴으로 변경된 번역 패턴(392)을 갖는 번역어순패턴(409 - 416)을 도시한다. 일 실시예에서, 하나 이상의 원문 단어 및 품사를 포함하는 번역 패턴은 원문문장의 번역 소단위가 될 수 있기 때문에 소번역단위 분할패턴이 될 수 있다. FIG. 4B is derived based on a translation word order pattern and a translation word order pattern including a translation word order in a small translation unit division pattern divided into small translation units of FIG. 3F according to an embodiment of the present invention. Embodiments of the expanded translation order pattern omitting the middle part of the division pattern. FIG. 4B is a translation order in which the last small translation unit division pattern D PP N RV SY 391 of the translation word order pattern of FIG. 4A is changed to a small translation unit division pattern including a preposition for and one or more parts of speech. Patterns 409-416 are shown. In one embodiment, the translation pattern including one or more original words and parts of speech may be a small translation unit division pattern because the translation subunit of the original sentence may be a translation unit.
도 4c는 본 발명의 실시예에 따라 도 3h에서 각 소번역단위들에 대한 소번역문들의 번역 후 생성되는 번역어순패턴에 문미 번역어를 더 포함하는 분할어순 패턴의 실시예들이다. 문미 번역어는 한국어에서 '은', '는', '이', '가', '을', 및 '를'와 같은 조사, '다', '하는', 및 '되는'과 같은 어미, 또는 '를 위해', '에 대해'와 같이 전치사의 번역어를 포함할 수 있다. 문미 번역어는 각 소번역단위들의 번역문인 소번역문들의 문미 번역어를 의미한다. 이러한 문미 번역어는 번역어순패턴에 의해 소번역단위들의 번역어순이 정해지고, 각 소번역단위들을 자동번역했을 때, 각 소번역문들 간을 연결하는데 이용될 수 있다. 문미 번역어(396)는 어순과 같이 분할패턴마다 지정해지는 것은 아니며, 서술어와 같이 자동번역된 문미 번역어와 동일한 경우, 필요에 따라 생략될 수 있다. 이러한 문미 번역어를 통해 소번역단위 분할패턴의 문장 성분을 파악할 수 있다. 예를 들어, 한국어에서 문미 번역어(396)가 '은', '는'이면, 소번역단위가 주격임을 알 수 있고, '을' '를'이면 목적격임을 알 수 있다. FIG. 4C illustrates embodiments of a segmented word order pattern further including a lateral translation in a translation word order pattern generated after translation of small translations of each of the small translation units in FIG. 3H according to an exemplary embodiment of the present invention. Translations of the terminology in Korean are investigated such as 'silver', 'a', 'this', 'a', 'a', and 'a', a ending such as 'a', 'a', and 'being', or May include prepositional translations such as for and about. The literary translation means the translation of the small translations, which are the translations of each of the small translation units. The translated word order is defined by the translation order, and the translation order of the small translation units is determined. When the small translation units are automatically translated, they can be used to connect the small translations. The sentence translation 396 is not specified for each division pattern like the word order, and may be omitted as necessary if the translation sentence 396 is the same as the sentence translation automatically translated. Through the sentence translation, it is possible to grasp the sentence component of the small translation unit division pattern. For example, if the grammar translation 396 in Korean is' silver 'and' is', it can be seen that the sub-translational unit is the main subject, and if 'ul' '' 'is the object.
도 4e은 도 4a의 분할패턴 중간부 생략 확장 번역어순패턴들(402-404, 406-408)과 같이, 파생된 번역어순패턴의 실시예들을 도시한다. 도 4a 또는 도 4b의 번역어순패턴(401, 405, 409, 413)으로부터 정해진 규칙에 따라, 확장 번역어순패턴 생성 유닛(150)에서 확장 번역어순패턴을 생성할 수 있다. 이들 확장 번역어순패턴은 번역어순패턴 DB(160)에 저장된다. 또한, 번역어순패턴 DB(160)를 분석하여 분할패턴의 형태소열들 및 이의 번역어순 순번에서 일정한 규칙성을 찾아서, 번역어순패턴을 확장시킬 수 있다. 확장된 번역어순패턴의 생성은 번역어순패턴 DB(160)의 번역어순패턴들을 분석하여 일정한 규칙성을 찾을 수 있고, 이미 결정된 번역어순패턴에서 분할패턴들을 병합하고 결번되는 번호를 다음으로 큰 순번을 갖는 소번역단위 분할패턴의 순번이 갖도록 하나씩 감소시켜 생성할 수 있다. 산술적으로 6개의 소번역단위들로 분할되고 순번이 지정된 번역어순패턴은 2개의 소번역단위들로 병합시켜서 5개, 4개, 3개 및 2개의 순번을 갖는 소번역단위 분할패턴들로 병합시킬 수 있다. 도 4d는 도 2a의 원문문장으로부터 2개 내지 6개의 분할패턴들을 갖는 번역어순패턴들(430-441) 및 분할패턴 중간부 생략 확장 번역어순패턴(442-453)을 도시한다. FIG. 4E illustrates embodiments of the derived translation word order pattern, such as the split pattern middle portion omitted extended word order patterns 402-404, 406-408 of FIG. 4A. According to a rule determined from the translation word order patterns 401, 405, 409 and 413 of FIG. 4A or 4B, the extended translation word order pattern generating unit 150 may generate the extended translation word order pattern. These extended translation order patterns are stored in the translation order patterns DB 160. In addition, the translation order pattern DB 160 may be analyzed to find a regularity in the morpheme sequences of the division pattern and the translation order, thereby expanding the translation order pattern. The generation of the extended translation order pattern can find a regularity by analyzing the translation order patterns of the translation order pattern DB 160, merging the split patterns from the already determined translation order pattern, and then assigning the missing number to the next largest order. It can be generated by decreasing one by one so that the order of the small translation unit division pattern having. The arithmetically divided translation sequence pattern divided into six sub-translational units is merged into two sub-translational units and merged into five, four, three, and two sub-translational unit division patterns. Can be. 4D illustrates translation word order patterns 430-441 having two to six division patterns from the original sentence of FIG. 2A, and an abbreviated translation translation order pattern 442-453 in the middle of the division pattern.
부연하면, 도 4e는 도 4a에서 번역어순에 영향을 미치지 않는 괄호 부분을 생략하고 번역어순을 다시 결정한 것을 도시한다. 여기서, PL PP V A N PR / 2이 생략됨에 따라 2보다 큰 번역어순 순번들은 하나씩 감소되어 새로운 번역어순 순번이 결정된다. 4E illustrates that the translation order is again determined by omitting the parentheses in FIG. 4A that do not affect the translation order. Here, as PL PP V A N PR / 2 is omitted, translation order numbers larger than 2 are reduced by one, thereby determining a new translation order number.
도 4e의 다섯번째 행(480-484)는 도 4a의 번역어순패턴으로부터 번역어순 순번 /2, /3 및 /4의 관계사절을 하나의 소번역단위로 병합함으로써 AR A N / 1, V AR A N / 3 및 R Clause/3의 3개로 구성되는 확장 번역어순패턴을 도시한다. 이러한 확장 번역어순패턴은 문장 내에서 관계사절 뿐만 아니라, 특정 that절, which절, 명사절, 부사구, 전치사구, 동사구 등 특정 단어를 포함하는 소번역단위 분할패턴에 지정되는 번역어순패턴의 규칙으로 확장될 수 있다. The fifth row (480-484) of FIG. 4E shows the AR AN / 1, V AR AN / by merging the relative rounds of the translation order sequential / 2, / 3 and / 4 from the translation order sequence pattern of FIG. 4A into one small translation unit. An extended translation order pattern consisting of 3 and R Clause / 3 is shown. The extended translation order pattern can be extended not only in relation clauses but also in rules of translation order patterns specified in sub-translational unit division patterns including specific words such as specific that clauses, which clauses, noun clauses, adverb phrases, prepositional phrases, and verb phrases. have.
2. 기계적 분할 및 번역어순 결정 유닛으로부터 번역어순패턴 데이터 생성2. Generation of translation order pattern data from mechanical segmentation and translation order determination unit
도 1a는 본 발명의 실시예에 따라 원문문장을 번역문장으로 번역하는 장치 또는 프로그램에서 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하는 컴퓨터 실행가능 명령어들을 저장한 저장매체, 프로그램, 또는 장치의 일 실시예이다. 이러한 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 실행가능 명령어들을 저장한 저장매체, 번역 프로그램, 방법 및 장치는 개인용 컴퓨터, 모바일 장치, 서버 또는 이들의 2 이상의 장치가 결합된 네트워크 상에서 이용될 수 있다. 본 발명의 번역어순패턴 데이터 구조, 상기 컴퓨터 실행가능 명령어들을 저장한 저장매체 또는 번역 프로그램은 다운로드 가능한 형태로 서버를 통해 배포될 수 있다. 1A is a diagram illustrating a computer executable instruction for dividing a source sentence into sub-translation units, which are translation sub-units, and for determining translation order of the sub-translation units in an apparatus or program for translating the original sentence into a translation sentence, according to an embodiment of the present invention. One embodiment is a storage medium, program, or device. A storage medium, a translation program, a method and an apparatus for dividing the original text into sub-translation units, which are translation sub-units, and storing computer executable instructions for determining the translation order of the sub-translation units may be a personal computer, a mobile device, a server, or Two or more devices of may be used on a combined network. The translation order pattern data structure of the present invention, a storage medium storing the computer executable instructions, or a translation program may be distributed through a server in a downloadable form.
본 발명에 따른 컴퓨터 판독 가능 저장 매체는 컴퓨터 하드 디스크, 메모리, SSD, USB 등 광학적으로, 자기적으로, 또는 기타 방법으로 데이터가 저장되는 컴퓨터에서 판독 가능한 물리적 구성요소 또는 물질을 의미한다. A computer readable storage medium according to the present invention refers to a computer readable physical component or material in which data is stored optically, magnetically, or by other means, such as a computer hard disk, memory, SSD, USB, or the like.
도 2a는 본 발명에서 예시적으로 사용되는 원문문장(1) 및 원문에 대한 번역문장(1)을 도시한다. 원문문장 수신 유닛(100)은 번역할 원문문장(1)을 불러 들인다. 형태소 분석 및 품사 태깅 유닛(110)은 원문문장 수신 유닛(100)으로부터 원문문장을 입력 받아 형태소 분석을 수행하고 품사를 태깅한다. 형태소분석 (morphological analysis)은 자연언어 분석의 첫 단계로써, 원문문장의 입력문자열을 형태소열로 바꾸는 작업을 한다. 형태소(morpheme)는 의미의 최소단위로써, 더 이상 분석 불가능한 가장 작은 의미 요소를 말한다. 예를 들어, 형태소는 문법적 혹은 관계적인 뜻을 나타내는 단어 또는 단어의 부분, 그리고 단순어의 어근(원형), 어미나 조사, 접두사, 접미사 등이다. 구두점(','), 세미콜론(';')과 같은 원문문장을 구성하는 원문 부호들도 품사처럼 구두점, 세미콜론으로 태킹되고 품사와 동일하게 취급된다. 본 발명에서 한국어가 원문 언어인 경우, 한국어 형태소의 품사를 '명사, 대명사, 수사, 동사, 형용사, 관형사, 부사, 감탄사, 조사, 어미, 접사, 어근, 부호, 한글 이외'와 같이 나누고 각 품사를 태깅할 수 있다. 본 발명에서 중국어가 원문 언어인 경우, 중국어 형태소의 품사를 '명사, 대명사, 수사, 양사, 동사, 형용사, 부사, 전치사, 조사, 부호, 중국어 이외'와 같이 나누고 각 품사를 태깅할 수 있다. FIG. 2A shows the original sentence 1 used in the present invention and the translated sentence 1 for the original. The original sentence receiving unit 100 calls the original sentence 1 to be translated. The morpheme analysis and part-of-speech tagging unit 110 receives the original sentence from the original sentence receiving unit 100 to perform morphological analysis and tag the part-of-speech. Morphological analysis is the first step in the analysis of natural language. The morphological analysis is the process of converting the original text into a morphological sequence. A morpheme is the smallest unit of meaning, the smallest semantic element that can no longer be analyzed. For example, morphemes are words or parts of words that represent grammatical or relational meanings, as well as roots, primitives, surveys, prefixes, and suffixes of simple words. The text symbols that make up the original sentence, such as punctuation marks (',') and semicolons (';'), are also tagged with punctuation and semicolons and are treated like parts of speech. In the present invention, when Korean is the original language, parts of Korean morphemes are divided into 'nouns, pronouns, verbs, adjectives, adjectives, adverbs, interjections, investigations, endings, affixes, roots, signs, and other Korean parts of speech.' Can be tagged. In the present invention, if the Chinese is the original language, the parts of Chinese morphemes can be divided into 'nouns, pronouns, rhetoric, quantums, verbs, adjectives, adverbs, prepositions, investigations, signs, other than Chinese', and tag each part of speech.
형태소 분석 및 품사 태깅 유닛(110)은 명사 및 명사의 형태소열과 같은 복합명사를 하나의 명사로 처리하거나, 전치사 및 관계사의 형태소열을 관계사로 묶는 등 복수의 품사를 하나의 품사로 묶는 청킹(chunking) 유닛을 포함할 수 있다. The morpheme analysis and part-of-speech tagging unit 110 processes multiple nouns such as nouns and morphological sequences of nouns into one noun, or chunking a plurality of parts-of-speech into a part-of-speech by tying the morphological sequences of prepositions and related parties into related parts. ) May include a unit.
원문문장을 형태소 분석하고 품사 태깅한 형태소열은 소번역단위 및 번역어순 결정 유닛(120)으로 입력되어 소번역단위들로 분할되고, 각 소번역단위들에 번역어순 순번을 결정한다. 소번역단위 및 번역어순 결정 유닛(120)은 번역어순패턴 매칭유닛(121) 및 기계적 분할 및 어순 결정 유닛(122)을 포함한다. The morpheme sequence obtained by morphological analysis of the original sentence and part-of-speech tagging is inputted into the small translation unit and the translation word order determination unit 120, and is divided into the small translation units, and the translation order is determined in each small translation unit. The small translation unit and the translation word order determination unit 120 include a translation word order pattern matching unit 121 and a mechanical segmentation and word order determination unit 122.
번역어순패턴 매칭 유닛(121)은 형태소 분석된 원문문장의 형태소열과 일치하는 소번역단위 분할 패턴을 포함하는 번역어순패턴(401-484)이 있는지 번역어순패턴 및 확장된 번역어순패턴이 저장된 번역어순패턴 DB(160)을 검색하고, 번역어순패턴 DB(160)에 전체적으로 또는 부분적으로 매칭되는 번역어순패턴(401-484)이 존재하면, 매칭된 번역어순패턴(401-484)에 포함된 소번역단위 분할패턴에 따라 원문문장을 소번역단위들로 분할하고 각 소번역단위들에 번역어순 순번을 결정한다. 번역어순패턴 DB(160)는 번역어순패턴(401-484)을 기록한 컴퓨터 판독가능한 저장매체들 중 하나이다. 형태소 분석된 원문문장의 형태소열과 전체적으로 매칭된다고 함은 번역어순패턴의 모든 소번역단위 분할패턴의 형태소열이 원문의 형태소열과 일치함을 의미하고, 부분적으로 매칭되는 번역어순패턴은 원문문장의 형태소열과 비교하여 모든 소번역단위 분할패턴들 중 하나를 제외한 나머지 소번역단위 분할패턴들의 형태소열이 매칭되는 번역어순패턴을 의미한다. 모든 소번역단위 분할패턴들 중 하나가 다른 경우에도, 그 다른 하나를 제외한 다른 매칭되는 분할패턴들 및 그 순번들이 매칭되기 때문에, 상기 매칭되는 분할패턴들 및 그 순번들로부터 매칭되지 않는 상기 하나의 분할패턴 및 그 순번을 유추할 수 있기 때문이다. 또한, 부분적으로 매칭되는 번역어순패턴은 원문문장을 소번역단위들로 분할했을 때 소번역단위들 각각의 형태소열 중 맨 앞에 위치된 하나 이상으로 구성된 소번역단위 분할패턴 시작부 그리고 소번역단위들 각각의 형태소열 중 마지막에 위치된 하나 이상으로 구성된 소번역단위 분할패턴 종료부와 매칭되는 번역어순패턴을 의미한다. 소번역단위 분할패턴 시작부 및 소번역단위 분할패턴 종료부로 구성된 소번역단위 분할패턴들 및 이들 분할패턴들 각각에 지정된 순번을 포함하는 번역어순패턴은 원문문장의 형태소열과 전체적으로 매칭되지 않는 경우라도, 소번역단위 분할패턴들 각각의 시작부와 종료부만 매칭되면 원문을 소번역단위들로 분할할 수 있고 상기 시작부 및 종료부를 포함하는 분할패턴들에 지정된 번역어순 순번에 따라 분할된 상기 소번역단위들에 번역어순 순번을 결정할 수 있다. 그러므로, 원문문장의 각 소번역단위들의 형태소열의 시작부 및 종료부가 매칭되는 번역어순패턴에 포함된 분할패턴 시작부와 분할패턴 종료부를 포함하는 각각의 분할패턴들과 모두 일치하는 경우 원문문장을 분할하고 분역어순을 결정하는 상기 어순결정 패턴으로 이용할 수 있다. The translation word order matching unit 121 checks whether there is a translation word order pattern (401-484) including a small translation unit segmentation pattern that matches the morpheme sequence of the original sentence. When the pattern DB 160 is searched and there is a translation word order pattern 401-484 that is matched in whole or in part with the translation word order pattern DB 160, the small translation included in the matched translation word order pattern 401-484 The original text is divided into small translation units according to the unit division pattern, and the translation order is determined in each small translation unit. The translation order pattern DB 160 is one of computer-readable storage media recording the translation order patterns 401-484. When the morphological sequence of the original sentence is matched globally, it means that the morphological sequence of all the sub-translational unit division patterns of the translated word sequence pattern matches the morphological sequence of the original text. By comparison, it means a translation word order pattern in which the morphological sequences of the remaining small translation unit division patterns except one of all the small translation unit division patterns are matched. Even if one of all the sub-translational unit division patterns is different, other matching division patterns except the other one and their sequence numbers are matched, so that the one that does not match from the matching division patterns and sequence numbers is matched. This is because the division pattern and the sequence number can be inferred. In addition, the partially matched translation word order pattern may include a small translation unit division pattern beginning and a small translation unit composed of one or more of the morphological sequences of each of the small translation units when the original text is divided into small translation units. The translation word order pattern is matched with the end of the small translation unit division pattern consisting of one or more positions positioned at the end of each morpheme sequence. Even if the small translation unit division patterns composed of the small translation unit division pattern start part and the small translation unit division pattern end part, and the translation word order pattern including the sequence number assigned to each of these division patterns are not matched with the morphological sequence of the original sentence as a whole, If only the start and end portions of each of the small translation unit division patterns are matched, the original text can be divided into small translation units, and the small translations are divided according to the translation word order assigned to the division patterns including the beginning and end sections. The order of translation can be determined in units. Therefore, when the beginning and end of the morphological sequence of each sub-translation unit of the original sentence match all the division patterns including the division pattern start part and the division pattern end part included in the matching translation order pattern, the original sentence is divided. It can be used as the word order determination pattern to determine the order of division.
기계적 분할 및 어순 결정 유닛(122)은 도 2b 및 도 2c과 같은 분할 어순 규칙에 따라 원문문장을 복수의 소번역단위들로 분할하고 분할된 소번역단위들에 번역어순 순번을 결정한다(도 3f 참조). 기계적 분할 및 번역어순결정은 도 3a 내지 도 3i를 설명하는 부분에서 더 설명한다. The mechanical division and word order determination unit 122 divides the original text into a plurality of small translation units according to the division word order rules as shown in FIGS. 2B and 2C and determines the translation word order number in the divided small translation units (FIG. 3F). Reference). Mechanical partitioning and translation ordering are further described in the section describing FIGS. 3A-3I.
소번역단위 및 번역어순 결정 유닛(120)에서 번역어순패턴 매칭유닛(121) 및 기계적 분할 및 어순 결정 유닛(122)은 서로 중첩적으로 적용될 수 있다. 즉, 번역어순패턴 매칭유닛(121)으로 번역어순패턴 DB(160)를 검색하고 매칭된 번역어순패턴이 없는 경우 기계적 분할 및 어순 결정 유닛(122)을 통해 원문문장을 소번역단위들로 기계적으로 분할하고 각 소번역단위들의 번역어순 순번을 결정할 수 있다. 또한, 기계적 분할 및 어순 결정 유닛(122)을 통해 원문문장을 소번역단위들로 기계적으로 분할하고 각 소번역단위들의 번역어순 순번을 결정한후, 번역어순패턴 매칭유닛(121)으로 번역어순패턴 DB(160)를 검색하고 매칭된 번역어순패턴이 있는 경우 기계적 분할 및 어순 결정 유닛(122)을 통해 결정된 소번역단위들 및 이들 각각의 지정된 순번들을 리셋하고 번역어순패턴 매칭유닛(121)에서 불러온 번역어순패턴을 적용시킬 수 있다. 또한, 본 발명의 다른 실시예에서, 기계적 분할 및 어순 결정 유닛(122)을 통해 원문문장을 소번역단위들로 기계적으로 분할하고 번역어순패턴 매칭 유닛(121)을 통해 분할된 소번역단위들에 대해 번역어순 순번을 결정할 수 있다. 즉, 분할된 소번역단위들의 형태소열들에 대해, 번역어순패턴 매칭유닛(121)으로 번역어순패턴 DB(160)를 검색하고 매칭되는 형태소열들을 포함하는 번역어순패턴이 있는 경우 소번역단위들로의 분할은 기계적 분할 및 어순 결정 유닛(122)을 통해 수행하고, 분할된 소번역단위들의 번역어순 순번은 번역어순패턴 매칭유닛(121)에서 불러온 번역어순패턴의 번역어순 순번으로 결정할 수 있다.In the small translation unit and the translation word order determination unit 120, the translation word order pattern matching unit 121 and the mechanical segmentation and word order determination unit 122 may be applied to each other. That is, when the translation order pattern matching unit 121 searches the translation order pattern DB 160 and there is no matched translation order pattern, the original sentence is mechanically translated into small translation units through the mechanical segmentation and word order determination unit 122. Can be divided and the translation order of each small translation unit can be determined. Further, after mechanically dividing the original text into small translation units through the mechanical segmentation and word order determination unit 122 and determining the translation order of each small translation unit, the translation order pattern DB is translated into the translation order pattern matching unit 121. (160) and if there is a matched translation word order pattern, the small translation units determined through the mechanical segmentation and word order determination unit 122 and their respective designated order numbers are reset and retrieved from the translation word order matching unit 121. Translation order pattern can be applied. Further, in another embodiment of the present invention, the original text is mechanically divided into small translation units through the mechanical segmentation and word order determination unit 122, and divided into the small translation units divided through the translation word pattern matching unit 121. The order of translation can be determined. That is, with respect to the morphological sequences of the divided sub-translation units, the translation sequence pattern matching unit 121 searches for the translation sequence pattern DB 160, and if there is a translation sequence pattern including the matching morpheme sequences, the small translation units The division into a row is performed through the mechanical division and word order determination unit 122, and the translation word order of the divided small translation units may be determined as the translation word order of the translation word order pattern imported from the translation word order pattern matching unit 121. .
소번역단위 및 번역어순 결정 유닛(120)은 원문문장에 대해 복수의 소번역단위들로의 분할과 분할된 소번역단위에 대해 지정된 번역어순 순번을 출력한다. 소번역단위 및 번역어순 결정 유닛(120)으로부터 출력된 복수의 소번역단위들 및 분할된 소번역단위에 대해 지정된 번역어순 순번은 사용자 인터페이스 유닛(140)으로 입력되어 사용자에게 표시 유닛(141)을 통해 디스플레이될 수 있고, 사용자는 복수의 소번역단위들로의 분할과 분할된 소번역단위에 대해 지정된 번역어순 순번을 입력 유닛(142)으로 수정하여 입력하면, 수정된 입력 사항을 소번역단위 및 번역 어순 결정 유닛(120) 또는 번역어순패턴 생성유닛(150)이 수신하여 각각 상기 입력 사항으로 소번역단위 및 번역어순 순번을 결정하거나 번역어순패턴을 생성할 수 있다. 번역어순패턴 생성 유닛(150)을 통해 사용자가 수정한 번역어순패턴을 번역어순패턴 DB(160)에 저장할 수 있다. The small translation unit and the translation word order determination unit 120 output the original text sentence into a plurality of small translation units for the original text and a designated translation word sequence number for the divided small translation unit. The plurality of small translation units outputted from the small translation unit and the translation word order determining unit 120 and the translation word order designated for the divided small translation unit are input to the user interface unit 140 to display the display unit 141 to the user. When the user inputs the divided translation into a plurality of small translation units and modifies the translation sequence number specified for the divided small translation units into the input unit 142, the modified input items are inputted into the small translation unit and The translation word order determining unit 120 or the translation word order pattern generating unit 150 may receive a small translation unit and a translation word order as the input items, respectively, or generate a translation word order pattern. The translated word order pattern generating unit 150 may store the translated word order pattern modified by the user in the translated word order pattern DB 160.
본 발명의 다른 실시예에서, 소번역단위 및 번역어순 결정 유닛(120)은 기계적 분할 및 어순 결정 유닛(121)에서 기계적으로 하나의 원문문장 또는 소번역단위를 2 개 소번역단위들로 분할할 때마다 표시 유닛(141)에 상기 결정된 소번역단위들 및 이들 각각에 지정된 순번을 사용자에게 표시할 수 있다. 이후 사용자가 분할된 소번역단위들 및 분할된 소번역단위에 대해 지정된 번역어순 순번을 입력 유닛(142)으로 수정하여 입력하면, 기계적 분할 및 어순 결정 유닛(121)에서 기계적으로 결정된 소번역단위들 및 이들 각각에 지정된 순번에 우선하여 상기 입력 유닛(142)을 통해 입력된 소번역단위 및 번역어순 순번을 원문문장에 대한 소번역단위들 및 이들 각각의 번역어순 순번으로 결정할 수 있다. 사용자는 입력 유닛(142)을 통해 직접 소번역단위들 및 분할된 소번역단위들 각각에 대해 순번을 결정하는 신호를 입력할 수 있을 뿐만 아니라, 분할되는 소번역단위들의 위치 정보 및 분할된 소번역단위들의 순번을 포함하는 제 1 신호 또는 제 2 신호를 입력하여, 소번역단위들 및 이들 각각에 지정되는 번역어순 순번을 결정할 수 있다(도 5a 내지 도 5m 참조). 이 경우에서, 소번역단위 및 번역어순 결정 유닛(120)은 제 1 신호 또는 제 2 신호에 의해 2 개의 소번역단위들로 분할될 때마다 표시유닛(141)에 상기 결정된 소번역단위들 및 이들 각각에 지정된 순번을 사용자에게 표시할 수 있다. 일 실시예에서, 제 1 신호는 마우스 좌클릭이고, 제 2 신호는 연속적인 마우스 좌클릭 및 우클릭이다. 소번역단위 및 번역어순 결정 유닛(120)으로부터 출력된 복수의 소번역단위들 및 각 소번역단위들에 대해 지정된 번역어순 순번은 사용자 인터페이스 유닛(140)으로 입력된다. In another embodiment of the present invention, the small translation unit and the translation order determination unit 120 may mechanically divide one original sentence or the small translation unit into two small translation units in the mechanical division and word order determination unit 121. Each time, the display unit 141 may display the determined small translation units and the order assigned to each of them. Subsequently, when the user modifies and inputs the divided sub-translation units and the translation word order designated for the divided sub-translation units into the input unit 142, the small translation units determined mechanically in the mechanical division and word order determination unit 121. And a small translation unit and a translation language sequence number input through the input unit 142, in preference to the sequence number assigned to each of them, as the small translation units for the original text and their respective translation language sequence. The user can directly input the signal for determining the order number for each of the small translation units and the divided small translation units through the input unit 142, as well as the location information of the divided small translation units and the divided small translations. By inputting the first signal or the second signal including the sequence number of the units, it is possible to determine the small translation units and the translation word sequence number assigned to each of them (see FIGS. 5A to 5M). In this case, the small translation unit and the translation word order determining unit 120 are determined in the display unit 141 whenever the small translation unit and the small translation unit are divided into two small translation units by the first signal or the second signal, and these. The order assigned to each can be displayed to the user. In one embodiment, the first signal is mouse left click and the second signal is consecutive mouse left click and right click. The plurality of small translation units output from the small translation unit and the translation word order determining unit 120 and the translation word order designated for each small translation unit are input to the user interface unit 140.
소번역문 및 번역문 생성유닛(130)은 번역 메모리 매칭 유닛(131), 자동번역유닛(132)을 포함하고 문미번역어 처리 유닛(133)을 더 포함할 수 있다. 번역메모리 매칭 유닛(131)은 번역메모리 DB(170)를 검색하여 소번역단위들과 매칭되는 번역메모리를 불러와서 소번역문을 생성할 수 있다. 매칭되는 번역메모리가 없어서 번역메모리를 불러오지 못한 소번역단위들은 자동번역유닛(132)을 통해 자동번역된다(도 3g 참조). 자동번역 유닛(132)은 Google Inc.의 자동번역엔진과 같이, 시판되고 있는 자동번역엔진을 이용할 수 있다. 또한, 소번역문 및 번역문 생성유닛(130)은 번역 메모리 매칭 유닛(131) 및 자동번역유닛(132)을 중첩적으로 적용할 수 있다. 소번역문 및 번역문 생성유닛(130)은 자동번역유닛(132)에서 자동번역을 통해 모든 소번역단위들에 대해 소번역문들을 생성한 후 번역 메모리 매칭 유닛(131)를 검색하고 매칭되는 번역메모리 또는 소번역메모리가 있는 경우 자동번역된 소번역문들에 우선하여 소번역메모리가 존재하는 소번역단위에 대해 소번역문들을 생성할 수 있다. The small translation and the translation generation unit 130 may include a translation memory matching unit 131 and an automatic translation unit 132, and further include a sentence translation processing unit 133. The translation memory matching unit 131 may search for the translation memory DB 170 and retrieve a translation memory matching the small translation units to generate a small translation. The small translation units which failed to load the translation memory because there is no matching translation memory are automatically translated through the automatic translation unit 132 (see FIG. 3G). The automatic translation unit 132 can use a commercially available automatic translation engine, such as an automatic translation engine of Google Inc. In addition, the small translation and the translation generation unit 130 may apply the translation memory matching unit 131 and the automatic translation unit 132 overlapping. The small translation and the translation generation unit 130 generate the small translations for all the small translation units through the automatic translation in the automatic translation unit 132 and then search for the translation memory matching unit 131 and match the translation memory or the small. If there is a translation memory, the small translations may be generated for the small translation unit in which the small translation memory exists in preference to the automatically translated small translations.
본 발명의 일 실시예에서, 소번역단위 및 번역어순 결정, 및 소번역문 및 번역문장 생성은 소번역단위 및 번역어순 결정유닛(120)에서 원문문장을 소번역단위들로 한번씩 분할하고 번역어순 순번을 결정할 때마다 소번역문 및 번역문 생성유닛(130)에서 분할된 복수의 소번역단위들에 대해 소번역문들을 생성하고 이들 소번역단위들에 지정된 번역어순 순번에 따라 정렬하여 전체 번역문장을 생성할 수 있다. 소번역문 및 번역문 생성유닛(130)에서 소번역문의 생성 및 번역문장의 생성한 후 소번역단위들, 번역어순 순번, 생성된 소번역문들, 및 번역문장을 사용자 인터페이스 유닛(140)으로 송신한다. 인터페이스 유닛(140)의 표시 유닛(141)은 결정된 소번역단위들 및 이들 각각에 지정된 순번, 그리고 각 소번역단위들에 대한 소번역문 및 이들 소번역문을 상기 소번역단위들 각각에 지정된 번역어순 순번대로 정렬하여 완성된 번역문장을 사용자에게 표시한다. 사용자가 분할된 소번역단위들 및 분할된 소번역단위에 대해 지정된 번역어순 순번을 입력 유닛(142)으로 수정하여 입력하면, 상기 입력된 소번역단위 및 번역어순 순번, 소번역문을 소번역단위, 번역어순 순번, 소번역문으로 저장할 수 있다. 사용자는 자신이 원문문장을 소번역단위들로 분할하고 번역어순 순번을 결정할 때마다 각 소번역단위들에 대해 번역된 소번역문들 및 번역문장을 확인하면서, 소번역문들의 추가적인 분할을 결정할 수 있다. 또한, 사용자가 원문문장을 소번역단위들로 분할할 때마다 번역어순 매칭 유닛(121)은 각 소번역단위들에 대해 통해 번역어순패턴 DB(160)를 검색할 수 있고, 사용자는 매칭된 번역어순 순번에 따라 소번역문 및 번역문 생성 유닛(130)을 통해 생성된 소번역문들 및 상기 소번역문들을 정렬한 번역문장을 확인하면서, 추가적으로 분할할 것인지, 또는 더이상 분할하지 않고, 번역된 소번역문 및 번역문을 이용할 것인지 결정할 수 있다.  In one embodiment of the present invention, the small translation unit and the translation order determination, and the small translation and the translation sentence generation is divided into small translation units and the original text sentence once in the small translation unit and translation language order determination unit 120 Whenever the decision is made, small translations may be generated for the plurality of small translation units divided by the small translation and the translation generation unit 130, and the total translation sentences may be generated by sorting according to the translation order assigned to the small translation units. have. After generating the small translation and generating the translated sentence, the small translation unit and the translated sentence generation unit 130 transmit the small translation units, the order of translation, the generated small translations, and the translated sentence to the user interface unit 140. The display unit 141 of the interface unit 140 determines the small translation units and the order designated to each of them, and the small translations for each of the small translation units and the translation order assigned to each of the small translation units. Sort them as they are, and display the completed translated sentence to the user. When the user modifies and inputs the divided translation order and the translation order specified for the divided small translation unit into the input unit 142, the small translation unit and the translation order, the small translation unit, You can save the translations in descending order. Whenever the user divides the original text into small translation units and determines the translation order, the user can determine the further division of the small translations, checking the translated small translations and the translated sentences for each small translation unit. In addition, whenever the user divides the original text into small translation units, the translation order matching unit 121 may search the translation order pattern DB 160 for each small translation unit, and the user may match the translated translation. According to the order of the translations, the small translations and the translations generated through the translation unit 130 and the translation sentences in which the small translations are sorted, are further divided, or no longer divided, the translated small translations and translations You can decide whether to use.
문미번역어 처리 유닛(133)은 번역어순패턴 DB(160)의 번역어순패턴과 매칭되는 원문의 소번역단위들이 번역어순패턴에 포함된 문미번역어를 상기 존재하는 소번역단위들의 번역문인 소번역문들 문미에 부가하고 번역어순에 맞는 조사, 어미 또는 전치사 번역어 등을 처리하여 번역문장을 완성한다. 문미번역어 처리 유닛(133)은 번역어순 규칙의 분할 쌍의 전치 품사 및 후치 품사의 관계에 따라 번역어순 순번을 결정할 때 분할된 소번역단위들에 지정되는 조사, 어미 및 전치사 번역어 등을 생성할 수 있다. 예를 들어 명사-동사의 분할 쌍이면 전치 품사인 명사가 속하는 앞 소번역단위에 주격조사를 생성하고 뒤 소번역단위에 동사 종결어미를 생성하여 지정할 수 있다. 또한, 명사-형용사 또는 명사-관계사의 분할 쌍에서는 역순 어순으로 형용사 또는 관계사가 명사를 수식하게 되는 것이 미리 결정될 수 있으므로, 후치품사인 관계사가 속하는 뒤 소번역단위에 '~는' 또는 '~하는' 등의 수식 어미를 생성하여 지정할 수 있다. The sentence translation unit 133 may include the small translation sentences, which are translations of the small translation units, wherein the small translation units of the original text matching the translation sequence pattern of the translation sequence pattern DB 160 are included in the translation sequence pattern. In addition to this, the translation sentence is processed by processing a search, a mother word, or a preposition translation in accordance with the translation order. The sentence translation processing unit 133 may generate a survey, a mother and a preposition translation, etc. assigned to the divided sub-translation units when determining the translation word order according to the relationship between the pre-part and the post-part of the division pair of the translation order rule. have. For example, if a noun-verb division pair is used, a main search may be created in the first sub-translation unit belonging to the noun that is a prepositional part, and a verb ending ending may be created in the later sub-translation unit. In addition, in the divided pair of noun-adjectives or noun-affiliates, it may be predetermined that adjectives or affiliates modify the nouns in reverse sequential order. You can create and specify a formula ending such as'.
문미 번역어 처리유닛(131)은 또한 각 소번역단위들에 대해 자동번역하고 번역어순 순번에 따라 정렬한 후 각 소번역단위들에 대해 자동번역으로 생성된 소번역문들 간에 필요한 수식어를 번역문장의 문법 및 맞춤법에 맞게 수정하고 추가하여 번역문장을 완성한다. The grammar translation unit 131 also automatically translates each of the small translation units, sorts them according to the translation order, and then converts the necessary qualifiers between the small translations generated by the automatic translation for each small translation unit. And modify and add spelling to complete the translation sentence.
소번역문 및 번역문 생성 유닛(130)은 소번역단위들에 대한 소번역문들 및 번역문장을 출력한다. 소번역문 및 번역문 생성 유닛(120)으로부터 출력된 소번역문들 및 번역문장은 사용자 인터페이스 유닛(140)으로 입력되어 사용자에게 디스플레이 유닛(141)을 통해 디스플레이 되고, 사용자는 복수의 소번역단위들에 대한 소번역문들과 완성된 번역문장에 대해 소번역문들을 입력 유닛(142)으로 수정하여 입력하면, 소번역메모리 생성 유닛(130)을 통해 소번역단위들 및 상기 소번역단위들에 대한 소번역문의 쌍을 포함하며 각 소번역단위들 간의 번역어순 순번을 포함하는 번역메모리를 번역메모리 DB(170)에 저장한다. The small translation and translation generation unit 130 outputs the small translations and the translated sentences for the small translation units. The small translations and the translated sentences outputted from the small translation and the translation generation unit 120 are input to the user interface unit 140 and displayed to the user through the display unit 141, and the user is provided with respect to the plurality of small translation units. When the small translations and the small translations are corrected and input into the input unit 142, the small translation units and the small translation sentences for the small translation units are input through the small translation memory generation unit 130. It includes and stores the translation memory including the translation order number between each small translation unit in the translation memory DB (170).
번역메모리 DB(170)에 저장되는 소번역단위들 및 소번역문들의 쌍 및 번역어순 정보를 포함하는 번역메모리는 소번역단위들 각각에 대한 소번역문들의 대응관계가 소단위 대응관계표시자들에 의해 표시되고, 상기 소단위 대응관계표시자들의 소번역단위위치정보들은 상기 소번역단위들 각각의 원문문장 내 소단위로 세분화된 위치를 지시할 수 있고, 상기 소단위 대응관계표시자들의 소번역문위치정보들은 상기 소번역문들 각각의 번역문장 내 위치를 지시할 수 있다. In the translation memory including the small translation units, pairs of small translations, and translation word order information stored in the translation memory DB 170, the correspondence of the small translations for each of the small translation units is indicated by the subunit correspondence indicators. The sub-translation unit position information of the sub-unit correspondence indicators may indicate the location subdivided into sub-units in the original text of each of the sub-translation unit, and the sub-translational position information of the sub-unit correspondence indicators Each of the translations may indicate a location within the translation sentence.
번역어순패턴 생성 유닛(150)은 또한 번역어순패턴 DB(160)에 저장된 번역어순패턴들을 기초로 확장될 수 있는 확장 번역어순패턴을 생성하여 번역어순패턴 DB(160)에 저장할 수 있다. 확장된 번역어순패턴은 원문문장의 형태소 분석된 품사들의 형태소열과 매칭시켜서 분할패턴 및 번역어순 순번을 지정하는데 사용될 수 있다. 확장된 번역어순패턴에 대해서는 도 4a 내지 도 4e에서 설명한다. The translation word order pattern generating unit 150 may also generate an extended translation word order pattern that may be extended based on the translation word order patterns stored in the translation word order pattern DB 160 and store the translation word order pattern DB 160. The extended translation order pattern can be used to specify the division pattern and the translation order by matching the morphological sequences of the stemmed parts of the original sentence. The expanded translation order pattern will be described with reference to FIGS. 4A to 4E.
소번역문 및 번역문 생성 유닛(130)은 번역메모리 DB(170)로부터 소번역단위 및 소번역문의 쌍인 소번역메모리들을 분석하여 확장 번역메모리를 생성하고 번역메모리 DB(170)에 저장한다. The small translation and the translation generation unit 130 analyze the small translation memories that are pairs of the small translation unit and the small translation from the translation memory DB 170 to generate an extended translation memory and store the translation memory in the translation memory DB 170.
도 2b는 본 발명에 따른 복문 분할 어순 규칙(6)의 일 실시예를 도시하고, 도 2c는 본 발명에 따른 단문 분할 어순 규칙(6)의 일 실시예를 도시한다. 본 발명에서 분할 어순 규칙은 분할과 동시에 전치 품사 및 후치 품사에 따라 어순이 결정되는 경우, 반드시 복문 분할 어순 규칙을 우선적으로 먼저 적용하여서 원문문장을 소번역단위들로 분할하고, 이후 단문 분할 어순 규칙을 적용하는 순서로 이루어진다. 이는 복문을 단문으로 분할하는데 필요한 절과 절을 연결하는 등위 접속사를 먼저 분할하고 각각 단문들에 대해 분할하여야 후속적으로 분할되는 각 단문들 내의 소번역단위들 간의 번역 어순이 복문인 원문문장 전체 번역어순이 되기 때문이다. 또한, 명사 및 동사의 분할쌍은 단문 분할 어순 규칙들 중에서 우선적으로 먼저 분할되도록 정해진다. 단문 내에서 주어부는 번역어순 순번이 술어부보다 앞서 번역되는 번역 어순을 갖기 때문이다. FIG. 2B shows one embodiment of a compound sentence division word order 6 according to the invention, and FIG. 2C shows an embodiment of a sentence division word order rule 6 according to the present invention. In the present invention, when the word order is determined according to the prepositional part and the postpartum part at the same time as the division, the original sentence is first divided into small translation units by first applying the compound sentence division word order first, and then the fragment division word order rule. It takes place in the order of applying. It is necessary to first divide the clauses and clauses connecting the clauses to divide the sentence into short sentences, and then split them for each of the short sentences. Because it is word order. Also, the division pairs of nouns and verbs are determined to be divided first among the short division word order rules. This is because the predicate in the short sentence has a translation order that is translated before the predicate.
분할 어순 규칙은 원문문장의 형태소열에서 2개의 소번역단위로 분할되는 전치품사 및 후치품사의 분할 쌍 리스트를 포함한다. 분할 규칙에 지정된 전치 품사 및 후치 품사 사이가 분할되어 전치 품사는 분할되는 2개의 소번역단위들 중 앞 소번역단위의 마지막 품사가 되고 후치품사는 분할되는 2개의 소번역단위들 중 뒤 소번역단위의 첫번째 품사가 된다. 또한, 분할 어순 규칙은 소번역단위 분할패턴으로 분할하는 규칙이외에, 분할 쌍의 전치 품사 및 후치 품사에 따라 2개의 소번역단위들로 분할될 경우 분할된 후 전치 품사가 속하는 앞 소번역단위와 후치 품사가 속하는 뒤 소번역단위 간의 번역어순에 대한 정보 및 분할 쌍 간의 우선순위 정보를 포함할 수 있다. The divisional word order rule includes a list of split pairs of pre-parts and post-parts that are divided into two sub-translational units in the morphological sequence of the original text. The pre-partial part of speech is divided between the pre-partial and post-partial parts specified in the division rule, and the pre-partial part is the last part of the previous sub-translational unit among the two sub-translational units that are divided, and the post-partial unit is the second sub-translational unit of the divided sub-translational units. Become the first part of speech. In addition, the division word order rule is divided into two sub-translational units according to the pre-part and post-part-of-speech of division pairs. After the part-of-speech belongs to the information on the translation order between the small translation unit and may include the priority information between the pairs of division.
도 2d는 도 2a의 원문문장에 대해 형태소 분석하고 품사를 태깅한 결과(20)를 도시한다. 원문문장에 포함된 부호(좌괄호, 우괄호, 쉼표, 및 마침표 등)는 형태소 분석에서 품사들과 동일하게 취급되어 하나의 품사처럼 태깅된다. '(' 및 ')'는 품사 태킹시 좌괄호 및 우괄호로 품사 태깅되며 PL 및 PR로 표시되고, 쉼표 및 마침표은 구두점으로 태킹되며 SY로 표시된다. FIG. 2D shows the result 20 of stemming and tagging the part-of-speech for the original text of FIG. 2A. The symbols included in the original sentence (left brackets, brackets, commas, and periods) are treated the same as parts of speech in morphological analysis and tagged as a part of speech. '(' And ')' are part-of-speech tags in left and right parentheses and are marked with PL and PR, and commas and periods are tagged with punctuation and marked with SY.
본 발명의 일 실시예에서, 분할 어순 결정 규칙은 그 사이가 분할될 2 개의 특정 품사들인 전치 품사 및 후치 품사의 분할 쌍들에 대한 리스트를 포함한다. 분할 어순 결정 규칙에 따라 원문을 복수의 소번역단위들로 분할하고 각 소번역단위들에 번역어순을 결정하는 것은 상기 분할 어순 결정 규칙의 분할 쌍과 일치하는 품사의 배열이 있으면 원문문장의 형태소열 중 일치하는 2개의 특정 품사들 사이를 2 개의 소번역단위들로 분할하고, 상기 분할 어순 결정 규칙의 분할 쌍에 지정된 순차 번역어순 또는 역순 번역어순에 따라 상기 분할된 2 개의 소번역단위들의 번역어순 순번을 결정하는 방식으로, 상기 형태소 분석 및 품사 태깅 유닛(110)을 통해 형태소 분석 및 품사 태깅한 원문문장을 구성하는 일련의 품사 및 부호의 형태소열을 문두부터 문미까지 복수의 소번역단위들로 분할하고, 상기 분할 어순 결정 규칙의 분할 쌍에 지정된 순차 번역어순 또는 역순 번역어순에 따라 상기 분할된 2 개의 소번역단위들의 번역어순 순번을 결정한다. 분할쌍 리스트에 포함된 전치 품사 및 후치 품사의 쌍들과 매칭되는 원문의 형태소열을 문두에서 문미까지 분할할 때까지 원문 전체에 대해 수행된다. In one embodiment of the present invention, the division word order decision rule includes a list for the partitioning pairs of the two specific parts of speech that are to be divided between the pre-part of speech and the post-part of speech. Dividing the original text into a plurality of small translation units according to the division word order determination rule and determining the translation word order in each sub-translation unit may include morphological sequence of the original sentence if there is an arrangement of parts of speech matching the division pair of the division word order determination rule. A translation between the two matching subparts of speech into two sub-translational units, and according to the sequential or reverse translation order specified in the division pair of the division word order determination rule. By determining the order number, the morphological sequence of the parts of speech and the code constituting the original sentence morphological analysis and part-of-speech tagging through the morphological analysis and the part-of-speech tagging unit 110 are converted into a plurality of small translation units from the beginning to the end. The divided two sub-translation groups according to the sequential translation order or the reverse translation order specified in the division pair of the division word order determination rule. Determine the order of translation of the above. The morphological sequence of the original text matching the pairs of pre-part and post-partial parts included in the split-pair list is performed on the entire text until the text is split from the front to the end.
도 2b는 5개의 분할 쌍의 리스트를 포함하고, 도 2c는 14개의 분할 쌍의 리스트를 포함한다. 도 2b 및 도 2c에서 순차어순 열에 체크되지 않은 분할 쌍은 역순 번역어순을 지시한다. 순차 번역어순에 따라, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정되고, 분할된 앞뒤 소번역단위들이 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키고, 역순 번역어순에 따라, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 분할된 앞뒤 소번역단위들이 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시킨다.FIG. 2B includes a list of five partition pairs, and FIG. 2C includes a list of fourteen partition pairs. In FIG. 2B and FIG. 2C, the division pairs not checked in the sequential order column indicate the reverse translation order. According to the sequential translation order, the sequential order between the divided back and forth sub-translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation order, before being divided. If the translation order number of a small translation unit other than the small translation unit has a sequence number equal to or greater than the increased sequence number, the translation sequence number assigned to the other translation units having the same or larger sequence number is 1; Determine the reverse order in descending order of the two sub-translational units that are divided, according to the reverse translation order. 1 is determined by the incremented translation order and the original translation order, and are not divided subtotal translation units. If the translation sequence number of another small translation unit has a sequence equal to or greater than the increased sequence number, the translation sequence number assigned to the other translation units having the same or greater sequence number is increased by one.
도 2b은 복문 분할 규칙의 일 예로써, 분할 쌍에 대한 번역어순 순번을 나타낸다. 분할 어순 규칙에서 접속사(AND/OR)는 모든 등위 접속사를 나타내는 것이 아니라 절과 절을 연결하는 등위 접속사만으로 한정되도록 내부적으로 프로그래밍될 수 있다. 즉, AND/OR 등위 접속사 앞뒤의 품사 중에 원문문장의 형태소열 중 최초 AND/OR가 있는 경우 최초 AND/OR 이전에 동사가 존재하고, 최초 AND/OR 이후부터 두번째 AND/OR를 포함하는 다음 접속사 또는 원문문장 끝까지 사이에 동사가 존재하는 경우만을 접속사 'AND/OR'로 식별하도록 프로그래밍될 수 있다. 만약 AND/OR 이전에 동사가 존재하지 않거나 AND/OR 이후부터 다음 AND/OR를 포함하는 다음 접속사까지 또는 원문문장 끝까지 사이에 동사가 존재하지 않으면 분할 어순 규칙의 접속사(AND/OR)가 아닌 것으로 프로그래밍할 수 있다. 접속사(AND/OR)-모든품사 분할 쌍은 절과 절을 연결하는 접속사(AND/OR)와 모든 품사 사이를 분할한다. 복문 분할 규칙에 모든품사-접속사(AND/OR) 및 접속사(AND/OR)-모든품사의 분할 쌍이 분할 규칙에 포함되면 절과 절을 연결하는 AND/OR는 앞뒤로 분할되어 단독으로 소번역단위를 구성할 수 있다. 2B is an example of a compound sentence segmentation rule, and illustrates a translation order sequence number for a segmentation pair. In the divisional word order rule, the conjunctions (AND / OR) can be internally programmed to represent only concatenations connecting clauses and clauses, rather than representing all equivalence conjunctions. That is, if the first AND / OR in the morphological sequence of the original sentence is found in the parts of speech before and after the AND / OR equivalence conjunction, the verb exists before the first AND / OR, and the next conjunction including the second AND / OR after the first AND / OR. Or it can be programmed to identify the conjunction 'AND / OR' only if a verb exists between the end of the original sentence. If the verb does not exist before AND / OR, or after the AND / OR until the next conjunction containing the next AND / OR, or until the end of the original sentence, it is not a conjunction (AND / OR) in the division word order rule. Can be programmed AND / OR-all parts-of-speech division A pair splits between clauses (AND / OR) and all parts-of-speech. If the segmentation rule includes a segmentation pair of all parts of a part-and-or (AND / OR) and all parts of a part of a conjunction (AND / OR) and all parts of a part, the AND / OR linking clauses to clauses is split back and forth to form a subtranslation unit alone. Can be configured.
도 2c는 단문 분할 어순 규칙의 일 예로써, 분할 쌍에 대한 번역어순 순번을 나타낸다. 도 2c의 분할 어순 규칙은 단문 분할어순 규칙의 일 실시예이고, 단문 분할 규칙은 순차 번역과 역순 번역이 되는 분할 쌍이 존재할 경우 이들 간의 선후 어순 결정에 따라 번역어순에 차이가 날 수 있기 때문에, 각 분할쌍들 간에 우선 순위를 미리 정해놓을 수 있다. 예를 들어 명사-동사, 과거분사-동사, 현재분사-동사, 명사-대명사는 원문문장의 주어부와 서술어부를 우선적으로 분할하도록 형태소 분석된 원문문장의 형태소열을 분할할 때 분할 쌍들 간에 우선순위를 정할 수 있다. 우선 순위가 정해지지 않은 분할 쌍들은 원문문장의 형태소 분석된 품사들의 형태소열의 문두부터 문미까지 순차적으로 매칭하여 분할 어순 규칙에 존재하는 분할 쌍에 따라 분할하고 번역어순 순번을 결정한다. 만약 우선 순위가 있는 분할 쌍이 있으면, 문두부터 문미까지의 순차적인 분할보다 우선적으로 분할 어순 규칙이 적용된다. 예를 들어, 접속사-모든 품사 및 명사-동사에 우선순위 1, 2가 지정되어 있으면 접속사-모든 품사의 분할 쌍이 명사-동사의 분할 쌍보다 뒤에 위치되더라도, 먼저 접속사-모든 품사를 먼저 분할하고 명사-동사가 다음으로 분할되고, 명사-동사 분할 쌍이 분할 쌍 리스트에 있는 다른 분할 쌍보다 먼저 분할된다. 예를 들어, 명사-동사, 과거분사-동사, 현재분사-동사, 명사-대명사의 분할 쌍 중 우선순위는 1, 2, 3, 4로 정할 수 있고, 나머지 다른 분할 쌍들에 대해서는 원문의 형태소열의 문두부터 문미까지 분할어순 규칙에 따라 분할이 수행된다. 분할 어순 규칙의 전치 품사 및 후치 품사는 언어에 따라 규칙이 달라질 수 있으며, 번역자가 분할하고자 하는 분할 쌍을 분할 쌍 리스트에 추가하거나 삭제함으로써 분할되는 소번역단위들을 조정할 수 있다. FIG. 2C is an example of a short division word order rule, and illustrates a translation word order number for a division pair. The split word order rule of FIG. 2C is an embodiment of the short word order rule, and since the short word split rule may have a difference in translation word order according to the order of word order between them when there are split pairs for sequential translation and reverse translation, Priority can be set between the pairs of partitions. For example, noun-verb, past participle-verb, present participle-verb, and noun-pronoun give priority to division pairs when dividing the morphological sequence of the original sentence that has been parsed to prioritize the subject part and the narrative part of the original sentence. You can decide. The unpaired partitioned pairs are sequentially matched from the beginning to the end of the morphological sequences of the stemmed parts of the original sentence, and divided according to the partitioned pairs in the partitioned word order, and the translation order is determined. If there is a partitioning pair with priority, the partitioning word order rule takes precedence over the sequential partitioning from the beginning to the end. For example, if priorities 1 and 2 are assigned to a conjunction-all part-noun and noun-verb, even if the division pair of the conjunction-all part-of-speech is located after the divisional pair of the noun-verb, the noun-all part-of-speech must first be divided and the nouns first. The verb is split into the next, and the noun-verb split pair is split before the other split pairs in the split pair list. For example, the priority among divisional pairs of noun-verb, past participle-verb, present participle-verb, and noun-pronoun can be set to 1, 2, 3, or 4, and the rest of the other division pairs Partitioning is done according to the rules of division word order from beginning to end. The pre- and post-part-of-speech parts of the divisional word order rule may vary according to the language, and the translator may adjust the sub-translational units that are split by adding or deleting the division pair to be divided into the division pair list.
문장에 대해 태깅된 품사들 중 하나가 등위 접속사이면, 분할된 소번역단위들의 번역어순의 순번은 순차로 증가되도록 결정되고, 이 경우 등위 접속사는 전치 품사 또는 후치품사가 되도록 분할 어순 규칙이 정해질 수 있고, 순차로 증가되도록 결정된다는 것은 분할되는 소번역단위의 앞쪽에 위치된 소번역단위의 소번역문이 원래 가지고 있던 번역어순 순번을 가지고 소번역단위의 뒤쪽에 위치된 소번역단위의 소번역문이 상기 번역어순 순번에 1이 증가된 번역어순 순번을 갖도록 결정되는 것을 의미한다. If one of the parts of speech tagged for a sentence is a conjunctive conjunction, the order of translations of the divided subtranslation units is determined to be sequentially increased, in which case the segmentation order is determined so that the conjunctive conjunction is a prepositional or postpartum. The small translation of the small translation unit located at the rear of the small translation unit may have a translation order of the original translation of the small translation unit located in front of the divided translation unit. It means that it is determined to have an increased translation order by 1 in the translation order.
또한, 원문문장에 대해 태깅된 품사들 중 하나가 종속 접속사이면, 후치 품사가 종속 접속사가 되도록 분할 어순 규칙이 결정될 수 있고, 분할된 소번역단위들의 번역어순의 순번은 역순으로 감소되도록 결정될 수 있다. 역순으로 감소되도록 결정된다는 것은 분할되는 소번역단위의 뒷쪽에 위치된 소번역단위가 분할전 소번역단위가 원래 가지고 있던 번역어순 순번을 가지고 소번역단위의 앞쪽에 위치된 소번역단위가 상기 분할전 소번역단위가 가지고 있던 번역어순 순번보다 1이 증가된 번역어순 순번을 갖도록 결정되는 것을 의미한다. Also, if one of the parts of speech tagged for the original text is a dependent conjunction, the divisional word order rule may be determined such that the post-partial speech is a dependent conjunction, and the order of translation word order of the divided sub-translation units may be determined to be reduced in reverse order. . It is determined that the order of decreasing in reverse order is that the small translation unit located at the rear of the divided small translation unit has the translation order number originally possessed by the small translation unit, and the small translation unit located at the front of the small translation unit before the division It means that it is determined to have the translation order number increased by 1 more than the translation order number that the small translation unit has.
단문 분할 어순 규칙에서 원문언어가 영어의 경우 명사는 예를 들어 전치 품사가 되어 명사 뒷부분이 앞쪽 소번역단위의 문미에 포함되도록 분할된다. 이는 명사 뒤를 기준으로 분할하면 분할 어순 규칙에 의해 기계적 분할 후 번역어순을 기계적으로 결정할 수 있기 때문이다. In the original word division rule, if the source language is English, the nouns are, for example, prepositional parts, and are divided so that the latter part of the noun is included in the suffix of the preceding sub-translational unit. This is because by dividing on the basis of the noun, the translation order can be mechanically determined after the mechanical division by the division order rule.
분할 어순 규칙은 번역자의 필요에 따라 분할 쌍의 리스트를 증감시킬 수 있다. 분할 어순 규칙의 분할 쌍의 리스트가 늘어나면 원문문장을 구성하는 소번역단위들의 갯수가 증가되어 번역어순으로 정렬해야하는 번역어순 순번이 증가하고 자동번역해야하는 소번역단위의 어절 수가 감소되어 자동번역이 정확해질 수 있으나, 번역자가 번역이 제대로 이루어졌는지 검수할 때, 검수 시간이 증가될 수 있다. 분할 쌍의 리스트가 감소하면 소번역단위들의 갯수가 감소하여 이에 지정되는 번역어순 순번의 수도 감소된다. The split word order rule can increment or decrement the list of split pairs as needed by the translator. As the list of split pairs in the split word order rule increases, the number of sub-translational units constituting the original sentence increases, so that the number of translations that need to be sorted in the order of translation increases, and the number of words of the small-translation units that need to be automatically translated decreases. This can be done, but the inspection time can be increased when the translator checks that the translation is done correctly. As the list of split pairs decreases, the number of small translation units decreases and the number of translation order assigned to it decreases.
도 3a 내지 도 3h는 기계적 분할 및 번역어순 결정 유닛(122)에서 도 2a의 원문(1)을 도 2b 및 도 2c의 분할 어순규칙에 의해 사용자가 아래의 원문문장을 번역 소단위인 소번역단위로 프로그램에 의해 기계적으로 분할한 후 분할된 소번역단위들 간의 번역어순의 순번을 결정하고 각 소번역단위들을 번역하는 일련의 과정에 대한 일 실시예를 도시한다. 3A to 3H show the original text 1 of FIG. 2A in the mechanical segmentation and translation word order determining unit 122 in accordance with the division order rules of FIGS. 2B and 2C. An embodiment of a series of processes of mechanically dividing by a program and then determining the order of translation between the divided small translation units and translating each small translation unit is shown.
A second operating BZ2 state can be a operating state at which a very small amount of fuel (in contrast to operating state BZ1) is metered, especiallly for catalyst heating.A second operating BZ2 state can be a operating state at which a very small amount of fuel (in contrast to operating state BZ1) is metered, especiallly for catalyst heating.
특정 품사들 간의 미리 결정된 분할 어순 결정 규칙은 상기 전치 품사 및 후치 품사의 분할쌍에 대해 분할될 앞뒤 소번역단위들이 올림차순의 순차 번역 어순인지 내림차순의 역순 번역어순인지를 지정한 분할어순 정보를 더 포함하고, 상기 2 개의 소번역단위들로 분할할 때마다 상기 분할 어순 결정 규칙의 분할 쌍의 분할어순 정보에 따라 2 개의 소번역단위들의 번역어순 순번을 결정한다. 상기 순차 번역어순에 따라, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정된다. 예를 들어, 분할된 2 개 소번역단위들 간의 번역어순 순번은 분할되기 전 원문문장 또는 소번역단위가 원래 가지고 있던 번역어순 순번과 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정된다. 예를 들어, 도 3a에서 분할되기 전 원문문장(1)은 도 3b에서 도 2c의 단문 분할어순 규칙(6)에 따라 명사-동사 사이를 분할하면, 분할 쌍의 분할된 2 개 소번역단위들(30, 31) 간의 번역어순 순번은 원문이 원래 가지고 있던 번역어순 순번 1과 상기 번역어순 순번에 1이 증가된 번역어순 순번(2)으로 결정된다. 또한, 도 3b에서 2번 소번역단위(31)는 번역어순 순번 2이지만, 2번 소번역단위(31)에 대해 단문 분할어순 규칙(6)에 따라 모든품사-관계사 사이를 분할하면(at which는 형태소 분석시 하나의 관계사처럼 분석되도록 미리 정해짐), 도 3c에서 분할된 2 개 소번역단위들(32, 33) 간의 번역어순 순번은 분할 전 소번역단위(31)가 원래 가지고 있던 번역어순 순번 2와 상기 번역어순 순번 2에 1이 증가된 번역어순 순번 3으로 결정되며, 도 2c의 분할어순 규칙에서 모든품사-관계사 분할 쌍의 번역어순은 역순 어순으로 정해져 있기 때문에, 분할되는 2개의 소번역단위들(32, 33) 중 뒤의 소번역단위(33)이 번역어순 순번 2로 결정되고 앞의 소번역단위(32)의 번역어순 순번은 3이 된다.  The predetermined division word order rule between specific parts-of-speech may further include subordination order information specifying whether the front and rear sub-translation units to be divided for the divisional pair of the pre-partial and post-partial parts are in ascending translation order or descending translation order. Each time the partition is divided into the two small translation units, the translation order of the two small translation units is determined according to the partition order information of the partitioning pair of the partition word order determination rule. According to the sequential translation order, the sequential order between the divided back and forth small translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation order. For example, the translation order number between two divided sub-translation units is determined by the translation order number originally possessed by the original text or the sub-translation unit and the translation order number increased by 1 to the translation order. For example, the original sentence 1 before splitting in FIG. 3A splits between nouns-verbs according to the short segmentation ordering rule 6 of FIG. 3B to FIG. The translation order number between (30, 31) is determined by the translation order number 1 that the original text originally had and the translation order number 2 increased by 1 to the translation order number. In addition, in FIG. 3B, the second small translation unit 31 is the translation order sequential number 2, but when the second small translation unit 31 is divided between all parts of speech and related companies according to the short sentence division order rule (6) (at which Is pre-determined to be analyzed as a related company in morphological analysis), and the translation order between the two small translation units 32 and 33 divided in FIG. 3C is the translation order originally possessed by the small translation unit 31 before the division. It is determined by the translation order number 3, which is increased by 1 to the translation number order 2 and the translation word order 2, and the translation word order of all the parts-related part division pairs in the division word order rule of FIG. 2C is determined to be the reverse order. The later small translation unit 33 of the translation units 32, 33 is determined to be the translation order sequential 2, and the translation order of the preceding small translation unit 32 is 3.
또한, 분할되는 2개의 소번역단위들이 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 번역어순 순번과 동일하거나 상기 증가된 번역어순 순번보다 큰 순번을 가지면 상기 동일하거나 증가된 순번보다 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 각각 하나 증가시킨다. 예를 들어, 도 3d의 3번 소번역단위(34)를 분할 어순 규칙의 모든품사-좌괄호의 분할 쌍에 의해 분할하면, 도 3e의 3번 소번역단위(36) 및 4번 소번역단위(37)로 분할되며, 3번 소번역단위(36) 및 4번 소번역단위(37)의 번역어순 순번은 분할 전 소번역단위(34)가 가졌던 순번 2와 1이 증가된 순번 3을 갖게 되며, 분할 어순 규칙에서 모든품사-좌괄호의 분할 쌍의 어순이 역순으로 정해져 있으므로, 3번 소번역단위(36) 및 4번 소번역단위(37)의 번역어순 순번은 각각 순번 3, 순번 2가 된다. 이 때 분할전 도 3d에서 분할되는 소번역단위(34)가 아닌 2번 소번역단위(32) 및 4번 소번역단위(35)의 번역어순 순번은 각각 4 및 3이었으나, 분할 후 도 3e의 3번 소번역단위(36) 및 4번 소번역단위(37) 중 3번 소번역단위(36)의 순번 3과 같거나 큰 순번을 가지므로, 각각 1이 증가된 3번 4번 소번역단위(36, 37)은 각각 4, 5가 된다. In addition, if the translation order number of the other small translation units other than the two sub-translation units that are divided has the same order as the increased translation order or the number greater than the increased translation order Increment the translation sequence number assigned to the other sub-translation units. For example, if the third small translation unit 34 of FIG. 3D is divided by all the parts-sequence pairs of parentheses in the division word order rule, the third small translation unit 36 and the fourth small translation unit of FIG. The translation order number of the small translation unit (36) and the fourth small translation unit (37) has (3) and the order number 2 and 1, which had the small translation unit (34) before the division, increased to 3 Since the order of division of all parts-left brackets in reverse division order is defined in reverse order, the translation order of the smallest translation unit (36) and the fourth small translation unit (37) is sequence number 3 and sequence number 2, respectively. Becomes At this time, the translation order number of the small translation unit 32 and the fourth small translation unit 35 rather than the small translation unit 34 divided in FIG. 3D before the division was 4 and 3, respectively. 3 small and 4 small translation units (36) and 4 small translation units (37) have the same or greater order than 3 of the small translation unit (36). (36, 37) becomes 4 and 5, respectively.
도 3g와 같이, 정해진 분할어순규칙의 분할 쌍 리스트에 따라 소번역단위들 및 각 소번역단위들에 번역어순 순번이 결정되면, 도 3g와 같이, 각 소번역단위들에 대한 번역문인 소번역문들(42)의 생성을 시작한다. 일 실시예로, 소번역문들(42)는 자동번역에 의해 생성된다. 다른 실시예에서, 소번역단위들과 매칭되는 번역메모리를 검색하여 소번역문(42)을 생성할 수 있다. 또 다른 실시예에서, 도 3b 내지 도 3h의 소번역단위로의 분할과 각 소번역단위들에 번역어순 순번이 결정될 때마다, 소번역문 및 번역문 생성유닛(130)에서 소번역문 및 번역문장 생성이 수행될 수 있다. 사용자는 도 3b 내지 도 3h의 소번역단위로의 분할과 각 소번역단위들에 번역어순 순번이 결정될 때마다 번역문장을 보면서 추가적인 분할 여부를 결정할 수 있다. As shown in FIG. 3G, when the translation order is determined in the small translation units and the respective small translation units according to the division pair list of the predetermined division word order, as shown in FIG. 3G, the small translations that are the translations for each of the small translation units. (42) Start of creation. In one embodiment, small translations 42 are generated by automatic translation. In another embodiment, the small translation 42 may be generated by searching for a translation memory matching the small translation units. In another embodiment, whenever the division into the small translation unit of FIG. 3B to FIG. 3H and the translation order are determined in each of the small translation units, the small translation and the translation sentence generation are generated in the small translation and translation generation unit 130. Can be performed. The user may determine whether to divide additionally by looking at the translation sentences whenever the division into the small translation units of FIGS. 3B to 3H and the translation order are determined in each of the small translation units.
도 3h는 문미 번역어 처리 유닛(133)을 통해 각 소번역문들의 문미 번역어들이 번역어순에 맞게 처리된 결과를 보여준다. 문미 번역어 처리 유닛(133)은 번역어순패턴에 포함된 소번역단위 분할패턴에 의해 소번역단위로 분할된 경우 소번역단위 분할패턴에 지정된 문미 번역어를 자동번역된 소번역문 또는 불러온 소번역문에 부가하고 중복되는 조사 어미, 전치사 번역어 등을 번역어순 순번에 맞게 삭제, 또는 수정한다. FIG. 3h shows the results of processing the translations of the translations of each of the small translations in the translation order through the translation translation processing unit 133. The literary translation processing unit 133 adds the translated sentence specified in the small translation unit division pattern to the automatically translated small translation or the imported small translation when the translation is divided into small translation units by the small translation unit division pattern included in the translation order sequence pattern. Delete or modify duplicate search endings, prepositional translations, etc. in order of translation order.
도 3i는 원문문장(1)이 단문 분할어순 규칙에 명사-동사, 모든품사-관계사의 분할 쌍만 적용되어 소번역문들 및 각 소번역문들에 번역어순 순번이 지정된 후, 각 소번역단위들에 대한 소번역문을 생성한 실시예이다. 3i shows that after the original sentence (1) is applied to only the noun-verb and all parts-related verbs in the short segmentation order, the translation order is assigned to the small translations and the respective small translations, and then for each small translation unit. An embodiment in which a small translation is generated.
3. 사용자 입력으로부터 번역어순패턴 데이터 생성3. Generation of translation order pattern data from user input
도 5a 내지 도 5n은 소번역단위 및 번역어순 결정유닛(120)에서 도 2a의 원문문장에 대해 입력 유닛으로부터 신호를 입력받아 원문문장을 번역 소단위인 소번역단위들로 분할하면서 원문문장에 대한 소번역단위들의 번역어순을 결정하기 위한 일 실시예를 도시한다. 5A to 5N show a small sentence unit for the original sentence, while receiving a signal from the input unit with respect to the original sentence of FIG. 2A in the small translation unit and the translation word order determination unit 120, and dividing the original sentence into small translation units which are translation subunits. One embodiment for determining the translation order of translation units is shown.
도 1의 사용자 인터페이스 유닛(140)에 포함된 입력 유닛(142)은 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는다. 입력유닛(142)은 도 5a의 원문문장 내 특정 위치(51) 및 순차 어순을 지정하는 제 1 신호를 입력받는다. 이에 따라, 도 5b에서와 같이, 특정 위치(51)을 기준으로 앞 소번역단위(30) 및 뒤 소번역단위(31)로 분할되고, 순차 어순에 의해 각각 1 및 2의 번역어순 순번이 결정된다. 입력 유닛(142)은 특정위치(51) 및 순차어순에 대한 정보를 예를 들어, '29-1'과 같이 생성하여, 소번역단위 및 번역어순 결정 유닛(120)으로 송신할 수 있다. 여기서 29는 원문문장의 문두에서 빈칸을 포함한 29번째 카운팅된 특정위치(51)를 나타내며, 1은 순차 어순을 나타낸다. '29-2'이면 2는 역순 어순을 나타낸다. The input unit 142 included in the user interface unit 140 of FIG. 1 includes specific location information in the original sentence or sub-translational unit and is arranged in descending order before and after the sub-translational units divided based on the specific position. A first signal indicating a sequential translation order, and a first order indicating a reverse translation order in descending order to the front and rear sub-translation units including specific position information in the original sentence or sub-translation unit and divided based on the specific position. 2 Receive a signal. The input unit 142 receives a first signal specifying a specific position 51 and a sequential word order in the original sentence of FIG. 5A. Accordingly, as shown in FIG. 5B, the first small translation unit 30 and the second small translation unit 31 are divided based on the specific position 51, and the translation order of 1 and 2 is determined by the sequential word order, respectively. do. The input unit 142 may generate the information about the specific position 51 and the sequential order as, for example, 29-1, and transmit the information to the small translation unit and the translation word order determination unit 120. Where 29 represents the 29th counted specific position 51 including a blank at the beginning of the original sentence, and 1 represents a sequential word order. If '29 -2 ', 2 represents the reverse order.
도 1의 소번역단위 및 번역어순 결정 유닛(120)은 입력 유닛으로부터 제 1 신호 또는 상기 제 2 신호를 수신하고, 제 1 신호 또는 제 2 신호에 따라 원문문장을 복수의 소번역단위들로 분할하고, 분할된 소번역단위들의 번역어순 순번을 결정한다. 소번역단위 및 번역어순 결정 유닛(120)은 상기 입력 유닛으로부터 상기 제 1 신호 또는 상기 제 2 신호를 수신하면, 번역어순패턴 매칭 유닛(121) 또는 기계적 분할 및 번역어순 결정유닛(122)에서 이미 결정된 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)이 있는 경우 이를 리셋하고, 제 1 신호 또는 상기 제 2 신호에 따라 상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위해 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 결정할 수 있다. The small translation unit and the translation order determination unit 120 of FIG. 1 receive the first signal or the second signal from the input unit, and divide the original text into a plurality of small translation units according to the first signal or the second signal. Then, the translation order of the divided small translation units is determined. When the small translation unit and the translation order determination unit 120 receive the first signal or the second signal from the input unit, the small translation unit and the translation word order determination unit 120 are already in the translation word order pattern matching unit 121 or the mechanical segmentation and translation order determination unit 122. If there are a plurality of small translation unit division patterns 300 and each of the plurality of small translation unit division patterns, the translation sequence number 5 is reset, and the original text is reset according to the first signal or the second signal. Splitting the plurality of small translation unit division patterns 300 and the plurality of small translation unit divisions including one or more parts of speech morphologically analyzed by morphological analysis of the original sentence to divide the sentence into a plurality of small translation units from the beginning to the end The translation order number 5 assigned to each of the patterns can be determined.
소번역단위 및 번역어순 결정 유닛(120)은 제 1 신호에 따라 제 1 신호가 지시하는 특정 위치(51, 52, 53, 54, 55)가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 올림차순의 순차 순번을 결정한다. 여기서, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정된다. 또한, 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시킨다. 예를 들어, 도 5g에서, 분할된 앞뒤 소번역단위들(34, 35) 간의 순차 순번(순번 2 및 순번 3)은 분할되기 전 소번역단위(33, 도 5e 참조)가 원래 가지고 있던 번역어순 순번(순번 2) 및 상기 번역어순 순번에 1이 증가된 번역어순 순번(순번 3)으로 결정된다. 또한, 제 1 신호가 지시하는 특정 위치(53)가 속하는 소번역단위(33)가 아닌 다른 소번역단위(32)의 번역어순 순번(순번 3)이 증가된 순번(순번 3)과 동일하면(도 5f 참조) 상기 동일한 순번을 갖는 다른 소번역단위들(32)에 지정된 번역어순 순번(순번 3)을 1만큼 증가(순번 4, 도 5g 참조)시킨다. 소번역단위 및 번역어순 결정 유닛(120)은 표시 유닛(141)으로 분할된 앞뒤 소번역단위 및 상기 앞뒤 소번역단위에 지정된 순번을 송신한다.  The small translation unit and the translation word order determining unit 120 refer to the small translation unit or the original sentence to which the specific position 51, 52, 53, 54, 55 indicated by the first signal belongs according to the first signal based on the specific position. In this example, two sub-translational units are divided and two sub-translational units are divided. Here, the sequential order between the divided front and rear sub-translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation language order. Further, if the translation order of the other translation unit other than the small translation unit to which the specific position indicated by the first signal belongs is the same as the increased order or has a greater order than the increased order, the other having the same or larger order Increment the translation order assigned to the small translation units by one. For example, in FIG. 5G, the sequential order (sequence 2 and 3) between the divided front and rear sub translation units 34 and 35 is the translation order originally possessed by the small translation unit 33 (see FIG. 5E) before being divided. It is determined by the order number (number 2) and the order number of translations (number 3) which is increased by 1 in the order of translation. Further, if the translation order sequential number (sequence 3) of the small translation unit 32 other than the small translation unit 33 to which the specific position 53 indicated by the first signal belongs is the same as the increased order (sequence 3) ( 5F) The translation sequence number (order 3) assigned to the other small translation units 32 having the same sequence number is increased by one (see sequence 4, FIG. 5G). The small translation unit and the translation word order determination unit 120 transmit the front and rear small translation units divided into the display unit 141 and the order number specified in the front and rear small translation units.
제 2 신호에 따라 제 2 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정한다. 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 상기 제 2 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시킨다. 예를 들어, 도 5j에서 분할된 앞뒤 소번역단위들(32, 37) 간의 역순 순번(순번 3 및 순번 2)은 분할되기 전 소번역단위(34)가 원래 가지고 있던 번역어순 순번(순번 2, 도 5h 참조)에 1이 증가된 번역어순 순번(순번 3) 및 원래 가지고 있던 번역어순 순번(순번 2)으로 결정되고, 상기 제 2 신호가 지시하는 특정 위치(54)가 속하는 소번역단위(34)가 아닌 다른 소번역단위(32, 35)의 번역어순 순번(순번 4, 순번 3)이 상기 증가된 순번(순번 3)과 동일하거나(도 5i참조) 상기 증가된 순번(순번 3)보다 큰 순번(순번 4)을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들(32, 35)에 지정된 번역어순 순번을 1만큼 증가시킨다. 또 다른 소번역단위들의 분할에서, 예를 들어, 도 5m에서 분할된 앞뒤 소번역단위들(38, 39) 간의 역순 순번(순번 5 및 순번 4)은 분할되기 전 소번역단위(35)가 원래 가지고 있던 번역어순 순번(순번 4, 도 5k 참조)에 1이 증가된 번역어순 순번(순번 5) 및 원래 가지고 있던 번역어순 순번(순번 4)으로 결정되고, 상기 제 2 신호가 지시하는 특정 위치(55)가 속하는 소번역단위(35)가 아닌 다른 소번역단위(32)의 번역어순 순번(순번 5)이 상기 증가된 순번(순번 5)과 동일하면(도 5l 참조) 상기 동일한 순번(순번 5)을 갖는 다른 소번역단위(32)에 지정된 번역어순 순번을 1만큼 증가시킨다. 소번역단위 및 번역어순 결정 유닛(120)은 표시 유닛(141)으로 분할된 앞뒤 소번역단위 및 상기 앞뒤 소번역단위에 지정된 순번을 송신한다. According to the second signal, the small translation unit or original sentence to which the specific position indicated by the second signal belongs is divided into two front and rear small translation units based on the specific position, and the reverse order of descending two divided translation units is divided. Determine the order. The reverse order between the divided front and rear sub-translation units is determined by the translation order number which is increased by 1 to the translation order sequence originally possessed by the small translation unit and the translation order sequence originally possessed by the second signal, and indicated by the second signal. Translation order assigned to other small translation units having the same or larger order if the translation sequence number of a small translation unit other than the small translation unit to which a specific position belongs is the same as or greater than the increased sequence number. Increases the sequence number by 1. For example, the reverse sequence number (sequence 3 and sequence 2) between the front and rear sub-translation units 32 and 37 divided in FIG. 5J is the translation sequence sequence number (sequence 2, 2) originally possessed by the small translation unit 34 before being split. 5H), the small translation unit (34) which is determined by the translation order number (number 3) and the original translation order number (number 2) having 1 increased, and to which the specific position 54 indicated by the second signal belongs. The translation order sequence (order number 4, sequence number 3) of the other small translation units 32, 35 other than) is the same as the increased sequence number (order 3) (see FIG. 5I) or greater than the increased sequence number (order 3). Having the sequence number (4) increases the translation sequence number assigned to the other small translation units 32, 35 having the same or larger sequence number by one. In another division of the small translation units, for example, the reverse order number (order 5 and number 4) between the front and rear sub translation units 38, 39 divided in FIG. It is determined by the translation order number (number 5) and the original number of translation word number (number 4) that 1 is increased to the translation word sequence number (number 4, see FIG. 5K), and the specific position indicated by the second signal ( 55) if the translation sequence number (sequence 5) of the small translation unit 32 other than the small translation unit 35 to which it belongs is equal to the increased sequence number (sequence 5) (see FIG. 5L), the same sequence (sequence 5) Increase the translation sequence number specified in the other small translation unit (32) having 1) by 1. The small translation unit and the translation word order determination unit 120 transmit the front and rear small translation units divided into the display unit 141 and the order number specified in the front and rear small translation units.
소번역문 및 번역문 생성유닛(130)은 소번역단위 및 번역어순 결정 유닛(120)으로부터 분할된 소번역단위들 및 이들에 지정된 번역어순 순번을 입력받아 소번역단위들 각각에 대한 소번역문들을 생성하고, 소번역단위들에 지정된 번역어순 순번에 따라 소번역문들을 정렬하여 번역문장을 생성하고 표시 유닛(141)으로 송신한다. 소번역문 및 번역문 생성유닛(130)에 포함된 문미 번역어 처리 유닛(133)은 소번역단위 및 번역어순 결정 유닛(120)으로부터 분할되는 분할패턴들의 형태소열 및 번역어순 순번에 대한 정보를 입력받아 각 소번역단위들에 대한 소번역문들의 문미 번역어를 번역문장이 완성되도록 처리할 수 있다. The small translation and the translation generation unit 130 receive the small translation units divided by the small translation unit and the translation word order determination unit 120 and the translation order assigned to them to generate the small translations for each of the small translation units. In accordance with the translation order assigned to the small translation units, the small translation sentences are arranged to generate a translation sentence, and are transmitted to the display unit 141. The sentence translation processing unit 133 included in the small translation and the translation generation unit 130 receives information on the morphological sequence of the split patterns divided by the small translation unit and the translation word order determination unit 120 and the translation word order. The transliteration of the small translations of the small translation units can be processed to complete the translation.
다른 실시예에서, 사용자는 표시 유닛(141)으로부터 원문문장에 대해 제 1 신호 또는 제 2 신호로부터 분할된 소번역단위들과 소번역단위들의 순번을 보고 입력 유닛(142)을 통해 새로운 제 1 신호 또는 제 2 신호를 입력할 수 있다. 사용자는 특정 위치에 대한 제 1 신호 또는 제 2 신호를 입력한 후 이에 따라 번역어순 순번을 결정하고 디스플레이된 원문문장에 대한 자동번역 결과를 보고 자동번역결과가 불만족스러운 경우 분할된 소번역단위들을 추가적으로 분할하기 위해 추가적으로 제 1 신호 또는 제 2 신호를 입력하거나, 직접 입력 유닛(141)을 통해 소번역문 또는 번역문장 중 수정할 부분을 입력할 수 있다. In another embodiment, the user sees the order of the small translation units and the small translation units divided from the first signal or the second signal for the original text from the display unit 141 and through the input unit 142 the new first signal. Alternatively, the second signal may be input. After inputting the first signal or the second signal for a specific position, the user determines the translation order and accordingly displays the result of the automatic translation of the displayed original text. In order to divide, the first signal or the second signal may be additionally input, or a portion of a small translation or a translation sentence may be input through the direct input unit 141.
표시 유닛(141)에서 소번역단위들 및 이들 각각에 지정된 번역어순 순번의 표시는 도 5a 내지 도 5n에서와 같이 원문문장 내에서 표시될 수도 있고 도 3a 내지 도 3i에서와 같이, 각 소번역단위들에 대한 소번역문들의 표시영역에 각 소번역단위에 지정된 순번들(5)을 표시할 수도 있다. 도 5b은 도 3b에 대응하고, 도 5d는 도 3c에 대응하고, 도 5g는 도 3d에 대응하고, 도 5j는 도 3e에 대응하며, 도 5m은 도 3f에 대응한다. In the display unit 141, the display of the sub-translation units and the translation order assigned to each of them may be displayed in the original text as shown in Figs. 5A to 5N, and as shown in Figs. 3A to 3I, each sub-translation unit The order numbers 5 assigned to each sub translation unit may be displayed in the display area of the sub translations. 5B corresponds to FIG. 3B, FIG. 5D corresponds to FIG. 3C, FIG. 5G corresponds to FIG. 3D, FIG. 5J corresponds to FIG. 3E, and FIG. 5M corresponds to FIG. 3F.
도 5n은 도 5a 내지 도 5m에서 소번역단위 및 번역어순 결정 유닛(120)에서 소번역단위들로의 분할 및 각 소번역단위들에 번역어순 순번이 결정된 후 소번역문 및 번역문 생성 유닛(130)에서 번역메모리 DB(170) 검색 또는 자동번역을 통해 각 소번역단위들에 대한 소번역문들이 생성되고 번역어순 순번에 따라 소번역문들이 정렬되어 번역문장이 생성된 것을 도시한다. FIG. 5N shows the small translation unit and the translation generation unit 130 after the division into the small translation units and the translation order are determined in each of the small translation units in the small translation unit and the translation word order determination unit 120 in FIGS. 5A to 5M. In the translation memory DB 170 search or automatic translation, small translations for each small translation unit are generated, and small translations are sorted according to the order of translation.
4. 4. 번역어순패턴Translation pattern DB에 저장된  Stored in DB 번역어순패턴을The translation order pattern 이용한  Used 소번역단위로의In small translation units 분할 및 번역어순 순번 결정 및 번역어순패턴 생성 Determination of segmentation and translation order and generation of translation order
도 6a 내지 도 6h는 도 2a의 원문이 도 4a 내지 도 4d의 번역어순패턴들로 번역어순패턴 DB(160)에 저장된 후 저장된 번역어순패턴을 기초로 번역문장을 생성하는 일련의 과정을 도시한다. 6A to 6H illustrate a series of processes for generating a translation sentence based on the stored translation order pattern after the original text of FIG. 2A is stored in the translation order pattern DB 160 as the translation order patterns of FIGS. 4A to 4D. .
도 6a는 새롭게 번역될 원문문장(601)이고, 도 6b는 원문문장(601)을 형태소 분석하고 태깅한 결과(650)이다. 도 6c는 번역어순패턴 DB(160)를 검색하여서 원문문장(601)을 형태소 분석하고 태깅한 형태소 분석 결과인 형태소열(651)과 매칭되는 번역어순패턴(462)을 도시한다. 형태소열은 원문문장(651)을 형태소 분석하고 태깅한 형태소분석 결과이다. 도 6d는 원문문장(601)의 형태소열 중 하나 이상이 번역어순패턴 DB(160)를 검색된 번역어순패턴(464)의 소번역단위 분할패턴들 각각의 시작부 및 종료부와 매칭되는 것을 도시한다. FIG. 6A is a text sentence 601 to be newly translated, and FIG. 6B is a result 650 of stemming and tagging the text sentence 601. 6C illustrates a translation word order pattern 462 matching the morpheme sequence 651 which is a result of morphological analysis and tagging of the original sentence 601 by searching the translation word order pattern DB 160. The morpheme sequence is the result of the morphological analysis of the original sentence 651 by morphological analysis and tagging. FIG. 6D shows that one or more of the morphological sequences of the original sentence 601 match the beginning and end of each of the small translation unit division patterns of the searched translation order pattern 464 in the translation order pattern DB 160. .
도 6e는 매칭된 번역어순패턴(462) 또는 번역어순패턴(464)의 소번역단위 분할패턴들에 의해 소번역단위들(530)로 분할되고 번역어순 순번(5)을 표시한 결과이다. FIG. 6E is a result of dividing the small translation unit 530 by the small translation unit division patterns of the matched translation word order pattern 462 or the translation word order pattern 464 and displaying the translation order order 5.
도 6f는 각각의 소번역단위들(530)을 소번역문 및 번역문 생성 유닛(130)을 통해 자동번역한 결과(642)이다. 6F is a result 642 of automatic translation of each of the small translation units 530 through the small translation and the translation generation unit 130.
도 6g는 문미번역어 처리 유닛(133)에 의해 번역어순패턴(462, 464)의 문미 번역어를 추가한 결과(643)를 도시한다. FIG. 6G shows a result 643 obtained by adding a translation of the translation of the translation order- order patterns 462 and 464 by the translation-processing unit 133.
도 6h는 문미번역어 처리 유닛(133)에 의해 각 소번역단위 분할패턴들의 분할패턴 종료부와 분할패턴 시작부의 분할패턴 쌍으로부터 정해지는 조사, 어미 또는 전치사 번역어 등의 문미 번역어를 고려하여 문미 번역어들을 처리한 결과를 도시한다. 이러한 처리가 끝나면 원문에 대한 번역문장이 완성된다. FIG. 6H illustrates the slang translations in consideration of a suffix translation such as a search, a ending or a prepositional translation, which is determined from a division pattern end of each sub-translation unit division patterns and a division pattern pair at the beginning of the division pattern by the sentence translation unit 133. The processing result is shown. After this processing, the translated text of the original text is completed.
5. 5. 소번역단위로의In small translation units 분할 및  Split and 번역어순English translation 순번을 결정하는  To determine the order 번역어순패턴을The translation order pattern 생성하는 방법 및 이를 수행하는 명령어들을 포함하는 컴퓨터 판독가능한 저장 매체 Computer-readable storage media containing methods for generating and instructions for performing the same
도 7a는 도 1a의 본 발명의 실시예에 따른 번역 장치 또는 프로그램에서 이용되는, 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하기 위한 방법 또는 이를 생성하는 기능들의 일 실시예이고, 도 7b는 도 1a의 본 발명의 실시예에 따른 번역 장치 또는 프로그램에서 이용되는, 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하고, 번역문을 생성하는 방법의 일 실시예이다. 이러한 방법들은 컴퓨터 실행가능 명령어에 의해 수행되고 상기 명령어는 컴퓨터 판독가능 기록 매체에 저장된다. FIG. 7A illustrates a translation word order pattern data structure for dividing an original text into small translation units that are translation subunits and determining translation order of the small translation units used in the translation apparatus or program according to the embodiment of FIG. 1A. 7B is an example of a method for generating or functions for generating the same, and FIG. 7B divides an original text into a small translation unit which is a translation subunit and is used in a translation apparatus or program according to the embodiment of the present invention of FIG. 1A. An embodiment of a method of determining the translation order of translation units and generating a translation is provided. These methods are performed by computer executable instructions and the instructions are stored on a computer readable recording medium.
본 발명에 따른 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되는 번역어순패턴 데이터 구조를 생성하기 위한 방법은 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 소번역단위 분할패턴 데이터들(300)을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 생성하는 단계;를 포함한다. The method for generating a translation order pattern data structure used for translation in a translation apparatus for translating an original sentence sentence (1) into a translation sentence sentence (2) according to the present invention includes a plurality of small translation units from the first sentence to the end of the original sentence. Generating sub-translational unit division pattern data 300 for dividing into sub-groups, wherein the sub-translational unit division pattern data 300 includes at least one part-of-speech part of a morpheme sequence obtained by morphological analysis of an original sentence; And generating translation word order data (5) designated in each of the plurality of small translation unit division pattern data (300).
원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법은, 원문문장을 불러오는 단계(1000 및 200), 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하는 단계(1010 및 2010), 원문문장을 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계(1020 및 2020), 분할된 소번역단위들 및 상기 소번역단위들의 결정된 번역어순을 사용자에게 표시하고 사용자로부터 입력을 수신하는 단계(1030 및 2030), 사용자에게 표시된 상기 소번역단위들 및 상기 소번역단위들에 결정된 번역어순 순번과 사용자가 입력한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 비교하고 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 포함하는 번역어순패턴을 생성하는 단계(1040 및 2040), 및 번역어순패턴을 번역어순패턴 DB(160)에 저장하는 단계(1050 및 2050)를 포함한다. 도 7a에서 번역어순패턴을 번역어순패턴 DB(160)에 저장하는 단계(1050) 이후에 다른 원문문장을 불러오면, 단계 1000부터 다시 상기 일련의 단계들을 다시 수행한다. 단계 1050에서 번역어순패턴 DB(160)에 저장된 번역어순패턴은 이후 원문문장을 번역할 때 번역어순패턴을 결정하기 위해 단계 1022에서 검색된다. The method for dividing the original text into sub-translational units, which are translation subunits, and determining the translation order of the sub-translation units, retrieving the original text (1000 and 200), morphologically analyzing the original text and applying the morphemes to the analyzed morphemes. Tagging parts of speech (1010 and 2010), dividing the original text into small translation units and determining translation order of the small translation units (1020 and 2020), the divided small translation units and the determined small translation units Displaying the translation order to the user and receiving input from the user (1030 and 2030), the translation order determined by the small translation units and the small translation units displayed to the user, and the small translation units input by the user, and Comparing the translation order number assigned to the small translation units and generating a translation order pattern including the small translation units and the translation order number assigned to the small translation units. And a step (1050 and 2050) for storing a phase (1040 and 2040), and the translated word order pattern on a translated word order pattern DB (160). In FIG. 7A, when another original sentence is retrieved after the step 1050 of storing the translation order pattern in the translation order pattern DB 160, the series of steps are performed again from step 1000. The translation order pattern stored in the translation order pattern DB 160 in step 1050 is then retrieved in step 1022 to determine the translation order pattern when the original text is translated.
도 7b의 번역문을 생성하는 방법 또는 기능은 도 7a의 번역어순패턴을 결정하고 저장하는 단계 이후에 소번역문 및 번역문 생성 단계(2050) 및 소번역메모리 및 번역메모리 DB(170)에 소번역문 및 번역문을 저장하는 단계(2060)를 더 포함한다. 도 7b에서 소번역메모리 및 번역메모리 DB(170)에 소번역문 및 번역문을 저장하는 단계(2060) 이후에 다른 원문문장을 불러오면, 단계 2000부터 다시 상기 일련의 단계들을 다시 수행한다. The method or function of generating the translated sentence of FIG. 7B includes the small translation and the translated sentence generation step 2050 and the small translation memory and the translation memory DB 170 after determining and storing the translation word order pattern of FIG. 7A. The method may further include storing 2060. In FIG. 7B, when the original text is loaded after the small translation and the translation in the small translation memory and the translation memory DB 170 (2060), the series of steps are performed again.
원문문장을 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계(1020 및 2020)는 원문문장을 형태소 분석한 형태소열을, 번역어순패턴 DB(160)에 저장된 번역어순패턴의 소번역단위 분할패턴들의 형태소열들과 비교하여 전체적 또는 부분적으로 매칭되는 번역어순패턴에 따라 원문문장을 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계(1022 및 2022), 및 품사가 태깅된 하나의 원문문장의 형태소열에서 특정 품사들 사이를 분할하는 미리 결정된 분할 쌍을 포함하고 상기 분할 쌍에 의해 분할될 앞뒤의 소번역단위들이 올림 차순의 순차 번역어순인지 또는 내림 차순의 역순 번역어순인지를 지정한 분할 어순 결정 규칙에 따라 복수의 소번역단위들로 분할하고 상기 소번역단위들에 번역어순을 결정하는, 원문문장을 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계(1024 및 2024) 중 하나 이상을 포함한다. 단계 1020에서 단계 1022 및 단계 1024는 양쪽 중 어느 하나의 단계가 수행될 수도 있고 양쪽 모두가 수행될 수 있다. 단계 2020에서 단계 2022 및 단계 2024는 양쪽 중 어느 하나의 단계가 수행될 수도 있고 양쪽 모두가 수행될 수 있다. The step 1020 and 2020 of dividing the original sentence into small translation units and determining the translation word order of the small translation units may be based on a morpheme sequence obtained by morphological analysis of the original sentence. (1022 and 2022) dividing the original text into small translation units and determining the translation language order of the small translation units according to a translation word order pattern that is matched in whole or in part with the morphological sequences of translation unit division patterns. Contains a predetermined division pair for dividing between specific parts-of-speech in a morphological sequence of one original sentence tagged, and the preceding and subsequent sub-translation units to be divided by the division pair are in ascending translation order or in descending order. Original sentence which divides into a plurality of small translation units according to a division word order determination rule specifying whether or not a translation is in order, and determines the translation order in the small translation units. Dividing the into small translation units and determining the translation order of the small translation units (1024 and 2024). In step 1020, steps 1022 and 1024 may be performed by either of the steps or both. In step 2020, steps 2022 and 2024 may be performed by either step or both.
도 7c는 사용자가 입력한 신호들에 따라 원문문장을 번역 소단위인 소번역단위로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법 또는 기능 및 이를 이용하여 번역하는 방법 또는 기능의 일 실시예이다. FIG. 7C illustrates a method or function for dividing an original text into small translation units that are translation subunits according to signals input by a user, and determining the order of translation of the small translation units, and a method or function for translating using the same. to be.
원문문장을 번역 소단위인 소번역단위들로 분할하면서 원문문장에 대한 소번역단위들의 번역어순을 결정하기 위한 방법 또는 기능은 원문문장을 불러오고 사용자에게 표시하는 단계(3000), 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하는 단계(3010), 원문문장 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는 단계(3020), 입력 유닛으로부터 상기 제 1 신호 또는 상기 제 2 신호를 수신하고, 상기 제 1 신호 또는 상기 제 2 신호에 따라 원문문장을 복수의 소번역단위들로 분할하고, 분할된 상기 소번역단위들의 번역어순 순번을 결정하는 단계(3030)를 포함한다. The method or function for determining the translation order of the sub-translation units for the original text by dividing the original text into the sub-translational units which are the translation sub-units is to retrieve the original text and display it to the user (3000), and to stem the original text. And tagging the parts of speech in the analyzed morphemes (3010), including a specific position information in the original sentence, and indicating a sequential translation order in ascending order to the front and rear small translation units divided based on the specific position. Receiving a first signal and a second signal indicating a reverse translation order in descending order to front and rear sub translation units including specific position information in the original sentence or sub translation unit and divided based on the specific position ( 3020) Receiving the first signal or the second signal from the input unit, and recovers the original text according to the first signal or the second signal. Division of the cows into the translation unit, and a step 3030 to determine the translation word order sequence number of the divided said small translation unit.
형태소 분석 및 품사 태킹을 통해 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴을 생성하는 방법 또는 기능은 전술한 원문문장을 번역 소단위인 소번역단위들로 분할하면서 원문문장에 대한 소번역단위들의 번역어순을 결정하기 위한 방법 또는 기능의 단계들에 추가하여, 원문문장, 소번역단위들, 소번역단위들 각각에 지정된 번역어순 순번 및 번역문장을 사용자에게 디스플레이하고 사용자로부터 입력을 수신하는 단계(3050); 및 사용자로부터 입력을 수신하고, 입력이 제 1 신호 또는 제 2 신호이면 단계 3030, 단계 3050을 다시 수행하고, 입력이 제 1 신호 또는 제 2 신호가 아니면, 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 형태소 분석 및 품사 태킹을 통해 변환된 상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위해 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴을 생성하는 단계를 더 포함한다. In each of the plurality of small translation unit division patterns 300 and the plurality of small translation unit division patterns 300 for dividing the original sentence into a plurality of small translation units from the beginning to the end through morphological analysis and part-of-speech tagging. A method or function for generating a translation sequence pattern comprising a designated translation sequence number (5) is a method for determining the translation sequence of the small translation units for the original sentence while dividing the above original sentence into small translation units which are translation subunits. Or in addition to the steps of the function, displaying the original sentence, the small translation units, the translation sequence number and the translation sentence assigned to each of the small translation units to the user, and receiving an input from the user (3050); And receiving the input from the user, and if the input is the first signal or the second signal, perform steps 3030 and 3050 again, and if the input is not the first signal or the second signal, the divided small translation units and the small It includes one or more parts of the morphemes in which the original sentence is morphologically analyzed to divide the original sentence transformed by morphological analysis and part-of-speech tagging into a plurality of small translation units from the beginning to the end. The method may further include generating a translation word order pattern including a plurality of small translation unit division patterns 300 and a translation word sequence number 5 assigned to each of the plurality of small translation unit division patterns 300.
다른 실시예에서, 형태소 분석 및 품사 태킹을 통해 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴을 생성하는 방법 또는 기능은 전술한 번역어순패턴을 결정하는 방법 또는 기능에 부가하여 소번역단위들 각각을 자동번역하여 소번역문들을 생성하며 상기 소번역단위들에 지정된 번역어순 순번에 따라 상기 소번역문들을 정렬하여 번역문장을 생성하는, 각 소번역단위들에 대한 소번역문들 및 번역문을 생성하는 단계(3040), 원문문장, 상기 소번역단위들, 및 상기 소번역단위들 각각에 지정된 번역어순 순번, 번역문장을 사용자에게 디스플레이하고 사용자로부터 입력을 수신하는 단계(3050), 및 사용자로부터 입력을 수신하고, 입력이 제 1 신호 또는 제 2 신호이면 단계 3030, 단계 3040 및 단계 3050을 수행하고, 상기 입력이 제 1 신호 또는 제 2 신호가 아니면, 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 형태소 분석 및 품사 태킹을 통해 변환된 상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위해 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴을 생성하는 단계(3070)를 포함한다. In another embodiment, the plurality of small translation unit division patterns 300 and the plurality of small translation unit division patterns for dividing an original sentence into a plurality of small translation units from sentence to sentence through morphological analysis and part-of-speech tagging (300) A method or function for generating a translation word order pattern including a translation word order number (5) designated in each of the small translation units by automatically translating each of the small translation units in addition to the above-described method or function. Generating small translations and translations for each of the small translation units by generating the translation sentences by sorting the small translation sentences according to the translation order assigned to the small translation units (3040), original sentence, Displaying (3050) the small translation units, and a translation sentence sequence number assigned to each of the small translation units, a translation sentence to a user, and receiving an input from the user; And receiving the input from the user, and if the input is the first signal or the second signal, perform steps 3030, 3040, and 3050, and if the input is not the first signal or the second signal, the divided small translation units And a morpheme sequence obtained by stemming the original sentence into a plurality of small translation units from the beginning to the end of the original sentence converted through morphological analysis and part-of-speech tagging from the translation order assigned to the small translation units. Generating a translation word order pattern including a plurality of small translation unit division patterns 300 including a part-of-speech and a translation word sequence number 5 assigned to each of the plurality of small translation unit division patterns 300 (step 3070). It includes.
본 발명의 다른 실시예에서, 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 방법 또는 기능은 도 7c의 단계 3010과 단계 3020 사이에, 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계(3012), 및 결정된 번역어순을 사용자에게 표시하는 단계(3018)를 더 포함할 수 있다. 또한, 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계(3012)는 번역어순패턴 DB를 검색하여 번역어순을 결정하는 단계(3014) 및 원문 문장을 분할하고 및 어순결정규칙에 따라 어순을 결정하는 단계(3016)를 포함할 수 있다. 단계(3012)에서 단계 3014 및 단계 3016은 양쪽 중 어느 하나의 단계가 수행될 수도 있고 양쪽 모두가 수행될 수 있다. 결정된 번역어순을 사용자에게 표시하는 단계(3018) 이후 입력 유닛으로부터 상기 제 1 신호 또는 상기 제 2 신호를 수신하면, 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계(3012)에서 결정된 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 리셋하고, 단계 3030에서, 상기 제 1 신호 또는 상기 제 2 신호에 따라 상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위해 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 결정할 수 있다. In another embodiment of the present invention, in the translation apparatus for translating the original sentence 1 into the translation sentence 2, the original sentence is divided into small translation units which are translation subunits, and the translation order of the small translation units is determined. The method or function includes the step 3012 of dividing the original text into small translation units and determining the translation order of the small translation units between steps 3010 and 3020 of FIG. 7C, and displaying the determined translation order to the user. Step 3018 may further include. In addition, the step 3012 of dividing the original sentence into small translation units and determining the translation order of the small translation units may be performed by retrieving the translation order pattern DB (3014) and dividing the original sentence. And determining 3016 in accordance with the word order determination rule. In step 3012, steps 3014 and 3016 may be performed in either or both of the steps. If the first signal or the second signal is received from an input unit after displaying the determined translation order to the user (3018), the original text is divided into small translation units and the translation order of the small translation units is determined. A plurality of small translation unit division patterns 300 determined in operation 3012 and a translation word order 5 assigned to each of the plurality of small translation unit division patterns are reset. In operation 3030, the first signal or the A plurality of sub-translational unit division patterns 300 including one or more parts of speech morphologically morphologically analyzed to divide the original sentence into a plurality of sub-translation units from the beginning to the end according to a second signal; and A translation word order (5) assigned to each of the plurality of small translation unit division patterns may be determined.
본 발명의 도 7a, 도 7b, 도 7c의 실시예에서, 소번역문 및 번역문 생성 단계(2050, 3040)는 실시예에 따라 생략될 수 있다. In the embodiments of FIGS. 7A, 7B, and 7C of the present invention, the small translations and the translation generation steps 2050 and 3040 may be omitted according to an embodiment.
6. 사용 언어 형태의 제한 없이 응용6. Application without restriction of language type
본 발명에서, 본 발명에 따른 번역어순패턴은 영어를 원문언어로 한국어를 목표언어로 하여 설명되었다. 그러나, 본 발명은 영어와 한국어에 한정되지 않고, 예를 들어 교착어, 굴절어, 고립어 간의 다른 원문 언어와 대상 언어 사이에서도 활용될 수 있다. 예를 들어, 일본어 및 중국어 뿐만 아니라, 독일어, 스페인어 등 다른 언어들 간에도 본 발명의 번역어순패턴을 번역에 적용시킬 수 있다.In the present invention, the translation order pattern according to the present invention has been described using English as the original language and Korean as the target language. However, the present invention is not limited to English and Korean, and may be utilized, for example, between other original languages and target languages between deadlocks, refractive words, and isolated words. For example, the translation word order pattern of the present invention can be applied to translation not only between Japanese and Chinese but also other languages such as German and Spanish.
전술한 설명은 본 발명의 단지 예시적인 실시예들을 개시한다. 본 발명의 범위 내에 속하는 위에 개시된 데이터 구조, 컴퓨터 실행가능 명령어들을 저장한 저장매체 및 방법의 변형예들은 당업자에게 매우 명백할 것이다. 따라서, 본 발명이 예시 실시예들과 관련하여 개시되었지만, 후속하는 청구 범위에 의해 정의되는 바와 같이, 당업자는 다른 실시예들이 본 발명의 범위 내에 있다는 것을 이해할 것이다. The foregoing description discloses only exemplary embodiments of the invention. Modifications of the above-described data structure, storage medium and method for storing computer-executable instructions within the scope of the invention will be readily apparent to those skilled in the art. Thus, while the present invention has been disclosed in connection with exemplary embodiments, those skilled in the art will understand that other embodiments are within the scope of the invention, as defined by the following claims.

Claims (22)

  1. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 컴퓨터 판독 가능 저장 매체에 저장된 번역어순패턴 데이터 구조로서,It is used for translation in a translation device for translating the original sentence (1) into the translation sentence (2) and the computer readable for dividing the original sentence into small translation units which are translation subunits and for determining the translation order of the small translation units. A translation order pattern data structure stored in a storage medium,
    상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하기 위한 소번역단위 분할패턴 데이터들(300) - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및Small translation unit division pattern data 300 for dividing the original sentence into a plurality of small translation units from the beginning to the end of the sentence-The small translation unit division pattern data 300 is in the morphological sequence of the morphological analysis of the original sentence. Includes one or more parts of speech; And
    상기 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5);을 포함하는,Includes; translation order number data (5) assigned to each of the plurality of small translation unit split pattern data (300),
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  2. 제 1 항에 있어서,The method of claim 1,
    상기 소번역단위 분할패턴 데이터들(300)에 의해 원문문장(1)으로부터 분할된 소번역단위들을 번역하고, 상기 번역어순 순번 데이터들에 따라 상기 소번역단위들 각각의 번역문인 소번역문들(4)을 정렬하여서 상기 원문문장의 번역문장(2)을 출력한 출력 데이터;를 더 포함하는,The small translation units divided from the original text sentence 1 by the small translation unit division pattern data 300 are translated, and the small translation sentences (4) which are translations of each of the small translation units are arranged according to the translation order number data. Output data outputting the translated sentence 2 of the original sentence by sorting); further comprising,
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  3. 제 1 항에 있어서,The method of claim 1,
    상기 소번역단위 분할패턴 데이터들(300) 각각은 각 소번역단위로의 분할의 시작을 지시하는 분할패턴 시작부, 중간부 및 각 소번역단위로의 분할의 종료를 지시하는 분할패턴 종료부로 구성되고, 상기 소번역단위 분할패턴 시작부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 하나 이상의 품사를 포함하고, 상기 분할패턴 종료부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 마지막에 위치된 하나 이상의 품사를 포함하고, 상기 중간부가 생략될 수 있는,Each of the small translation unit split pattern data 300 includes a split pattern start part which indicates the start of division into each small translation unit, a middle part, and a split pattern end part which indicates the end of division into each small translation unit. The small translation unit division pattern start part includes one or more parts of speech located at the front of the morphological sequences of the small translation unit division pattern data, and the division pattern end part is the last of the morphological sequences of the small translation unit division pattern data. One or more parts of speech located, wherein the intermediate portion may be omitted;
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  4. 제 1 항에 있어서,The method of claim 1,
    상기 소번역단위 분할패턴 데이터들(300) 각각은 각 소번역단위로의 분할의 시작을 지시하는 분할패턴 시작부, 중간부 및 각 소번역단위로의 분할의 종료를 지시하는 분할패턴 종료부로 구성되고, 상기 소번역단위 분할패턴 시작부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 품사로 구성되고, 상기 분할패턴 종료부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 마지막에 위치된 품사로 구성되고, 상기 중간부가 생략될 수 있는,Each of the small translation unit split pattern data 300 includes a split pattern start part which indicates the start of division into each small translation unit, a middle part, and a split pattern end part which indicates the end of division into each small translation unit. The small translation unit division pattern start part comprises a part-of-speech located at the front of the morphological sequences of the small translation unit division pattern data, and the division pattern end part is located last in the morphological sequences of the small translation unit division pattern data. Consists of parts of speech, the intermediate portion may be omitted,
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  5. 제 1 항에 있어서,The method of claim 1,
    상기 소번역단위 분할패턴 데이터들(300) 각각은 각 소번역단위로의 분할의 시작을 지시하는 분할패턴 시작부, 중간부 및 각 소번역단위로의 분할의 종료를 지시하는 분할패턴 종료부로 구성되고, 상기 소번역단위 분할패턴 시작부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 맨 앞에 위치된 하나 또는 두개로 구성되고, 상기 분할패턴 종료부는 상기 소번역단위 분할패턴 데이터의 형태소열 중 마지막 명사로부터 상기 소번역단위 분할패턴 데이터의 형태소열의 끝까지의 품사로 구성되고, 상기 중간부가 생략될 수 있는,Each of the small translation unit split pattern data 300 includes a split pattern start part which indicates the start of division into each small translation unit, a middle part, and a split pattern end part which indicates the end of division into each small translation unit. Wherein the small translation unit division pattern start part comprises one or two of the morphological sequences of the small translation unit division pattern data, and the division pattern end part is the last noun of the morphological sequences of the small translation unit division pattern data. Consisting of parts of speech from the small translation unit division pattern data to the end of the morphological sequence, wherein the intermediate portion may be omitted.
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  6. 제 1 항에 있어서,The method of claim 1,
    상기 소번역단위 분할패턴 데이터들에 의해 분할될 소번역단위의 소번역문의 문미에 위치될 조사, 어미 또는 전치사 대역어를 포함하는 문미 번역어가 상기 복수의 소번역단위 분할패턴 데이터들 중 하나 이상에 지정된,An end-to-end translation including a search, ending, or preposition band word to be located at the end of the small translation unit of the small translation unit to be divided by the small translation unit division pattern data is assigned to one or more of the plurality of small translation unit division pattern data. ,
    컴퓨터 판독 가능한 저장 매체에 저장된 번역어순패턴 데이터 구조.Translation pattern data structure stored in a computer readable storage medium.
  7. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체로서, 상기 컴퓨터 실행가능 명령어들은A translation word order pattern used for translation in a translation device for translating an original sentence sentence (1) into a translation sentence sentence (2) and dividing the original sentence sentence into small translation units which are translation subunits, and determining the translation order of the small translation units. A computer readable storage medium having stored computer executable instructions for generating a data structure, the computer executable instructions comprising:
    상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하고 상기 소번역단위들로부터 소번역단위 분할패턴 데이터들(300)을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및Dividing the original sentence into a plurality of small translation units from the beginning to the end and generating the small translation unit split pattern data 300 from the small translation units-the small translation unit split pattern data 300 Includes one or more parts of speech of the morphological sequence of the original sentence; And
    상기 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 생성하는 단계;를 수행하는,Generating translation word sequence data (5) designated in each of the plurality of small translation unit division pattern data (300);
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체.A computer readable storage medium storing computer executable instructions.
  8. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며, 원문문장을 번역 소단위들로 분할하고 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은It is used for translation in the translation device for translating the original sentence sentence (1) into the translation sentence sentence (2), and generates a translation word order pattern data structure for dividing the original sentence sentence into translation subunits and determining the translation order of the small translation units. A computer readable storage medium having stored thereon computer executable instructions, the computer executable instructions comprising:
    a) 원문문장을 불러오는 단계;a) retrieving the original text;
    b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; 및b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm; And
    c) 상기 제 1 항 내지 제 6항 중 어느 한 항에 따른 번역어순패턴 데이터 구조를 포함하는 번역어순패턴 DB(160)에 저장된 번역어순패턴의 소번역단위 분할패턴들의 형태소열들과 원문문장의 형태소열을 비교하여 매칭되는 번역어순패턴을 불러오고 상기 불러온 번역어순패턴의 소번역단위 분할패턴에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계;를 수행하는,c) morphological sequences of the small translation unit division patterns of the translation order pattern stored in the translation order pattern DB 160 including the translation order pattern data structure according to any one of claims 1 to 6; Comparing the morphological sequences to retrieve a matching translation order pattern, dividing the original text into small translation units according to the small translation unit division pattern of the imported translation order pattern, and determining the translation order of the small translation units; Doing,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  9. 제 8 항에 있어서, The method of claim 8,
    d) 상기 결정된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 사용자에게 표시하고 사용자로부터 입력을 수신하는 단계; 및d) displaying the determined small translation units and the translation order assigned to the small translation units to the user and receiving an input from the user; And
    e) 사용자로부터 입력을 수신하여 사용자에게 표시된 소번역단위들에 지정된 번역어순 순번과 비교하고, e) receive input from the user and compare it with the translation order assigned to the small translation units displayed to the user;
    사용자에게 표시한 소번역단위들에 지정된 번역어순 순번이 수정되었다면 사용자가 입력한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴 데이터 구조를 생성하고 번역어순패턴 DB(160)에 저장하고, If the translation order assigned to the small translation units displayed to the user has been corrected, a plurality of small translation units converted from the small translation units input by the user and the translation order assigned to the small translation units through morphological analysis and part-of-speech tagging Generating a translation order pattern data structure including a translation pattern order 5 assigned to each of the division patterns 300 and the plurality of small translation unit division patterns 300, and storing the translation order pattern data structure in the translation order pattern DB 160,
    수정되지 않았다면 사용자에게 표시한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 포함하는 번역어순패턴을 번역어순패턴 DB(160)에 저장하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 추가로 수행하는,Generating a translation order pattern data structure, which stores the translation order pattern including the small translation units displayed to the user and the translation order specified in the small translation units, if not corrected in the translation order pattern DB (160); To do more,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  10. 제 8 항에 있어서, The method of claim 8,
    상기 c) 상기 불러온 번역어순패턴의 소번역단위 분할패턴에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계는 C) dividing the original text into small translation units according to the small translation unit division pattern of the imported translation word order pattern, and determining the translation word order of the small translation units
    원문문장의 형태소열과 비교하여 원문문장의 형태소열들이 모든 소번역단위 분할패턴들 중 하나를 제외한 나머지 소번역단위 분할패턴들의 형태소열들과 매칭되는 번역어순패턴을 불러오거나, 또는 Fetches a translation word order pattern in which the morphological sequences of the original sentence match the morphological sequences of the remaining minor translation unit division patterns except one of all the minor translation unit division patterns, compared to the morphological sequences of the original sentence, or
    상기 원문문장을 소번역단위들로 분할하고, 상기 분할된 소번역단위들 각각을 형태소열로 변환했을 때, 상기 소번역단위 각각의 변환된 형태소열의 맨 앞에 위치된 하나 이상의 품사로 구성된 시작부 및 소번역단위들 각각의 형태소열 중 마지막에 위치된 하나 이상의 품사로 구성된 종료부와 매칭되는 번역어순패턴을 불러오는 것을 포함하는,When the original sentence is divided into sub-translational units, and each of the divided sub-translational units is converted into a morphological sequence, a beginning portion of one or more parts of speech located at the front of the converted morphological sequence of each sub-translational unit; and Importing a translation word order pattern that matches an end of one or more parts of speech located last in a morphological sequence of each sub-translation unit,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  11. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며, 원문문장을 번역 소단위들로 분할하고 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 It is used for translation in the translation device for translating the original sentence sentence (1) into the translation sentence sentence (2), and generates a translation word order pattern data structure for dividing the original sentence sentence into translation subunits and determining the translation order of the small translation units. A computer readable storage medium having stored thereon computer executable instructions, the computer executable instructions comprising:
    a) 원문문장을 불러오는 단계;a) retrieving the original text;
    b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm;
    c) 특정 품사들 간의 미리 설정된 분할 어순 결정 규칙에 따라 상기 형태소 분석된 원문문장을 복수의 소번역단위들로 분할하고 상기 소번역단위들에 번역어순을 결정하는 단계;c) dividing the morphologically analyzed original text into a plurality of small translation units according to a predetermined division word order determination rule between specific parts of speech and determining a translation word order in the small translation units;
    d) 상기 결정된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 사용자에게 표시하고 사용자로부터 입력을 수신하는 단계; 및d) displaying the determined small translation units and the translation order assigned to the small translation units to the user and receiving an input from the user; And
    e) 번역어순패턴 데이터 구조를 생성하는 단계로서, 사용자로부터 입력을 수신하여 e) generating a translation order pattern data structure, the method comprising receiving input from a user and
    사용자에게 표시된 소번역단위들에 지정된 번역어순 순번과 비교하고, Compare with the translation sequence number assigned to the small translation units displayed to the user,
    사용자에게 표시한 소번역단위들에 지정된 번역어순 순번이 수정되었다면 사용자가 입력한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴들(300) 및 상기 복수의 소번역단위 분할패턴들(300) 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴 데이터 구조를 생성하고, If the translation order assigned to the small translation units displayed to the user has been corrected, a plurality of small translation units converted from the small translation units input by the user and the translation order assigned to the small translation units through morphological analysis and part-of-speech tagging Generating a translation word order data structure including a translation pattern order (5) assigned to each of the division patterns (300) and the plurality of small translation unit division patterns (300),
    수정되지 않았다면 사용자에게 표시한 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 각각 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴들(300)으로서, 원문 문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300), 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴 데이터 구조를 생성하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 수행하는,If not corrected, the original sentence is a plurality of small translation unit division patterns 300 converted through morphological analysis and part-of-speech tagging from the small translation units displayed to the user and the translation order assigned to the small translation units, respectively. A translation word order data structure including a plurality of sub-translational unit division patterns 300 including one or more parts of speech analyzed among morpheme sequences, and a translation word sequence number 5 assigned to each of the plurality of sub-translation unit division patterns. Generating a translation order pattern data structure;
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  12. 제 11 항에 있어서, The method of claim 11,
    상기 c) 상기 형태소 분석된 원문문장을 복수의 소번역단위들로 분할하고 상기 소번역단위들에 번역어순을 결정하는 단계는C) dividing the morphologically analyzed original text into a plurality of small translation units and determining a translation word order in the small translation units.
    미리 설정된 특정 품사들 사이를 분할하는 분할 쌍을 포함하고, 상기 분할 쌍에 의해 분할될 앞뒤의 소번역단위들이 올림 차순의 순차 번역어순인지 또는 내림 차순의 역순 번역어순인지를 지정하는 상기 분할 어순 결정 규칙에 의해, 상기 분할 어순 결정 규칙의 분할 쌍과 일치하는 품사의 배열이 있으면 원문문장의 형태소열 중 일치하는 2개의 특정 품사들 사이를 2 개의 소번역단위들로 분할하고, 상기 분할 어순 결정 규칙의 분할 쌍에 지정된 순차 번역어순 또는 역순 번역어순에 따라 상기 분할된 2 개의 소번역단위들의 번역어순 순번을 결정하는 방식으로, 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하고, 상기 분할 어순 결정 규칙의 분할 쌍에 지정된 순차 번역어순 또는 역순 번역어순에 따라 상기 분할된 2 개의 소번역단위들의 번역어순 순번을 결정하는 단계를 포함하는,The division word order determination, including a division pair for dividing between predetermined part-of-speech parts, and specifying whether the front and rear sub-translation units to be divided by the division pair are sequential translation order in ascending order or reverse translation order in descending order According to the rule, if there is an arrangement of parts of speech that match the division pair of the division word order determination rule, the two partial parts of speech matching among the morphological sequences of the original sentence are divided into two small translation units, and the division word order determination rule The original sentence is divided into a plurality of small translation units from the beginning to the end of the sentence in a manner of determining the translation order of the divided two small translation units according to the sequential translation order or the reverse translation order specified in the division pair of. The divided two sub-translation units according to the sequential translation order or the reverse translation order specified in the division pair of the division word order decision rule. Determining a translated word order sequence,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  13. 제 12 항에 있어서, The method of claim 12,
    상기 분할 어순 결정 규칙의 분할 쌍에 지정된 순차 번역어순 또는 역순 번역어순에 따라 상기 분할된 2 개의 소번역단위들의 번역어순 순번을 결정하는 단계는 Determining the translation order number of the divided two small translation unit according to the sequential translation order or reverse translation order assigned to the division pair of the division word order determination rule
    상기 순차 번역어순에 따라, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정되고, 분할된 앞뒤 소번역단위들이 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키는 단계; 또는According to the sequential translation order, the sequential order between the divided front and rear sub-translation units is determined by the translation order number originally possessed by the small translation unit and the translation order number by which 1 is increased to the translation order. If the translation sequence number of the other translation units other than the preceding and subsequent sub-translation units has a sequence number equal to or greater than the increased sequence number, the translation sequence number assigned to the other translation units having the same or greater sequence number is 1; Increasing by; or
    상기 역순 번역어순에 따라, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 분할된 앞뒤 소번역단위들이 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키는 단계;를 포함하는According to the reverse translation order, the reverse order of descending order is divided into two sub-translational units that are divided, but the reverse order between the divided front and rear sub-translation units is 1 in the translation order originally possessed by the small translation unit. Determined by this increased translation order number and the original translation order number, and if the translation order number of a small translation unit other than the divided back and forth small translation units has the same order as the increased order number or greater than the increased order number Increasing the translation order number assigned to other small translation units having the same or larger order number by one;
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  14. 제 12 항에 있어서, The method of claim 12,
    상기 특정 품사들 간의 미리 설정된 분할 어순 결정 규칙은 분할 쌍들 사이에 먼저 분할되는 우선 순위를 지정할 수 있는,The predetermined division word order determination rule between the specific parts of speech may specify a priority to be divided first among division pairs.
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  15. 제 13 항에 있어서, The method of claim 13,
    상기 특정 품사들 간의 미리 결정된 분할 어순 결정 규칙은 절과 절을 연결하는 등위접속사를 포함하는 분할 쌍이 우선순위에서 먼저 분할되는 것을 포함하는, The predetermined division word order determination rule between the specific parts of speech includes that a division pair including a conjunctive conjunction connecting clauses and clauses is first divided in priority.
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  16. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며, 원문문장을 번역 소단위들로 분할하고 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 It is used for translation in the translation device for translating the original sentence sentence (1) into the translation sentence sentence (2), and generates a translation word order pattern data structure for dividing the original sentence sentence into translation subunits and determining the translation order of the small translation units. A computer readable storage medium having stored thereon computer executable instructions, the computer executable instructions comprising:
    a) 원문문장을 불러오고 사용자에게 표시하는 단계;a) retrieving the original text and displaying it to the user;
    b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -;b) stemming the original text and tagging parts of speech into the analyzed morphemes, wherein stemming the original text and tagging the parts of speech comprises stemming the original code and tagging the original code;
    c) 상기 원문문장 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는 단계; c) a first signal including the specific position information in the original sentence and indicating a sequential translation order in ascending order to the front and rear sub-translation units divided based on the specific position, and the specific in the original sentence or sub-translation unit Receiving a second signal including position information and indicating a reverse translation order in descending order to front and rear sub-translation units divided based on the specific location;
    d) 상기 제 1 신호 또는 상기 제 2 신호에 따라, 상기 원문문장을 복수의 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계; 및d) dividing the original text into a plurality of small translation units according to the first signal or the second signal and determining a translation word order of the small translation units; And
    e) 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 각각 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴들(300)로서, 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300), 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴 데이터 구조를 생성하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 수행하는,e) A plurality of sub-translational unit division patterns 300 converted through morphological analysis and part-of-speech tagging from the divided sub-translation units and the translation word order assigned to the sub-translation units, respectively, wherein the original sentence is morphologically analyzed. Generate a translation word order data structure including a plurality of sub-translational unit division patterns 300 including one or more parts of speech of morpheme sequences, and a translation word sequence number 5 assigned to each of the plurality of sub-translation unit division patterns. Generating a translation order pattern data structure;
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  17. 제 16 항에 있어서,The method of claim 16,
    f) 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번을 입력받고, 상기 소번역단위들 각각을 번역하여 소번역문들을 생성하며 상기 소번역단위들에 지정된 번역어순 순번에 따라 상기 소번역문들을 정렬하여 번역문장을 생성하는, 소번역문 및 번역문 생성 단계; 및f) receive the divided small translation units and the translation order assigned to the small translation units, translate each of the small translation units to generate small translations, and according to the translation order assigned to the small translation units Generating a translation sentence by arranging the small translation sentences; And
    g) 상기 원문문장, 상기 소번역단위들, 상기 소번역단위들의 번역어순 순번, 및 상기 번역문장을 사용자에게 표시하는 단계;를 추가로 수행하는,g) displaying the original sentence, the small translation units, the sequential order of the small translation units, and the translated sentence to a user;
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  18. 제 16 항에 있어서, The method of claim 16,
    원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계는The step of dividing the original text into small translation units and determining the translation order of the small translation units is
    상기 제 1 신호에 따라 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 올림차순의 순차 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키고, According to the first signal, the small translation unit or original sentence to which the specific position indicated by the first signal belongs is divided into two front and rear small translation units based on the specific position, and divided into two small translation units. The order of the ascending order is determined, but the order between the divided front and rear subtranslation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation order. Another sub-translation with the same or greater order if the translation order of the other translation unit other than the small translation unit to which the first signal is indicated has a sequence equal to or greater than the increased order Increase the translation order assigned to units by 1,
    상기 제 2 신호에 따라 상기 제 2 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키는 단계를 포함하는,According to the second signal, a small translation unit or original sentence to which a specific position indicated by the second signal belongs is divided into two front and rear small translation units based on the specific position, and divided into two small translation units. Decide the descending order of descending order, but the reverse order between the divided back and forth subtranslation units is determined by the translation order number increased by 1 to the original translation order number before the subdivision unit was divided, and the original translation order order Another sequence having the same or greater sequence number if the translation sequence number of the other translation unit other than the small translation unit to which the specific position indicated by the first signal belongs is the same as the increased sequence number or is greater than the increased sequence number. Increasing the translation order specified by the small translation units by one,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  19. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며, 원문문장을 번역 소단위들로 분할하고 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체로서, 상기 컴퓨터 실행가능 명령어들은 It is used for translation in the translation device for translating the original sentence sentence (1) into the translation sentence sentence (2), and generates a translation word order pattern data structure for dividing the original sentence sentence into translation subunits and determining the translation order of the small translation units. A computer readable storage medium having stored thereon computer executable instructions, the computer executable instructions comprising:
    a) 원문문장을 불러오는 단계;a) retrieving the original text;
    b) 원문문장을 형태소 분석하고 상기 분석된 형태소들에 품사들을 태깅하여 형태소열로 변환하는 단계 - 상기 원문문장을 형태소 분석하고 품사를 태킹하는 것은 원문 부호를 형태소 분석하고 원문 부호를 태킹하는 것을 포함함 -; 및b) morphologically analyzing the original sentence and tagging the parts of speech into the analyzed morphemes to form a morphological sequence-stemming the original sentence and tagging the parts of speech includes stemming the original code and tagging the original code. Hamm; And
    c) 상기 제 1 항 내지 제 6 항에 따른 번역어순패턴 데이터 구조를 포함하는 번역어순패턴 DB(160)에 저장된 번역어순패턴의 소번역단위 분할패턴들의 형태소열들과 원문문장의 형태소열을 비교하여 매칭되는 번역어순패턴을 불러오고 상기 불러온 번역어순패턴의 소번역단위 분할패턴에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하는 단계;c) comparing the morphological sequences of the small translation unit division patterns of the translation order pattern stored in the translation order pattern DB 160 including the translation order pattern data structure according to claim 1 to the original sentence. Importing a matching translation word order pattern, dividing an original text into small translation units according to the small translation unit division pattern of the imported translation word order pattern, and determining the translation word order of the small translation units;
    d) 상기 원문문장, 상기 소번역단위들, 및 상기 소번역단위들의 번역어순 순번을 사용자에게 표시하는 단계;d) displaying the original sentence, the small translation units, and the translation order of the small translation units to a user;
    e) 상기 원문문장 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 올림 차순의 순차 번역어순을 지시하는 제 1 신호, 및 상기 원문문장 또는 소번역단위 내 특정 위치 정보를 포함하며 상기 특정 위치를 기준으로 분할되는 앞뒤의 소번역단위들에 내림 차순의 역순 번역어순을 지시하는 제 2 신호를 입력받는 단계; 및e) a first signal which includes the specific position information in the original text and indicates the sequential translation order in ascending order to the front and rear sub-translation units divided based on the specific position, and the specific in the original sentence or sub-translation unit Receiving a second signal including position information and indicating a reverse translation order in descending order to front and rear sub-translation units divided based on the specific location; And
    f) 상기 원문문장을 상기 제 1 신호 또는 상기 제 2 신호에 따라 복수의 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하거나, 또는 상기 c)에 의해 결정된 소번역단위들을 상기 제 1 신호 또는 상기 제 2 신호에 따라 복수의 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계; 및f) dividing the original text into a plurality of small translation units according to the first signal or the second signal and determining the translation order of the small translation units, or the small translation units determined by c). Dividing into a plurality of small translation units according to a first signal or the second signal and determining a translation word order of the small translation units; And
    e) 상기 분할된 소번역단위들 및 상기 소번역단위들에 지정된 번역어순 순번으로부터 각각 형태소 분석 및 품사 태킹을 통해 변환된 복수의 소번역단위 분할패턴들(300)로서, 원문 문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함하는 복수의 소번역단위 분할패턴들(300), 및 상기 복수의 소번역단위 분할패턴들 각각에 지정된 번역어순 순번(5)을 포함하는 번역어순패턴 데이터 구조를 생성하는, 번역어순패턴 데이터 구조를 생성하는 단계;를 수행하는,e) A plurality of sub-translational unit division patterns 300 converted through morphological analysis and part-of-speech tagging from the divided sub-translation units and the translation word order assigned to the sub-translation units, respectively, wherein Generate a translation word order data structure including a plurality of sub-translational unit division patterns 300 including one or more parts of speech of morpheme sequences, and a translation word sequence number 5 assigned to each of the plurality of sub-translation unit division patterns. Generating a translation order pattern data structure;
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  20. 제 19 항에 있어서, The method of claim 19,
    상기 f) 상기 제 1 신호 또는 상기 제 2 신호에 따라 원문문장을 소번역단위들로 분할하고 상기 소번역단위들의 번역어순 순번을 결정하는 단계는F) dividing the original text into small translation units according to the first signal or the second signal and determining a translation order of the small translation units
    상기 제 1 신호에 따라 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 올림차순의 순차 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 순차 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번 및 상기 번역어순 순번에 1이 증가된 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키고, According to the first signal, the small translation unit or original sentence to which the specific position indicated by the first signal belongs is divided into two front and rear small translation units based on the specific position, and divided into two small translation units. The order of the ascending order is determined, but the order between the divided front and rear subtranslation units is determined by the translation order number originally possessed by the small translation unit and the translation order number increased by 1 to the translation order. Another sub-translation with the same or greater order if the translation order of the other translation unit other than the small translation unit to which the first signal is indicated has a sequence equal to or greater than the increased order Increase the translation order assigned to units by 1,
    상기 제 2 신호에 따라 상기 제 2 신호가 지시하는 특정 위치가 속하는 소번역단위 또는 원문문장을 상기 특정 위치를 기준으로 2 개의 앞뒤 소번역단위들로 분할하고, 분할되는 2 개의 소번역단위들에 내림차순의 역순 순번을 결정하되, 분할된 앞뒤 소번역단위들 간의 역순 순번은 분할되기 전 소번역단위가 원래 가지고 있던 번역어순 순번에 1이 증가된 번역어순 순번 및 원래 가지고 있던 번역어순 순번으로 결정되고, 상기 제 1 신호가 지시하는 특정 위치가 속하는 소번역단위가 아닌 다른 소번역단위의 번역어순 순번이 상기 증가된 순번과 동일하거나 상기 증가된 순번보다 큰 순번을 가지면 상기 동일하거나 큰 순번을 갖는 다른 소번역단위들에 지정된 번역어순 순번을 1만큼 증가시키는 단계를 포함하는,According to the second signal, a small translation unit or original sentence to which a specific position indicated by the second signal belongs is divided into two front and rear small translation units based on the specific position, and divided into two small translation units. Decide the descending order of descending order, but the reverse order between the divided back and forth subtranslation units is determined by the translation order number increased by 1 to the original translation order number before the subdivision unit was divided, and the original translation order order Another sequence having the same or greater sequence number if the translation sequence number of the other translation unit other than the small translation unit to which the specific position indicated by the first signal belongs is the same as the increased sequence number or is greater than the increased sequence number. Increasing the translation order specified by the small translation units by one,
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능한 저장매체.A computer readable storage medium storing computer executable instructions.
  21. 제 7 항 내지 제 20 항에 따른 컴퓨터 판독가능한 저장매체에 저장된 명령어들을 이용하여 번역을 수행하는, A translation is performed using instructions stored in a computer readable storage medium according to claims 7 to 20,
    컴퓨터 판독가능한 저장매체에 저장된 번역 프로그램.Translation programs stored on computer readable storage media.
  22. 원문문장(1)을 번역문장(2)으로 번역하기 위한 번역 장치에서 번역을 위해 이용되며 원문문장을 번역 소단위인 소번역단위들로 분할하고 상기 소번역단위들의 번역어순을 결정하기 위한 번역어순패턴 데이터 구조를 생성하는 컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체를 다운로드 가능한 형태로 배포하는 서버로서, 상기 컴퓨터 실행가능 명령어들은A translation word order pattern used for translation in a translation device for translating an original sentence sentence (1) into a translation sentence sentence (2) and dividing the original sentence sentence into small translation units which are translation subunits and determining the translation order of the small translation units A server for distributing, in downloadable form, a computer readable storage medium storing computer executable instructions for generating a data structure, the computer executable instructions comprising:
    상기 원문문장을 문두부터 문미까지 복수의 소번역단위들로 분할하고 상기 소번역단위들로부터 소번역단위 분할패턴 데이터들(300)을 생성하는 단계 - 상기 소번역단위 분할패턴 데이터들(300)은 원문문장을 형태소 분석한 형태소열 중 하나 이상의 품사를 포함함 -; 및Dividing the original sentence into a plurality of small translation units from the beginning to the end and generating the small translation unit split pattern data 300 from the small translation units-the small translation unit split pattern data 300 Includes one or more parts of speech of the morphological sequence of the original sentence; And
    상기 복수의 소번역단위 분할패턴 데이터들(300) 각각에 지정된 번역어순 순번 데이터들(5)을 생성하는 단계;를 수행하는,Generating translation word sequence data (5) designated in each of the plurality of small translation unit division pattern data (300);
    컴퓨터 실행가능 명령어들을 저장한 컴퓨터 판독가능 저장매체를 다운로드 가능한 형태로 배포하는 서버. A server for distributing computer readable storage media storing computer executable instructions in a downloadable form.
PCT/KR2016/002909 2016-03-16 2016-03-23 Data structure for determining translation order of words included in source language text, program for generating data structure, and computer-readable storage medium storing same WO2017159906A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2016-0031588 2016-03-16
KR1020160031588A KR20170107808A (en) 2016-03-16 2016-03-16 Data structure of translation word order pattern separating original text into sub-translation units and determining word order of sub-translation units, computer-readable storage media having instructions for creating data structure stored therein, and computer programs for translation stored in computer-readable storage media executing traslation therewith

Publications (1)

Publication Number Publication Date
WO2017159906A1 true WO2017159906A1 (en) 2017-09-21

Family

ID=59851030

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2016/002909 WO2017159906A1 (en) 2016-03-16 2016-03-23 Data structure for determining translation order of words included in source language text, program for generating data structure, and computer-readable storage medium storing same

Country Status (2)

Country Link
KR (1) KR20170107808A (en)
WO (1) WO2017159906A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019107623A1 (en) * 2017-11-30 2019-06-06 주식회사 시스트란인터내셔널 Machine translation method and apparatus therefor
KR102592630B1 (en) * 2018-11-21 2023-10-23 한국전자통신연구원 Simultaneous interpretation system and method using translation unit band corpus
KR102181677B1 (en) * 2018-12-18 2020-11-24 (주)아이브릭스 Method and apparatus for structuring patent claims

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020152081A1 (en) * 2001-04-16 2002-10-17 Mihoko Kitamura Apparatus and method for adding information to a machine translation dictionary
US20050060160A1 (en) * 2003-09-15 2005-03-17 Roh Yoon Hyung Hybrid automatic translation apparatus and method employing combination of rule-based method and translation pattern method, and computer-readable medium thereof
US20070150260A1 (en) * 2005-12-05 2007-06-28 Lee Ki Y Apparatus and method for automatic translation customized for documents in restrictive domain
KR20100069119A (en) * 2008-12-16 2010-06-24 한국전자통신연구원 Syntactic analysis method and its apparatus
KR20120046414A (en) * 2010-11-02 2012-05-10 에스케이플래닛 주식회사 Apparatus and method for providing result of mid-translation processing

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020152081A1 (en) * 2001-04-16 2002-10-17 Mihoko Kitamura Apparatus and method for adding information to a machine translation dictionary
US20050060160A1 (en) * 2003-09-15 2005-03-17 Roh Yoon Hyung Hybrid automatic translation apparatus and method employing combination of rule-based method and translation pattern method, and computer-readable medium thereof
US20070150260A1 (en) * 2005-12-05 2007-06-28 Lee Ki Y Apparatus and method for automatic translation customized for documents in restrictive domain
KR20100069119A (en) * 2008-12-16 2010-06-24 한국전자통신연구원 Syntactic analysis method and its apparatus
KR20120046414A (en) * 2010-11-02 2012-05-10 에스케이플래닛 주식회사 Apparatus and method for providing result of mid-translation processing

Also Published As

Publication number Publication date
KR20170107808A (en) 2017-09-26

Similar Documents

Publication Publication Date Title
US5794177A (en) Method and apparatus for morphological analysis and generation of natural language text
WO2014025135A1 (en) Method for detecting grammatical errors, error detecting apparatus for same, and computer-readable recording medium having the method recorded thereon
US6470306B1 (en) Automated translation of annotated text based on the determination of locations for inserting annotation tokens and linked ending, end-of-sentence or language tokens
WO2016125949A1 (en) Automatic document summarizing method and server
WO2014069779A1 (en) Syntax preprocessing-based syntax analysis apparatus, and method for same
Brill et al. Automatically acquiring phrase structure using distributional analysis
WO1997004405A9 (en) Method and apparatus for automated search and retrieval processing
JP2002215617A (en) Method for attaching part of speech tag
WO2015050321A1 (en) Apparatus for generating self-learning alignment-based alignment corpus, method therefor, apparatus for analyzing destructive expression morpheme by using alignment corpus, and morpheme analysis method therefor
WO2017159906A1 (en) Data structure for determining translation order of words included in source language text, program for generating data structure, and computer-readable storage medium storing same
US5088039A (en) System for translating adverb phrases placed between two commas through a converter using tree-structured conversion rules
GB2209614A (en) Translating apparatus
JPH0658676B2 (en) How to proofread multiple parts of speech
KR20120048139A (en) Automatic translation device and method thereof
WO2020209498A1 (en) Patent search method and device
US20040054677A1 (en) Method for processing text in a computer and a computer
Saquete et al. Evaluation of the automatic multilinguality for time expression resolution
WO1997048058A1 (en) Automated translation of annotated text
KR100374114B1 (en) Hyperlink generator for korean language terminology based HTML
JP4007630B2 (en) Bilingual example sentence registration device
WO2024019224A1 (en) Method for processing structured data and unstructured data in database, and data processing platform for providing method
JP2840258B2 (en) Method of creating bilingual dictionary and co-occurrence dictionary for machine translation system
KR20090066470A (en) System and method for korean morphological analysis using pre-analysis data
Diewald et al. TOKENIZING ON SCALE
JP2719453B2 (en) Machine translation equipment

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16894637

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16894637

Country of ref document: EP

Kind code of ref document: A1