Forum: Wordfast support
Topic: How can I pull out internal repetitions and internal fuzzies on WFP
Poster: Samuel Murray
Post title: Just a thought
[quote]jyuan_us wrote:
Is there a way to pull out the segments of repetitions/95-99% fuzzy matches into a separate document? [/quote]
I haven't tried any of this.
For only exact matches and external fuzzies:
Pre-translate a copy of the file against the TM, and then use WFP to create a bilingual export of the file (PM Perspective > Bilingual Export). Open that export in MS Word, and you should be able to sort the table by score and delete the rows for segments that had no match. You should then be able to copy what is left of the source column into a new TXML file, which the translator can translate (to build a translation memory from those segments).
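If sorting and deleting rows in Word gets tedious, that filtering step could probably be scripted. Here is a rough, untested-against-WFP sketch in Python. It assumes you have first saved the export table as tab-separated text, and that the columns are ID, source, target, score in that order. That column layout is my guess, not something I have checked against a real WFP export, and the function name keep_matches is made up for this example.

```python
import csv
import io

def keep_matches(tsv_text, min_score=95, score_col=3, source_col=1):
    """Return the source segments whose match score is at least min_score.

    Column positions are assumptions; adjust score_col and source_col
    to whatever your export actually looks like.
    """
    kept = []
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip short or malformed rows.
        if len(row) <= max(score_col, source_col):
            continue
        score_text = row[score_col].strip().rstrip("%")
        # A header row or a segment with no match has no numeric score.
        if not score_text.isdigit():
            continue
        if int(score_text) >= min_score:
            kept.append(row[source_col])
    return kept
```

You would read the saved text file into a string, pass it to keep_matches, and paste the resulting segments into the new file. Change min_score to whatever threshold you settle on.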
For internal fuzzies:
Copy the source column from a bilingual export of the file into a new MS Word file (either all segments or only the ones that had no matches in the previous step), and open it in WFP. Start with a blank TM, and set WFP to copy source to target if there is no match, and to automatically insert the best fuzzy match. Then run through the segments one by one (this is where an AutoIt script can be useful, if you want to get up and make some coffee while the script runs through the document). Then do a bilingual export again. Again, the match percentages will all be in one column, and you should be able to remove the segments that had no matches.
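To show what the blank-TM trick is doing (each segment gets compared against the segments that came before it), here is a small Python sketch of the same idea outside WFP. Big caveat: it uses SequenceMatcher from Python's difflib to score similarity, which is not WFP's fuzzy match algorithm, so the percentages will not agree with what WFP reports. The function name internal_matches and the default threshold are just for illustration.

```python
from difflib import SequenceMatcher

def internal_matches(segments, threshold=0.5):
    """For each segment, find its best match among the EARLIER segments.

    Returns (segment, best_earlier_segment, percent) tuples for segments
    whose best score reaches the threshold. Scores come from difflib,
    not from WFP, so treat them as a rough indication only.
    """
    results = []
    for i, seg in enumerate(segments):
        best_score, best_prev = 0.0, None
        # Only look backwards, like a TM that fills up as you go.
        for prev in segments[:i]:
            score = SequenceMatcher(None, prev, seg).ratio()
            if score > best_score:
                best_score, best_prev = score, prev
        if best_prev is not None and best_score >= threshold:
            results.append((seg, best_prev, round(best_score * 100)))
    return results
```

An exact repetition comes out at 100, and anything under the threshold is left out, which corresponds to the segments you would delete from the second export. This compares every segment against all earlier ones, so it will be slow on very large files.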
I'm no expert at WFP, but I think you need to set the following two options in Edit > Preferences: Translation Memory > Write unedited fuzzy and exact matches to a TM, and Copy source on no match in editor. While you're at it, lower the match threshold to 50% (you can decide afterwards what threshold you want to give your translator). You'll also have to use Alt+Insert on the first segment to get the ball rolling; after that you can just press Alt+Down. WFP will ask for confirmation once if you try to accept an unedited fuzzy match.
How many files do you have?
Samuel
[Edited at 2015-12-06 16:39 GMT]