AW: AW: [xep-support] French hyphenation bug ?

From: Kurt Spescha <kurt.spescha@consor.ch>
Date: Mon Jan 22 2007 - 05:13:54 PST

Jacques,

You can find a short description at this link (FOP uses the same
hyphenation patterns as RenderX).

http://xmlgraphics.apache.org/fop/0.20.5/hyphenation.html

- Non-numeric characters represent characters in a sub-word to be evaluated.

- The period character (.) represents a word boundary, i.e. either the
  beginning or ending of a word.

- Numeric characters represent a scoring system for indicating the
  acceptability of a hyphen in this location:
  - Odd numbers represent an acceptable location for a hyphen, with higher
    values overriding lower inhibiting values.
  - Even numbers indicate an unacceptable location, with higher values
    overriding lower values indicating an acceptable position.
  - A value of zero (inhibiting) is implied when there is no number present.

Generally, patterns are constructed so that values greater than 4 are rare.
Due to a bug, patterns with values of 8 and greater currently have no
effect, so don't be surprised by that.

Here are some examples from the English patterns file:
Knuth (The TeXbook, Appendix H) uses the example hach4, which indicates that
it is extremely undesirable to place a hyphen after the substring "hach",
for example in the word "toothach-es".

.leg5e indicates that "leg-e", when it occurs at the beginning of a word, is
a very good place for a hyphen, if one is needed. Words like "leg-end"
and "leg-er-de-main" fit this pattern.
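
To make the scoring rules concrete, here is a minimal Python sketch of how
such patterns could be applied to a word. It is only an illustration of the
rules described above, not the code FOP or XEP actually use. The function
names are made up for this example, the pattern 'h1e' is a hypothetical
pattern added only to show a higher even value overriding a lower odd one,
and real hyphenators also enforce minimum fragment lengths at the start and
end of a word, which is left out here.

```python
import re


def parse_pattern(pattern):
    """Split a pattern such as 'hach4' into its letters and gap scores.

    scores[i] is the value for the gap before letters[i]; the last entry
    is the gap after the final letter. Missing digits count as zero.
    """
    letters = re.sub(r"\d", "", pattern)
    scores = [0] * (len(letters) + 1)
    pos = 0
    for ch in pattern:
        if ch.isdigit():
            scores[pos] = int(ch)
        else:
            pos += 1
    return letters, scores


def hyphenation_points(word, patterns):
    """Return the indexes in `word` before which a hyphen is acceptable."""
    padded = "." + word.lower() + "."      # '.' marks the word boundaries
    levels = [0] * (len(padded) + 1)       # levels[i]: gap before padded[i]
    for pattern in patterns:
        letters, scores = parse_pattern(pattern)
        start = padded.find(letters)
        while start != -1:
            for offset, score in enumerate(scores):
                # The highest value seen for a gap wins.
                index = start + offset
                levels[index] = max(levels[index], score)
            start = padded.find(letters, start + 1)
    # Only gaps strictly inside the word count; odd means "acceptable".
    return [i - 1 for i in range(2, len(padded) - 1) if levels[i] % 2 == 1]


def hyphenate(word, patterns):
    """Insert '-' at every acceptable position."""
    pieces, last = [], 0
    for point in hyphenation_points(word, patterns):
        pieces.append(word[last:point])
        last = point
    pieces.append(word[last:])
    return "-".join(pieces)


print(hyphenate("legend", [".leg5e"]))               # leg-end
print(hyphenate("toothaches", ["h1e"]))              # toothach-es
print(hyphenate("toothaches", ["h1e", "hach4"]))     # toothaches
```

With only the hypothetical 'h1e' the break "toothach-es" is allowed; once
hach4 is added, the even value 4 overrides the odd value 1 and the break
disappears.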
 
rgrds

Kurt Spescha
Consor AG, CH-8006 Zürich
+41 (0)44 368 35 44
+41 (0)76 560 66 89
kurt.spescha@consor.ch

-----Original Message-----
From: owner-xep-support@renderx.com [mailto:owner-xep-support@renderx.com] On
behalf of Jacques Foucry
Sent: Monday, 22 January 2007 13:46
To: xep-support@renderx.com
Subject: Re: AW: [xep-support] French hyphenation bug ?

On Mon, 22 January 2007 11:47, Kurt Spescha wrote:

Kurt,

> Sometimes the rules in the hyphenation file (originally from TeX/LaTeX)
> may be incomplete, so you have to add a rule. The easiest way to see
> whether the hyphenation file works is to add a rule with the whole word,
> e.g. 'pan5ther'. Looking at the French file, it seems in any case that
> there is no rule for how to hyphenate 'nthe' (an 'n' before a 'th').

I had a look at this file and... it's a little bit complicated. Do you
know a URL with an explanation of all the symbols (., 2, 5, etc.)?

Thanks in advance,

Jacques

-------------------
(*) To unsubscribe, send a message with words 'unsubscribe xep-support'
in the body of the message to majordomo@renderx.com from the address
you are subscribed from.
(*) By using the Service, you expressly agree to these Terms of Service
http://www.renderx.com/terms-of-service.html

Received on Mon Jan 22 05:25:42 2007

This archive was generated by hypermail 2.1.8 : Mon Jan 22 2007 - 05:25:43 PST