[Tshwanelex-l] Digraph and trigraph sorting etc.

David Joffe david.joffe at tshwanedje.com
Thu Aug 2 22:41:18 EDT 2012

Oh yes, one quick 'caveat' - if you set up digraph sorting in the 
new version, save the file, then load the file in a pre-.653 version 
- the file should load, but the digraph sorting won't work properly 
anymore. (Hopefully that should be relatively obvious.)

On 3 Aug 2012 at 3:45, David Joffe wrote:

From:	"David Joffe" <david.joffe at tshwanedje.com>
To:	tshwanelex-l at mailman.yale.edu,
       	Menan Du Plessis <menan.du.plessis at gmail.com>
Date sent:	Fri, 03 Aug 2012 03:45:40 +0200
Priority:	normal
Subject:	[Tshwanelex-l] Digraph and trigraph sorting etc.

> Hi all,
> A test version of the (long-awaited) digraph (and trigraph etc.) 
> supporting version of the table-based configurable sorting in TLex 
> is now available:
> http://tshwanedje.com/builds/653/
> It would be great if it could get a little more testing before being 
> rolled into the 'formal' build, so anyone waiting for this 
> functionality, please give it a go. A small sample is also attached.
> Basically the old user interface for configuring the sorting has 
> changed in only one way (which you can't miss): Whereas before, all 
> variant forms were on one line, e.g.:
> a -> variant-forms: Aa
> now, the variant forms are separated by line breaks, e.g.:
> a -> variant-forms:
> A
> a
> so this allows a digraph to be entered as, for example:
> ch -> variant-forms:
> ch
> CH
> Ch
> cH
> If it's not reasonably clear let me know?
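
To make the idea concrete, here is a minimal Python sketch of table-based sorting with digraphs. It is only an illustration, not how TLex implements it internally: the table contents, SORT_TABLE, build_lookup and sort_key are all made-up names, and the longest-match rule is an assumption about how a multi-character 'letter' would be recognised.

```python
# Hypothetical sort table: each alphabet "letter" with its variant forms,
# listed in alphabet order. This mirrors the one-variant-per-line layout
# described above; it is not TLex's actual file format.
SORT_TABLE = [
    ("a", ["A", "a"]),
    ("b", ["B", "b"]),
    ("c", ["C", "c"]),
    ("ch", ["ch", "CH", "Ch", "cH"]),
    ("d", ["D", "d"]),
    ("h", ["H", "h"]),
]


def build_lookup(table):
    """Map every variant form to the rank of its letter in the alphabet."""
    lookup = {}
    for rank, (_letter, variants) in enumerate(table):
        for variant in variants:
            lookup[variant] = rank
    return lookup


def sort_key(word, lookup):
    """Split a word into letters, preferring the longest matching variant."""
    max_len = max(len(variant) for variant in lookup)
    key = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, so "ch" wins over "c" + "h".
        for size in range(min(max_len, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in lookup:
                key.append(lookup[chunk])
                i += size
                break
        else:
            # Unknown character: sort it after all configured letters.
            key.append(len(lookup) + ord(word[i]))
            i += 1
    return key


LOOKUP = build_lookup(SORT_TABLE)
words = ["dab", "chad", "cab", "Cha", "bad", "cd"]
result = sorted(words, key=lambda w: sort_key(w, LOOKUP))
print(result)  # ['bad', 'cab', 'cd', 'Cha', 'chad', 'dab']
```

Note that "cd" comes before "Cha" here, because "ch" is treated as its own letter placed after "c", whereas a plain character-by-character sort would put "Cha" first.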
> Note this same system should be able to work out of the box (well, with
> some configuration of course per dictionary etc.) not only for 
> digraphs (or trigraphs), but also other cases where one alphabetic 
> 'letter' may be represented by a string of 2 or more Unicode 
> characters as a variant form - so e.g. that would include combining 
> diacritic forms, and also non-canonical variants (that basically
> refers to when there is more than one way in
> Unicode to represent the same character), e.g. e with accent can be
> represented, for example, either as the single precombined form:
> U+00e9
> or, slightly more unusually, as e.g. a plain latin 'e' with 
> combining accent:
> U+0065 + U+0301
> (However, for most users compiling dictionaries, implementing that 
> level within the sorting is not generally necessary, and can anyway 
> be worked around in such a controlled environment. But anyway, it 
> might be of theoretical or even practical interest to some.)
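
For anyone curious about the two representations of e with accent, here is a small Python sketch using the standard unicodedata module. Unicode treats the two sequences as canonically equivalent, so normalising both to the same form makes them compare equal. The helper name same_letter is made up for illustration, and this says nothing about what TLex itself does with such data.

```python
import unicodedata

precomposed = "\u00e9"      # single precombined code point
combining = "\u0065\u0301"  # plain Latin 'e' followed by a combining acute accent


def same_letter(a, b):
    """Compare two strings after normalising both to composed form (NFC)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(precomposed == combining)             # False: different code point sequences
print(same_letter(precomposed, combining))  # True: equal once normalised
print(len(precomposed), len(combining))     # 1 2
```

One workaround in a controlled environment is therefore to normalise all headwords to a single form before they reach the sort table, so that only one variant ever needs to be listed.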
>  - David
> On 17 Jul 2012 at 8:19, Menan Du Plessis wrote:
> Date sent:	Tue, 17 Jul 2012 08:19:00 +0200
> From:	Menan Du Plessis <menan.du.plessis at gmail.com>
> To:	tshwanelex-l at mailman.yale.edu
> Subject:	[Tshwanelex-l] sending sample to a 3rd party; configuring sorting
> > 
> > Hi everyone.
> > 
> > I need to send a sample soon to the people who run the electronic 
> > archive (ELAR, at SOAS) where our material will be deposited. (Am 
> > working on an extremely endangered language, with essentially one last 
> > speaker, who is 100 years old.) So my first urgent question is:
> > 
> > Q: How do I organize my sound files plus the dictionary so that my 
> > recipient will be able to click on the icon and hear the recorded 
> > examples I have linked to my entries? Is there a way to put them both 
> > into one folder, and then specify the directory in a neutral way, i.e. 
> > that doesn't include a drive on my own computer as part of the address? 
> > 
> > And my second urgent question is:
> > 
> > Q: How do I configure the sorting where digraphs, trigraphs (and 
> > polygraphs) need to be accommodated on the first pass?
> > 
> > I know that David will soon be getting to me with advice on the second 
> > question - and I do understand that everyone has been super busy getting 
> > the 2012 version ready for release ... - but in the meantime, it
> > occurs to me that this must be a pretty common need, so I'd like to put 
> > it out there, in case anyone else has already worked out how to do this?
> > 
> > I am starting to fret very slightly!
> > Best wishes,
> > Menan

