www.auctionsieve.com
The AuctionSieve forums
 
 FAQFAQ   SearchSearch   MemberlistMemberlist   UsergroupsUsergroups   RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

Post new topic   Reply to topic
View previous topic :: View next topic  
Author Message
TedC
Active contributor


Joined: 13 Nov 2005
Posts: 9

PostPosted: Sun Nov 13, 2005 7:54 pm    Post subject: Character Sets Reply with quote

Neville, I have noticed that, spanning european countries with my searches, there are often character set problems with the catch words etc.

Would it be possible to do a character substitution for the problematic characters on catch and trash words and the return list from an eBay search?

ä=a
é, è, ê = e
ï = i
ö, œ = o
ü, ù = u
ß = ss

And the corresponding uppercases for AEIOU.
And possible a handful of others (I have no knowledge about spanish, for example).

This would at least catch or trash eg. Anais and Anaïs (notice the trema over the i) for french searches.

Best,
Ted
Back to top
View user's profile Send private message
nev
Site Admin


Joined: 15 Sep 2004
Posts: 1144
Location: Sydney, Australia

PostPosted: Mon Nov 14, 2005 1:40 pm    Post subject: Reply with quote

Ok I've added it to the list at Misc26
http://www.auctionsieve.com/forums/viewtopic.php?p=469#469


Last edited by nev on Tue Nov 15, 2005 10:30 am; edited 1 time in total
Back to top
View user's profile Send private message Visit poster's website
TedC
Active contributor


Joined: 13 Nov 2005
Posts: 9

PostPosted: Mon Nov 14, 2005 4:52 pm    Post subject: Reply with quote

Thanks, Nev.

I am not sure I really understand i18n, but I feel this is not what I had in mind. Or you understood me, but I didn't understand your reply ;-)

My idea was to *reduce* from i18n to ASCII for catch and trash words. The effect would be that no matter how sellers write things, with french accents, german umlauts etc. disappear, so that géle, gélé, gèle etc. would all fall into "gele".

Also, this was meant to be only for catch and trash, because for searches, these characters need to be retained: a search for "Muller" won't find "Müller" on ebay.de

I have one to add: ÿ (y-trema) should give y.

Best from France,
Ted
Back to top
View user's profile Send private message
nev
Site Admin


Joined: 15 Sep 2004
Posts: 1144
Location: Sydney, Australia

PostPosted: Tue Nov 15, 2005 10:12 am    Post subject: Reply with quote

Ah - ok, I've added it as Misc27
Back to top
View user's profile Send private message Visit poster's website
Display posts from previous:   
Post new topic   Reply to topic All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group