<![CDATA[ AjaxSearch and accented characters - MODX Community Forums]]> https://forums.modx.com/thread/?thread=47879 <![CDATA[Re: AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277064 e-stonia Aug 30, 2010, 04:27 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277064 <![CDATA[Re: AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277063
I see that you have found a solution for everybody else, but I have the same problem here: http://goeschke.at/suche.html. Try searching for "aber", for example.

Evo 1.0.4 & AS 1.9.0
MySQL charset: UTF-8 Unicode (from beginning)
MySQL connection collation: utf8_unicode_ci (from beginning)
MODx character encoding: UTF-8 (from beginning)
TinyMCE Entity Encoding: raw (it was set to named; after changing it to raw I created a new document and checked that it works, but the AS results are still garbled)

So it seems everything is set up correctly, but the issue remains. I probably need to change servers, since special characters work in AS on another server, but I am still hoping to avoid that: I would have to move about 20 email accounts, with all their old webmail messages, to the new server :S
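
When every setting reports UTF-8 and the results are still garbled, one common cause (an assumption here, not a diagnosis of the site above) is that a single layer between the database and the page reads UTF-8 bytes using another charset. A minimal Python sketch, assuming the wrong charset is Latin-1, shows what that looks like and how to recognise it. The function names are hypothetical and are not part of MODx or AjaxSearch.

```python
# Illustration only: what UTF-8 text looks like when one layer
# (assumed here to be Latin-1) decodes the bytes with the wrong charset.

def misread_as_latin1(text: str) -> str:
    """Encode as UTF-8, then decode the same bytes as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair_latin1_misread(garbled: str) -> str:
    """Reverse the mistake above; raises if the text was not garbled that way."""
    return garbled.encode("latin-1").decode("utf-8")


garbled = misread_as_latin1("année")
print(garbled)                         # the accented letter becomes two characters
print(repair_latin1_misread(garbled))  # round-trips back to the original word
```

If the search results show each accented letter as two characters like this, the data itself is probably fine and the mismatch is somewhere in the connection or output step.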

e-stonia Aug 28, 2010, 12:54 PM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277063
<![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277062 dev_cw Nov 28, 2007, 05:51 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277062 <![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277061 How could we ask for that? And what would the impact of this change be for those who use the current behaviour?
In any case, the behaviour of TinyMCE should be highlighted to avoid trouble.]]>
coroico Nov 28, 2007, 05:38 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277061
<![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277060 sottwell Nov 28, 2007, 01:50 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277060 <![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277059

But if it is not possible (or feasible) to change the behaviour of AjaxSearch, doesn’t it make sense for TinyMCE to store accented characters in raw format by default? I mean, I’m certainly not the only one that starts creating a site and saves the search feature for last?

Converting characters (and tables and databases in my case) is not the best way to spend one's time :) If TinyMCE stored the characters in raw format by default, I would probably have noticed if the characters became garbled.

Or would that mess things up for the English-speaking majority?]]>
staed Nov 28, 2007, 01:29 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277059
<![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277058 Solr search server (a subproject of Lucene) with a client project now, and just deciding how to process and store the content in the indexes so I can configure the index schema requires some expertise on the subject. Luckily, my project is all English for now, otherwise, I’d have to be learning how to configure it with various language-specific word-stem analyzers. Ouch! My brain hurts.]]> opengeek Nov 27, 2007, 05:23 PM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277058 <![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277057
With "année" as the search string, it would be useful if AjaxSearch retrieved:
1/ année
2/ annee
3/ ann&eacute;e

in the content of documents;

1/ ok :)

2/ The main problem is to recognise that the search word contains one or more accented characters and then replace them with their unaccented equivalents. For French, Spanish, Italian and Portuguese, I know that each accented character has an unaccented equivalent. Even if the meaning of the word changes, the equivalent character can still be found. (In French, for example, "mais" means "but" and "maïs" means "corn" :D)
But is that true for other languages? I am not sure, for example, that in Cyrillic every accented character has an unaccented equivalent that people use and understand.

3/ Like sottwell, I think it would be time-consuming and inefficient. I think it is better to store the document content in a "raw" format.
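
To make the three cases concrete, here is a small Python sketch (not AjaxSearch code, which is PHP) that expands one search word into the three stored forms listed above. It assumes Unicode canonical decomposition is an acceptable way to find the "unaccented equivalent". The helper names `strip_accents`, `to_named_entities` and `search_variants` are hypothetical.

```python
import unicodedata
from html.entities import codepoint2name


def strip_accents(word: str) -> str:
    """Drop combining marks after canonical decomposition (é -> e)."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def to_named_entities(word: str) -> str:
    """Replace non-ASCII characters with an HTML named entity where one exists."""
    out = []
    for ch in word:
        name = codepoint2name.get(ord(ch))
        out.append(f"&{name};" if name and ord(ch) > 127 else ch)
    return "".join(out)


def search_variants(term: str) -> list[str]:
    """Hypothetical helper: raw, unaccented and entity-encoded forms, de-duplicated."""
    variants = [term, strip_accents(term), to_named_entities(term)]
    return list(dict.fromkeys(variants))


print(search_variants("année"))  # raw, unaccented, and entity-encoded forms
print(strip_accents("maïs"))     # collapses onto "mais", a different French word
print(strip_accents("й"))        # becomes "и", which is a different Cyrillic letter
```

The last two lines illustrate the doubts raised in 2/: the mechanical equivalent always exists for these characters, but it is not always a form that readers would treat as the same word or letter. Searching for every variant also multiplies the queries, which is the cost mentioned in 3/.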

With "annee" as the search string, AjaxSearch should retrieve:
1/ année
2/ annee
3/ ann&eacute;e

in the content of documents;

1/ Unfortunately, without a dictionary (in the appropriate language) and word recognition, I think it is not so easy to detect the equivalent accented word.
2/ ok :)
3/ As with 1/, I think it is not possible without a dictionary and context analysis.
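
One possible way around the dictionary, sketched here in Python as an assumption and not as something AjaxSearch does, is to avoid guessing the accented word at all. Instead, both the search string and the document content are reduced to the same folded form (entities decoded, accents stripped, case folded) and the comparison is made there. The names `fold` and `contains` are hypothetical.

```python
import html
import unicodedata


def fold(text: str) -> str:
    """Decode HTML entities, strip combining marks, and case-fold."""
    decoded = html.unescape(text)
    decomposed = unicodedata.normalize("NFD", decoded)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def contains(content: str, term: str) -> bool:
    """True if the folded term occurs in the folded content."""
    return fold(term) in fold(content)


documents = ["Bonne année", "Bonne annee", "Bonne ann&eacute;e", "Bon anniversaire"]
hits = [doc for doc in documents if contains(doc, "annee")]
print(hits)  # the three spellings of the same word, but not "anniversaire"
```

The trade-off is the one already noted for "mais" and "maïs": in the folded form the two are indistinguishable, so a search for one also returns the other. Folding the content at query time is also slow on a large site, so in practice the folded text would probably need to be stored or indexed alongside the original.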
coroico Nov 27, 2007, 04:34 PM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277057
<![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277056 sottwell Nov 26, 2007, 07:20 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277056 <![CDATA[AjaxSearch and accented characters]]> https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277055
I still think that it would be better if AjaxSearch searched for entities as well as the raw characters. Should I file a bug report or is my request noted?]]>
staed Nov 26, 2007, 07:05 AM https://forums.modx.com/thread/47879/ajaxsearch-and-accented-characters?page=2#dis-post-277055