So if i resume the requests :
With "
année" as searchstring, it could be interesting that AjaxSearch retrieve :
1/
année
2/
annee
3/
année
in the content of documents;
1/ ok
2/ the main problem is to interpret the searchword as a word with one or several accented character and then replace these characters by the unaccented equivalent character. For French, Spanish, italian and Portuguese, i know that each accented character have an unaccented equivalent character. Even, if the meaning of the word change, it could be possible to find the equivalent character. (Even in french for example ... "mais" means "but" and "maïs" means "corn"
)
But is it true for others languages ? Not sure for example that in Cyrillic, all the accented characters have an unaccented equivalent character, used and understood by people.
3/ as Sottwell, i think it will be time consuming and not efficient. I think it’s better to store the document contents in a "raw" format
With "annee" as searchstring, AjaxSearch should retrieve :
1/ année
2/ annee
3/ année
in the content of documents;
1/ unfortunately, without a dictionary (in the appropriate language) and a word recognition, i think it ’s not so easy to detect the equivalent accented word.
2/ ok
3/ as 1/ i think that it ’s not possible without a dictionary and a context analysis