Events service (kontact) could take a lot of ram on my computer (4 GiB for example). I think most part are taken for storing mail indexes. But most people are not aware about mails in some language (for example: spams, advertisement or information from portal user do not look at mails often).
Solution is simple:
- Allow to turn on per-language indexing
- If message are recognized to be written in some language, it will be indexed in index related to one’s language
- User may select, which language to keep in mail and can select language field in search dialog window, so only messages in specified languages was searched
- User may select how many words in some language must be placed in mail to index in this language-related index (word number or percentage count of words; one word could belongs to many language, like car: person higher than king in hierarchy in Polish and car/automobile in English)
- They could be selected languages to be added to white list and black list, so Kontact could move some messages to spam
Firstly, I think user will never search in some languages, so why kept index in memory?
Additionally, there could be other domain of splitting messages, like from.