Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_nl.txt
------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_no.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_no.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_no.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_no.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,194 @@ + | From svn.tartarus.org/snowball/trunk/website/algorithms/norwegian/stop.txt + | This file is distributed under the BSD License. + | See http://snowball.tartarus.org/license.php + | Also see http://www.opensource.org/licenses/bsd-license.html + | - Encoding was converted to UTF-8. + | - This notice was added. + | + | NOTE: To use this file with StopFilterFactory, you must specify format="snowball" + + | A Norwegian stop word list. Comments begin with vertical bar. Each stop + | word is at the start of a line. + + | This stop word list is for the dominant bokmål dialect. Words unique + | to nynorsk are marked *. + + | Revised by Jan Bruusgaard <[hidden email]>, Jan 2005 + +og | and +i | in +jeg | I +det | it/this/that +at | to (w. inf.)
+en | a/an +et | a/an +den | it/this/that +til | to +er | is/am/are +som | who/that +på | on +de | they / you(formal) +med | with +han | he +av | of +ikke | not +ikkje | not * +der | there +så | so +var | was/were +meg | me +seg | you +men | but +ett | one +har | have +om | about +vi | we +min | my +mitt | my +ha | have +hadde | had +hun | she +nå | now +over | over +da | when/as +ved | by/know +fra | from +du | you +ut | out +sin | your +dem | them +oss | us +opp | up +man | you/one +kan | can +hans | his +hvor | where +eller | or +hva | what +skal | shall/must +selv | self (reflective) +sjøl | self (reflective) +her | here +alle | all +vil | will +bli | become +ble | became +blei | became * +blitt | have become +kunne | could +inn | in +når | when +være | be +kom | come +noen | some +noe | some +ville | would +dere | you +som | who/which/that +deres | their/theirs +kun | only/just +ja | yes +etter | after +ned | down +skulle | should +denne | this +for | for/because +deg | you +si | hers/his +sine | hers/his +sitt | hers/his +mot | against +å | to +meget | much +hvorfor | why +dette | this +disse | these/those +uten | without +hvordan | how +ingen | none +din | your +ditt | your +blir | become +samme | same +hvilken | which +hvilke | which (plural) +sånn | such a +inni | inside/within +mellom | between +vår | our +hver | each +hvem | who +vors | us/ours +hvis | whose +både | both +bare | only/just +enn | than +fordi | as/because +før | before +mange | many +også | also +slik | just +vært | been +være | to be +båe | both * +begge | both +siden | since +dykk | your * +dykkar | yours * +dei | they * +deira | them * +deires | theirs * +deim | them * +di | your (fem.)
* +då | as/when * +eg | I * +ein | a/an * +eit | a/an * +eitt | a/an * +elles | or * +honom | he * +hjå | at * +ho | she * +hoe | she * +henne | her +hennar | her/hers +hennes | hers +hoss | how * +hossen | how * +ikkje | not * +ingi | noone * +inkje | noone * +korleis | how * +korso | how * +kva | what/which * +kvar | where * +kvarhelst | where * +kven | who/whom * +kvi | why * +kvifor | why * +me | we * +medan | while * +mi | my * +mine | my * +mykje | much * +no | now * +nokon | some (masc./neut.) * +noka | some (fem.) * +nokor | some * +noko | some * +nokre | some * +si | his/hers * +sia | since * +sidan | since * +so | so * +somt | some * +somme | some * +um | about* +upp | up * +vere | be * +vore | was * +verte | become * +vort | become * +varte | became * +vart | became * + Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_no.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_pt.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_pt.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_pt.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_pt.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,253 @@ + | From svn.tartarus.org/snowball/trunk/website/algorithms/portuguese/stop.txt + | This file is distributed under the BSD License. + | See http://snowball.tartarus.org/license.php + | Also see http://www.opensource.org/licenses/bsd-license.html + | - Encoding was converted to UTF-8. + | - This notice was added. + | + | NOTE: To use this file with StopFilterFactory, you must specify format="snowball" + + | A Portuguese stop word list. Comments begin with vertical bar.
Each stop + | word is at the start of a line. + + + | The following is a ranked list (commonest to rarest) of stopwords + | deriving from a large sample of text. + + | Extra words have been added at the end. + +de | of, from +a | the; to, at; her +o | the; him +que | who, that +e | and +do | de + o +da | de + a +em | in +um | a +para | for + | é from SER +com | with +não | not, no +uma | a +os | the; them +no | em + o +se | himself etc +na | em + a +por | for +mais | more +as | the; them +dos | de + os +como | as, like +mas | but + | foi from SER +ao | a + o +ele | he +das | de + as + | tem from TER +à | a + a +seu | his +sua | her +ou | or + | ser from SER +quando | when +muito | much + | há from HAV +nos | em + os; us +já | already, now + | está from EST +eu | I +também | also +só | only, just +pelo | per + o +pela | per + a +até | up to +isso | that +ela | he +entre | between + | era from SER +depois | after +sem | without +mesmo | same +aos | a + os + | ter from TER +seus | his +quem | whom +nas | em + as +me | me +esse | that +eles | they + | estão from EST +você | you + | tinha from TER + | foram from SER +essa | that +num | em + um +nem | nor +suas | her +meu | my +às | a + as +minha | my + | têm from TER +numa | em + uma +pelos | per + os +elas | they + | havia from HAV + | seja from SER +qual | which + | será from SER +nós | we + | tenho from TER +lhe | to him, her +deles | of them +essas | those +esses | those +pelas | per + as +este | this + | fosse from SER +dele | of him + + | other words. There are many contractions such as naquele = em+aquele, + | mo = me+o, but they are rare. + | Indefinite article plural forms are also rare.
+ +tu | thou +te | thee +vocês | you (plural) +vos | you +lhes | to them +meus | my +minhas +teu | thy +tua +teus +tuas +nosso | our +nossa +nossos +nossas + +dela | of her +delas | of them + +esta | this +estes | these +estas | these +aquele | that +aquela | that +aqueles | those +aquelas | those +isto | this +aquilo | that + + | forms of estar, to be (not including the infinitive): +estou +está +estamos +estão +estive +esteve +estivemos +estiveram +estava +estávamos +estavam +estivera +estivéramos +esteja +estejamos +estejam +estivesse +estivéssemos +estivessem +estiver +estivermos +estiverem + + | forms of haver, to have (not including the infinitive): +hei +há +havemos +hão +houve +houvemos +houveram +houvera +houvéramos +haja +hajamos +hajam +houvesse +houvéssemos +houvessem +houver +houvermos +houverem +houverei +houverá +houveremos +houverão +houveria +houveríamos +houveriam + + | forms of ser, to be (not including the infinitive): +sou +somos +são +era +éramos +eram +fui +foi +fomos +foram +fora +fôramos +seja +sejamos +sejam +fosse +fôssemos +fossem +for +formos +forem +serei +será +seremos +serão +seria +seríamos +seriam + + | forms of ter, to have (not including the infinitive): +tenho +tem +temos +tém +tinha +tínhamos +tinham +tive +teve +tivemos +tiveram +tivera +tivéramos +tenha +tenhamos +tenham +tivesse +tivéssemos +tivessem +tiver +tivermos +tiverem +terei +terá +teremos +terão +teria +teríamos +teriam Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_pt.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ro.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ro.txt?rev=1707042&view=auto ============================================================================== ---
ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ro.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ro.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,233 @@ +# This file was created by Jacques Savoy and is distributed under the BSD license. +# See http://members.unine.ch/jacques.savoy/clef/index.html. +# Also see http://www.opensource.org/licenses/bsd-license.html +acea +aceasta +această +aceea +acei +aceia +acel +acela +acele +acelea +acest +acesta +aceste +acestea +aceşti +aceştia +acolo +acum +ai +aia +aibă +aici +al +ăla +ale +alea +ălea +altceva +altcineva +am +ar +are +aş +aşadar +asemenea +asta +ăsta +astăzi +astea +ăstea +ăştia +asupra +aţi +au +avea +avem +aveţi +azi +bine +bucur +bună +ca +că +căci +când +care +cărei +căror +cărui +cât +câte +câţi +către +câtva +ce +cel +ceva +chiar +cînd +cine +cineva +cît +cîte +cîţi +cîtva +contra +cu +cum +cumva +curând +curînd +da +dă +dacă +dar +datorită +de +deci +deja +deoarece +departe +deşi +din +dinaintea +dintr +dintre +drept +după +ea +ei +el +ele +eram +este +eşti +eu +face +fără +fi +fie +fiecare +fii +fim +fiţi +iar +ieri +îi +îl +îmi +împotriva +în +înainte +înaintea +încât +încît +încotro +între +întrucât +întrucît +îţi +la +lângă +le +li +lîngă +lor +lui +mă +mâine +mea +mei +mele +mereu +meu +mi +mine +mult +multă +mulţi +ne +nicăieri +nici +nimeni +nişte +noastră +noastre +noi +noştri +nostru +nu +ori +oricând +oricare +oricât +orice +oricînd +oricine +oricît +oricum +oriunde +până +pe +pentru +peste +pînă +poate +pot +prea +prima +primul +prin +printr +sa +să +săi +sale +sau +său +se +şi +sînt +sîntem +sînteţi +spre +sub +sunt +suntem +sunteţi +ta +tăi +tale +tău +te +ţi +ţie +tine +toată +toate +tot +toţi +totuşi +tu +un +una +unde +undeva +unei +unele +uneori +unor +vă +vi +voastră +voastre +voi +voştri +vostru +vouă +vreo +vreun Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ro.txt
------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ru.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ru.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ru.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ru.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,243 @@ + | From svn.tartarus.org/snowball/trunk/website/algorithms/russian/stop.txt + | This file is distributed under the BSD License. + | See http://snowball.tartarus.org/license.php + | Also see http://www.opensource.org/licenses/bsd-license.html + | - Encoding was converted to UTF-8. + | - This notice was added. + | + | NOTE: To use this file with StopFilterFactory, you must specify format="snowball" + + | a russian stop word list. comments begin with vertical bar. each stop + | word is at the start of a line. + + | this is a ranked list (commonest to rarest) of stopwords derived from + | a large text sample. + + | letter `Ñ' is translated to `е'. + +и | and +в | in/into +во | alternative form +не | not +ÑÑо | what/that +он | he +на | on/onto +Ñ | i +Ñ | from +Ñо | alternative form +как | how +а | milder form of `no' (but) +Ñо | conjunction and form of `that' +вÑе | all +она | she +Ñак | so, thus +его | him +но | but +да | yes/and +ÑÑ | thou +к | towards, by +Ñ | around, chez +же | intensifier particle +Ð²Ñ | you +за | beyond, behind +Ð±Ñ | conditional/subj. 
particle +по | up to, along +ÑолÑко | only +ее | her +мне | to me +бÑло | it was +Ð²Ð¾Ñ | here is/are, particle +Ð¾Ñ | away from +Ð¼ÐµÐ½Ñ | me +еÑе | still, yet, more +Ð½ÐµÑ | no, there isnt/arent +о | about +из | out of +ÐµÐ¼Ñ | to him +ÑепеÑÑ | now +когда | when +даже | even +Ð½Ñ | so, well +вдÑÑг | suddenly +ли | interrogative particle +еÑли | if +Ñже | already, but homonym of `narrower' +или | or +ни | neither +бÑÑÑ | to be +бÑл | he was +него | prepositional form of его +до | up to +Ð²Ð°Ñ | you accusative +нибÑÐ´Ñ | indef. suffix preceded by hyphen +опÑÑÑ | again +Ñж | already, but homonym of `adder' +вам | to you +Ñказал | he said +Ð²ÐµÐ´Ñ | particle `after all' +Ñам | there +поÑом | then +ÑÐµÐ±Ñ | oneself +ниÑего | nothing +ей | to her +Ð¼Ð¾Ð¶ÐµÑ | usually with `бÑÑÑ' as `maybe' +они | they +ÑÑÑ | here +где | where +еÑÑÑ | there is/are +надо | got to, must +ней | prepositional form of ей +Ð´Ð»Ñ | for +Ð¼Ñ | we +ÑÐµÐ±Ñ | thee +Ð¸Ñ | them, their +Ñем | than +бÑла | she was +Ñам | self +ÑÑоб | in order to +без | without +бÑдÑо | as if +Ñеловек | man, person, one +Ñего | genitive form of `what' +Ñаз | once +Ñоже | also +Ñебе | to oneself +под | beneath +Ð¶Ð¸Ð·Ð½Ñ | life +бÑÐ´ÐµÑ | will be +ж | short form of intensifer particle `же' +Ñогда | then +кÑо | who +ÑÑÐ¾Ñ | this +говоÑил | was saying +Ñого | genitive form of `that' +поÑÐ¾Ð¼Ñ | for that reason +ÑÑого | genitive form of `this' +какой | which +ÑовÑем | altogether +ним | prepositional form of `его', `они' +здеÑÑ | here +ÑÑом | prepositional form of `ÑÑоÑ' +один | one +поÑÑи | almost +мой | my +Ñем | instrumental/dative plural of `ÑоÑ', `Ñо' +ÑÑÐ¾Ð±Ñ | full form of `in order that' +нее | her (acc.) +кажеÑÑÑ | it seems +ÑейÑÐ°Ñ | now +бÑли | they were +кÑда | where to +заÑем | why +ÑказаÑÑ | to say +вÑÐµÑ | all (acc., gen. preposn. 
plural) +никогда | never +ÑÐµÐ³Ð¾Ð´Ð½Ñ | today +можно | possible, one can +пÑи | by +Ð½Ð°ÐºÐ¾Ð½ÐµÑ | finally +два | two +об | alternative form of `о', about +дÑÑгой | another +Ñ Ð¾ÑÑ | even +поÑле | after +над | above +болÑÑе | more +ÑÐ¾Ñ | that one (masc.) +ÑеÑез | across, in +ÑÑи | these +Ð½Ð°Ñ | us +пÑо | about +вÑего | in all, only, of all +Ð½Ð¸Ñ | prepositional form of `они' (they) +ÐºÐ°ÐºÐ°Ñ | which, feminine +много | lots +Ñазве | interrogative particle +Ñказала | she said +ÑÑи | three +ÑÑÑ | this, acc. fem. sing. +Ð¼Ð¾Ñ | my, feminine +впÑоÑем | moreover, besides +Ñ Ð¾ÑоÑо | good +ÑÐ²Ð¾Ñ | ones own, acc. fem. sing. +ÑÑой | oblique form of `ÑÑа', fem. `this' +пеÑед | in front of +иногда | sometimes +лÑÑÑе | better +ÑÑÑÑ | a little +Ñом | preposn. form of `that one' +нелÑÐ·Ñ | one must not +Ñакой | such a one +им | to them +более | more +вÑегда | always +конеÑно | of course +вÑÑ | acc. fem. sing of `all' +Ð¼ÐµÐ¶Ð´Ñ | between + + + | b: some paradigms + | + | personal pronouns + | + | Ñ Ð¼ÐµÐ½Ñ Ð¼Ð½Ðµ мной [мноÑ] + | ÑÑ ÑÐµÐ±Ñ Ñебе Ñобой [ÑобоÑ] + | он его ÐµÐ¼Ñ Ð¸Ð¼ [него, немÑ, ним] + | она ее Ñи ÐµÑ [нее, нÑи, неÑ] + | оно его ÐµÐ¼Ñ Ð¸Ð¼ [него, немÑ, ним] + | + | Ð¼Ñ Ð½Ð°Ñ Ð½Ð°Ð¼ нами + | Ð²Ñ Ð²Ð°Ñ Ð²Ð°Ð¼ вами + | они Ð¸Ñ Ð¸Ð¼ ими [Ð½Ð¸Ñ , ним, ними] + | + | ÑÐµÐ±Ñ Ñебе Ñобой [ÑобоÑ] + | + | demonstrative pronouns: ÑÑÐ¾Ñ (this), ÑÐ¾Ñ (that) + | + | ÑÑÐ¾Ñ ÑÑа ÑÑо ÑÑи + | ÑÑого ÑÑÑ ÑÑо ÑÑи + | ÑÑого ÑÑой ÑÑого ÑÑÐ¸Ñ + | ÑÑÐ¾Ð¼Ñ ÑÑой ÑÑÐ¾Ð¼Ñ ÑÑим + | ÑÑим ÑÑой ÑÑим [ÑÑоÑ] ÑÑими + | ÑÑом ÑÑой ÑÑом ÑÑÐ¸Ñ + | + | ÑÐ¾Ñ Ñа Ñо Ñе + | Ñого ÑÑ Ñо Ñе + | Ñого Ñой Ñого ÑÐµÑ + | ÑÐ¾Ð¼Ñ Ñой ÑÐ¾Ð¼Ñ Ñем + | Ñем Ñой Ñем [ÑоÑ] Ñеми + | Ñом Ñой Ñом ÑÐµÑ + | + | determinative pronouns + | + | (a) веÑÑ (all) + | + | веÑÑ Ð²ÑÑ Ð²Ñе вÑе + | вÑего вÑÑ Ð²Ñе вÑе + | вÑего вÑей вÑего вÑÐµÑ + | вÑÐµÐ¼Ñ Ð²Ñей вÑÐµÐ¼Ñ Ð²Ñем + | вÑем вÑей вÑем [вÑеÑ] вÑеми + | вÑем вÑей вÑем вÑÐµÑ + | + | (b) Ñам (himself etc) + | + | Ñам Ñама Ñамо Ñами + | Ñамого ÑÐ°Ð¼Ñ Ñамо ÑÐ°Ð¼Ð¸Ñ + | Ñамого Ñамой 
Ñамого ÑÐ°Ð¼Ð¸Ñ + | ÑÐ°Ð¼Ð¾Ð¼Ñ Ñамой ÑÐ°Ð¼Ð¾Ð¼Ñ Ñамим + | Ñамим Ñамой Ñамим [ÑамоÑ] Ñамими + | Ñамом Ñамой Ñамом ÑÐ°Ð¼Ð¸Ñ + | + | stems of verbs `to be', `to have', `to do' and modal + | + | бÑÑÑ Ð±Ñ Ð±Ñд бÑв еÑÑÑ ÑÑÑÑ + | име + | дел + | мог мож моÑÑ + | Ñме + | Ñ Ð¾Ñ Ñ Ð¾Ñ + | долж + | можн + | нÑжн + | нелÑÐ·Ñ + Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_ru.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_sv.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_sv.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_sv.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_sv.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,133 @@ + | From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt + | This file is distributed under the BSD License. + | See http://snowball.tartarus.org/license.php + | Also see http://www.opensource.org/licenses/bsd-license.html + | - Encoding was converted to UTF-8. + | - This notice was added. + | + | NOTE: To use this file with StopFilterFactory, you must specify format="snowball" + + | A Swedish stop word list. Comments begin with vertical bar. Each stop + | word is at the start of a line. + + | This is a ranked list (commonest to rarest) of stopwords derived from + | a large text sample. + + | Swedish stop words occasionally exhibit homonym clashes. For example + | sÃ¥ = so, but also seed. These are indicated clearly below. 
+ +och | and +det | it, this/that +att | to (with infinitive) +i | in, at +en | a +jag | I +hon | she +som | who, that +han | he +på | on +den | it, this/that +med | with +var | where, each +sig | him(self) etc +för | for +så | so (also: seed) +till | to +är | is +men | but +ett | a +om | if; around, about +hade | had +de | they, these/those +av | of +icke | not, no +mig | me +du | you +henne | her +då | then, when +sin | his +nu | now +har | have +inte | inte någon = no one +hans | his +honom | him +skulle | 'sake' +hennes | her +där | there +min | my +man | one (pronoun) +ej | nor +vid | at, by, on (also: vast) +kunde | could +något | some etc +från | from, off +ut | out +när | when +efter | after, behind +upp | up +vi | we +dem | them +vara | be +vad | what +över | over +än | than +dig | you +kan | can +sina | his +här | here +ha | have +mot | towards +alla | all +under | under (also: wonder) +någon | some etc +eller | or (else) +allt | all +mycket | much +sedan | since +ju | why +denna | this/that +själv | myself, yourself etc +detta | this/that +åt | to +utan | without +varit | was +hur | how +ingen | no +mitt | my +ni | you +bli | to be, become +blev | from bli +oss | us +din | thy +dessa | these/those +några | some etc +deras | their +blir | from bli +mina | my +samma | (the) same +vilken | who, that +er | you, your +sådan | such a +vår | our +blivit | from bli +dess | its +inom | within +mellan | between +sådant | such a +varför | why +varje | each +vilka | who, that +ditt | thy +vem | who +vilket | who, that +sitta | his +sådana | such a +vart | each +dina | thy +vars | whose +vårt | our +våra | our +ert | your +era | your +vilkas | whose + Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_sv.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_th.txt URL:
http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_th.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_th.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_th.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,119 @@ +# Thai stopwords from: +# "Opinion Detection in Thai Political News Columns +# Based on Subjectivity Analysis" +# Khampol Sukhum, Supot Nitsuwat, and Choochart Haruechaiyasak +à¹à¸§à¹ +à¹à¸¡à¹ +à¹à¸ +à¹à¸à¹ +à¹à¸«à¹ +à¹à¸ +à¹à¸à¸¢ +à¹à¸«à¹à¸ +à¹à¸¥à¹à¸§ +à¹à¸¥à¸° +à¹à¸£à¸ +à¹à¸à¸ +à¹à¸à¹ +à¹à¸à¸ +à¹à¸«à¹à¸ +à¹à¸¥à¸¢ +à¹à¸£à¸´à¹à¸¡ +à¹à¸£à¸² +à¹à¸¡à¸·à¹à¸ +à¹à¸à¸·à¹à¸ +à¹à¸à¸£à¸²à¸° +à¹à¸à¹à¸à¸à¸²à¸£ +à¹à¸à¹à¸ +à¹à¸à¸´à¸à¹à¸à¸¢ +à¹à¸à¸´à¸ +à¹à¸à¸·à¹à¸à¸à¸à¸²à¸ +à¹à¸à¸µà¸¢à¸§à¸à¸±à¸ +à¹à¸à¸µà¸¢à¸§ +à¹à¸à¹à¸ +à¹à¸à¸à¸²à¸° +à¹à¸à¸¢ +à¹à¸à¹à¸² +à¹à¸à¸² +à¸à¸µà¸ +à¸à¸²à¸ +à¸à¸°à¹à¸£ +à¸à¸à¸ +à¸à¸¢à¹à¸²à¸ +à¸à¸¢à¸¹à¹ +à¸à¸¢à¸²à¸ +หาภ+หลาย +หลัà¸à¸à¸²à¸ +หลัภ+หรืภ+หà¸à¸¶à¹à¸ +สà¹à¸§à¸ +สà¹à¸ +สุภ+สà¹à¸²à¸«à¸£à¸±à¸ +วà¹à¸² +วัภ+ลภ+รà¹à¸§à¸¡ +ราย +รัภ+ระหวà¹à¸²à¸ +รวม +ยัภ+มี +มาภ+มา +à¸à¸£à¹à¸à¸¡ +à¸à¸ +à¸à¹à¸²à¸ +à¸à¸¥ +à¸à¸²à¸ +à¸à¹à¸² +à¸à¸µà¹ +à¸à¹à¸² +à¸à¸±à¹à¸ +à¸à¸±à¸ +à¸à¸à¸à¸à¸²à¸ +à¸à¸¸à¸ +à¸à¸µà¹à¸ªà¸¸à¸ +à¸à¸µà¹ +à¸à¹à¸²à¹à¸«à¹ +à¸à¹à¸² +à¸à¸²à¸ +à¸à¸±à¹à¸à¸à¸µà¹ +à¸à¸±à¹à¸ +à¸à¹à¸² +à¸à¸¹à¸ +à¸à¸¶à¸ +à¸à¹à¸à¸ +à¸à¹à¸²à¸à¹ +à¸à¹à¸²à¸ +à¸à¹à¸ +à¸à¸²à¸¡ +à¸à¸±à¹à¸à¹à¸à¹ +à¸à¸±à¹à¸ +à¸à¹à¸²à¸ +à¸à¹à¸§à¸¢ +à¸à¸±à¸ +à¸à¸¶à¹à¸ +à¸à¹à¸§à¸ +à¸à¸¶à¸ +à¸à¸²à¸ +à¸à¸±à¸ +à¸à¸° +à¸à¸·à¸ +à¸à¸§à¸²à¸¡ +à¸à¸£à¸±à¹à¸ +à¸à¸ +à¸à¸¶à¹à¸ +à¸à¸à¸ +à¸à¸ +à¸à¸à¸° +à¸à¹à¸à¸ +à¸à¹ +à¸à¸²à¸£ +à¸à¸±à¸ +à¸à¸±à¸ +à¸à¸§à¹à¸² +à¸à¸¥à¹à¸²à¸§ Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_th.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: 
ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_tr.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_tr.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_tr.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_tr.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,212 @@ +# Turkish stopwords from LUCENE-559 +# merged with the list from "Information Retrieval on Turkish Texts" +# (http://www.users.muohio.edu/canf/papers/JASIST2008offPrint.pdf) +acaba +altmış +altı +ama +ancak +arada +aslında +ayrıca +bana +bazı +belki +ben +benden +beni +benim +beri +beş +bile +bin +bir +birçok +biri +birkaç +birkez +birşey +birşeyi +biz +bize +bizden +bizi +bizim +böyle +böylece +bu +buna +bunda +bundan +bunlar +bunları +bunların +bunu +bunun +burada +çok +çünkü +da +daha +dahi +de +defa +değil +diğer +diye +doksan +dokuz +dolayı +dolayısıyla +dört +edecek +eden +ederek +edilecek +ediliyor +edilmesi +ediyor +eğer +elli +en +etmesi +etti +ettiği +ettiğini +gibi +göre +halen +hangi +hatta +hem +henüz +hep +hepsi +her +herhangi +herkesin +hiç +hiçbir +için +iki +ile +ilgili +ise +işte +itibaren +itibariyle +kadar +karşın +katrilyon +kendi +kendilerine +kendini +kendisi +kendisine +kendisini +kez +ki +kim +kimden +kime +kimi +kimse +kırk +milyar +milyon +mu +mü +mı +nasıl +ne +neden +nedenle +nerde +nerede +nereye +niye +niçin +o +olan +olarak +oldu +olduğu +olduğunu +olduklarını +olmadı +olmadığı +olmak +olması +olmayan +olmaz +olsa +olsun +olup +olur +olursa +oluyor +on +ona +ondan +onlar +onlardan +onları +onların +onu +onun +otuz +oysa +öyle +pek +rağmen +sadece +sanki +sekiz +seksen +sen +senden +seni +senin +siz +sizden +sizi +sizin +şey +şeyden +şeyi +şeyler +şöyle +şu +şuna +şunda +şundan +şunları +şunu +tarafından +trilyon +tüm +üç +üzere +var +vardı +ve
+veya +ya +yani +yapacak +yapılan +yapılması +yapıyor +yapmak +yaptı +yaptığı +yaptığını +yaptıkları +yedi +yerine +yetmiş +yine +yirmi +yoksa +yüz +zaten Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/stopwords_tr.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain Added: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/userdict_ja.txt URL: http://svn.apache.org/viewvc/ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/userdict_ja.txt?rev=1707042&view=auto ============================================================================== --- ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/userdict_ja.txt (added) +++ ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/userdict_ja.txt Tue Oct 6 12:48:53 2015 @@ -0,0 +1,29 @@ +# +# This is a sample user dictionary for Kuromoji (JapaneseTokenizer) +# +# Add entries to this file in order to override the statistical model in terms +# of segmentation, readings and part-of-speech tags. Notice that entries do +# not have weights since they are always used when found. This is by-design +# in order to maximize ease-of-use. +# +# Entries are defined using the following CSV format: +# <text>,<token 1> ... <token n>,<reading 1> ... <reading n>,<part-of-speech tag> +# +# Notice that a single half-width space separates tokens and readings, and +# that the number tokens and readings must match exactly. +# +# Also notice that multiple entries with the same <text> is undefined. +# +# Whitespace only lines are ignored. Comments are not allowed on entry lines.
+# + +# Custom segmentation for kanji compounds +日本経済新聞,日本 経済 新聞,ニホン ケイザイ シンブン,カスタム名詞 +関西国際空港,関西 国際 空港,カンサイ コクサイ クウコウ,カスタム名詞 + +# Custom segmentation for compound katakana +トートバッグ,トート バッグ,トート バッグ,かずカナ名詞 +ショルダーバッグ,ショルダー バッグ,ショルダー バッグ,かずカナ名詞 + +# Custom reading for former sumo wrestler +朝青龍,朝青龍,アサショウリュウ,カスタム人名 Propchange: ofbiz/trunk/specialpurpose/solr/home/solrdefault/conf/lang/userdict_ja.txt ------------------------------------------------------------------------------ svn:mime-type = text/plain
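For readers checking entries before adding them to userdict_ja.txt, here is a minimal sketch of a validator for the entry format described in that file's header comments (four comma-separated fields, with tokens and readings separated by single half-width spaces and matching in count). It is not part of this commit or of Kuromoji: the names UserDictEntry and parse_userdict are hypothetical. It also assumes the plain file contents, without the leading "+" of the diff, and that lines starting with "#" are comments.

```python
import csv
from typing import Iterable, List, NamedTuple


class UserDictEntry(NamedTuple):
    """One user dictionary entry (hypothetical container, not a Kuromoji type)."""
    text: str
    tokens: List[str]
    readings: List[str]
    pos: str


def parse_userdict(lines: Iterable[str]) -> List[UserDictEntry]:
    """Parse entries of the form <text>,<tokens>,<readings>,<part-of-speech tag>."""
    entries = []
    for number, raw in enumerate(lines, start=1):
        line = raw.strip()
        # Whitespace-only lines are ignored; '#' lines are assumed to be comments.
        if not line or line.startswith("#"):
            continue
        fields = next(csv.reader([line]))
        if len(fields) != 4:
            raise ValueError(f"line {number}: expected 4 fields, got {len(fields)}")
        text, tokens, readings, pos = fields
        token_list = tokens.split(" ")
        reading_list = readings.split(" ")
        # The header requires the number of tokens and readings to match exactly.
        if len(token_list) != len(reading_list):
            raise ValueError(f"line {number}: token/reading count mismatch")
        entries.append(UserDictEntry(text, token_list, reading_list, pos))
    return entries


SAMPLE = """\
# Custom segmentation for kanji compounds
日本経済新聞,日本 経済 新聞,ニホン ケイザイ シンブン,カスタム名詞

# Custom reading for former sumo wrestler
朝青龍,朝青龍,アサショウリュウ,カスタム人名
"""

entries = parse_userdict(SAMPLE.splitlines())
for entry in entries:
    print(entry.text, entry.tokens, entry.readings, entry.pos)
```

The sketch only checks the shape of each line. It does not detect duplicate <text> values, which the header comments describe as undefined behaviour.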