Full-Text Index Stopwords

Stopwords are used to provide a list of commonly-used words that can be ignored for the purposes of Full-text-indexes.

Full-text indexes built in MyISAM and InnoDB have different stopword lists by default.

MyISAM Stopwords

For full-text indexes on MyISAM tables, by default, the list is built from the file storage/myisam/ft_static.c, and searched using the server's character set and collation. The ft_stopword_file system variable allows the default list to be overridden with words from another file, or for stopwords to be ignored altogether.

If the stopword list is changed, any existing full-text indexes need to be rebuilt

The following table shows the default list of stopwords, although you should always treat storage/myisam/ft_static.c as the definitive list. See the Fulltext Index Overview for more details, and Full-text-indexes for related articles.

a'sableaboutabove
accordingaccordinglyacrossactually
afterafterwardsagainagainst
ain'tallallowallows
almostalonealongalready
alsoalthoughalwaysam
amongamongstanand
anotheranyanybodyanyhow
anyoneanythinganywayanyways
anywhereapartappearappreciate
appropriatearearen'taround
asasideaskasking
associatedatavailableaway
awfullybebecamebecause
becomebecomesbecomingbeen
beforebeforehandbehindbeing
believebelowbesidebesides
bestbetterbetweenbeyond
bothbriefbutby
c'monc'scamecan
can'tcannotcantcause
causescertaincertainlychanges
clearlycocomcome
comesconcerningconsequentlyconsider
consideringcontaincontainingcontains
correspondingcouldcouldn'tcourse
currentlydefinitelydescribeddespite
diddidn'tdifferentdo
doesdoesn'tdoingdon't
donedowndownwardsduring
eacheduegeight
eitherelseelsewhereenough
entirelyespeciallyetetc
evenevereveryeverybody
everyoneeverythingeverywhereex
exactlyexampleexceptfar
fewfifthfirstfive
followedfollowingfollowsfor
formerformerlyforthfour
fromfurtherfurthermoreget
getsgettinggivengives
gogoesgoinggone
gotgottengreetingshad
hadn'thappenshardlyhas
hasn'thavehaven'thaving
hehe'shellohelp
henceherherehere's
hereafterherebyhereinhereupon
hersherselfhihim
himselfhishitherhopefully
howhowbeithoweveri'd
i'lli'mi'veie
ifignoredimmediatein
inasmuchincindeedindicate
indicatedindicatesinnerinsofar
insteadintoinwardis
isn'titit'dit'll
it'sitsitselfjust
keepkeepskeptknow
knowsknownlastlately
laterlatterlatterlyleast
lesslestletlet's
likelikedlikelylittle
looklookinglooksltd
mainlymanymaymaybe
memeanmeanwhilemerely
mightmoremoreovermost
mostlymuchmustmy
myselfnamenamelynd
nearnearlynecessaryneed
needsneithernevernevertheless
newnextnineno
nobodynonnonenoone
nornormallynotnothing
novelnownowhereobviously
ofoffoftenoh
okokayoldon
onceoneonesonly
ontoorotherothers
otherwiseoughtourours
ourselvesoutoutsideover
overallownparticularparticularly
perperhapsplacedplease
pluspossiblepresumablyprobably
providesquequiteqv
ratherrdrereally
reasonablyregardingregardlessregards
relativelyrespectivelyrightsaid
samesawsaysaying
sayssecondsecondlysee
seeingseemseemedseeming
seemsseenselfselves
sensiblesentseriousseriously
sevenseveralshallshe
shouldshouldn'tsincesix
sosomesomebodysomehow
someonesomethingsometimesometimes
somewhatsomewheresoonsorry
specifiedspecifyspecifyingstill
subsuchsupsure
t'staketakentell
tendsththanthank
thanksthanxthatthat's
thatsthetheirtheirs
themthemselvesthenthence
therethere'sthereafterthereby
thereforethereintheresthereupon
thesetheythey'dthey'll
they'rethey'vethinkthird
thisthoroughthoroughlythose
thoughthreethroughthroughout
thruthustotogether
tootooktowardtowards
triedtriestrulytry
tryingtwicetwoun
underunfortunatelyunlessunlikely
untiluntoupupon
ususeuseduseful
usesusingusuallyvalue
variousveryviaviz
vswantwantswas
wasn'twaywewe'd
we'llwe'rewe'vewelcome
wellwentwereweren't
whatwhat'swhateverwhen
whencewheneverwherewhere's
whereafterwhereaswherebywherein
whereuponwhereverwhetherwhich
whilewhitherwhowho's
whoeverwholewhomwhose
whywillwillingwish
withwithinwithoutwon't
wonderwouldwouldn'tyes
yetyouyou'dyou'll
you'reyou'veyouryours
yourselfyourselveszero

InnoDB Stopwords

Stopwords on full-text indexes are only enabled if the innodb_ft_enable_stopword system variable is set (by default it is) at the time the index was created.

The stopword list is determined as follows:

In the first two cases, the specified table must exist at the time the system variable is set and the full-text index created. It must be an InnoDB table with a single column, a VARCHAR named VALUE.

The default InnoDB stopword list differs from the default MyISAM list, being much shorter, and contains the following words:

aaboutanare
asatbeby
comdeenfor
fromhowiin
isitlaof
onorthatthe
thistowaswhat
whenwherewhowill
withundthewww

Comments

Comments loading...
Content reproduced on this site is the property of its respective owners, and this content is not reviewed in advance by MariaDB. The views, information and opinions expressed by this content do not necessarily represent those of MariaDB or any other party.