** one preferred
* obban - obbán

obban
obban	obban+Adv

obbán
obbán	obbát+V+IV+Actio+Nom
obbán	obbát+V+IV+Actio+Gen
obbán	obbát+V+IV+Actio+Acc
obbán	obbát+V+IV+PrfPrc
obbán	obbát+V+IV+Der/n+N+Sg+Nom
obbán	obbát+V+IV+Der/n+N+Sg+Gen
obbán	obbát+V+IV+Ind+Prs+Sg1

* dohko - dohkko

dohko
dohko	dohko+Adv

dohkko
dohkko	doahkkut+V+IV+Der/PassS+V+Ind+Prs+Sg3
dohkko	doahkkut+V+IV+Der/PassS+V+Ind+Prs+ConNeg
dohkko	doahkkut+V+IV+Ind+Prs+Du1
dohkko	doahkkut+V+IV+Ind+Prt+Pl3
dohkko	doahkkut+V+IV+Imprt+ConNegII


* báhcán - báhcan

báhcán
báhcán	báhcit+V+IV+PrfPrc

báhcan
báhcan	báhccat+V+IV+Ind+Prs+Sg1

* jearrán - jearran

jearrán
jearrán	jearrat+V+TV+Actor+N+Sg+Nom+PxSg1
jearrán	jearrat+V+TV+Actor+N+Sg+Gen+PxSg1
jearrán	jearrat+V+TV+Actor+N+Sg+Acc+PxSg1

jearran
jearran	jearrat+V+TV+Actio+Nom
jearran	jearrat+V+TV+Actio+Gen
jearran	jearrat+V+TV+Actio+Acc
jearran	jearrat+V+TV+PrfPrc
jearran	jearrat+V+TV+Der/n+N+Sg+Nom
jearran	jearrat+V+TV+Der/n+N+Sg+Gen

* ihttin - ihtin?

ihtin
ihtin	ihtit+V+IV+Actio+Nom
ihtin	ihtit+V+IV+Actio+Gen
ihtin	ihtit+V+IV+Actio+Acc
ihtin	ihtit+V+IV+Der/n+N+Sg+Nom
ihtin	ihtit+V+IV+Der/n+N+Sg+Gen
ihtin	ihtin+Adv

ihttin
ihttin	ihtit+V+IV+Actor+N+Ess
ihttin	ihttin+Adv

* sága - saga

sága
sága	sáhka+N+Sg+Gen
sága	sáhka+N+Sg+Acc
sága	sáhkat+V+TV+VGen
sága	sáhkat+V+TV+Imprt+Sg2
sága	sáhkat+V+TV+Imprt+ConNeg
sága	sáhkat+V+TV+Ind+Prs+ConNeg

saga
saga	sahkat+V+IV+VGen
saga	sahkat+V+IV+Imprt+Sg2
saga	sahkat+V+IV+Imprt+ConNeg
saga	sahkat+V+IV+Ind+Prs+ConNeg
saga	saga+N+Sg+Nom
saga	saga+N+Sg+Gen
saga	saga+N+Sg+Acc

* bargui - bárgui

bargui
bargui	bargu+N+Sg+Ill
bargui	bargat+V+TV+Der/PassS+V+Ind+Prt+Sg3

bárgui
bárgui	bárgut+V+IV+Actor+N+Sg+Ill

* mánnu - mannu #Duommá

mánnu
mánnu	mánnu+Sem/Hum+N+Sg+Nom

mannu
mannu	mannat+V+IV+Imprt+Du1
mannu	mannut+V+IV+Imprt+Du2
mannu	mannut+V+IV+Imprt+Du1
mannu	mannut+V+IV+Ind+Prs+Sg3
mannu	mannut+V+IV+Actor+N+Sg+Acc
mannu	mannut+V+IV+Actor+N+Sg+Gen
mannu	mannut+V+IV+Actor+N+Sg+Nom
mannu	mannu+N+Sg+Nom


** equal weight??

* manne - mánne

manne
manne	mannat+V+IV+Ind+Prs+Du1
manne	mannat+V+IV+Ind+Prt+Pl3
manne	manne+Adv
manne	mannet+V+IV+VGen
manne	mannet+V+IV+Imprt+Sg2
manne	mannet+V+IV+Imprt+ConNeg
manne	mannet+V+IV+Ind+Prs+Sg3
manne	mannet+V+IV+Ind+Prs+ConNeg

mánne
mánne	mánnat+V+TV+Ind+Prs+Du1
mánne	mánnat+V+TV+Ind+Prt+Pl3


How to find mistakes more quickly:
* typical orthographic mistakes that result in real-word errors
* find and replace in the lexicon, then analyse the result, looking for homonyms
* can we seammaládje, norggabealde, ovdamearkkadihte?

3rd person verb/ PrfPrc
at, it
a -> á
á -> a
tt -> t (is that usual?!)
xy -> xyy   
xyy -> xy
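
The substitution patterns above can be sketched as a small candidate generator. This is only a rough illustration, not an existing tool: the function names and the tiny lexicon are hypothetical, input is assumed to be lower case, and "xy -> xyy" is read here as "double or undouble any single consonant", which is broader than the patterns actually observed.

```python
# Vowel inventory used to decide what counts as a consonant (an assumption).
VOWELS = set("aáeiouyæøåäö")


def confusion_candidates(word):
    """Return forms that differ from `word` by exactly one a/á swap,
    one consonant doubling, or one consonant undoubling."""
    candidates = set()
    for i, ch in enumerate(word):
        head, tail = word[:i], word[i + 1:]
        if ch == "a":                      # a -> á
            candidates.add(head + "á" + tail)
        elif ch == "á":                    # á -> a
            candidates.add(head + "a" + tail)
        elif ch.isalpha() and ch not in VOWELS:
            candidates.add(head + ch + ch + tail)   # xy -> xyy
            if tail.startswith(ch):                 # xyy -> xy
                candidates.add(head + tail)
    candidates.discard(word)
    return candidates


def real_word_neighbours(word, lexicon):
    """Keep only those candidates that are themselves valid word forms."""
    return sorted(confusion_candidates(word) & set(lexicon))


# Hypothetical mini-lexicon built from forms mentioned in these notes.
LEXICON = {"dohko", "dohkko", "biilas", "biillas", "gávdnat", "gávdnát"}
for form in sorted(LEXICON):
    print(form, real_word_neighbours(form, LEXICON))
```

In practice the lexicon check would be replaced by a lookup in the analyser, so that only candidates with at least one analysis are kept.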

PrfPrc

biilas vs biillas 
biilas	biila+N+Sg+Nom+PxSg3 vs. biillas	biila+N+Sg+Loc

leaska leaskka

gávdnat - gávdnát
gávdnat
gávdnat	gávdnat+V+TV+Inf
gávdnat	gávdnat+V+TV+Ind+Prs+Pl1

gávdnát
gávdnát	gávdni+N+Sg+Nom+PxSg2
gávdnát	gávdnat+V+TV+Actor+N+Sg+Nom+PxSg2
gávdnát	gávdnat+V+TV+Actor+N+Sg+Gen+PxSg2
gávdnát	gávdnat+V+TV+Actor+N+Sg+Acc+PxSg2

REAL-WORD errors

cohort:

REMOVE @-FMAINV IF (0 Inf) 
REMOVE @-FMAINV IF (0 Inf LINK 0 Pl1) same cohort === the rule fires
REMOVE @-FMAINV IF (0 Inf + Pl1) same line (reading) ===== the rule does not fire
"<barggastuvvat>"
        "barggastuvvat" V IV Inf @-FMAINV MAP:8813:r413:AllFmainv &*real-a #1->1 
;       "bargat" V* TV Der/stuvva V Ind Prs Pl1 REMOVE:2809:r548 
;       "bargat" V* TV Der/stuvva V Inf REMOVE:2809:r548 
;       "barggastuvvat" V IV Ind Prs Pl1 REMOVE:8122:r1821 



categories:

1. (very very unlikely) non-existent forms where we can match an error tag right away:
gallet+V+TV+VGen   galle   gálle gearddi
gálle	gállat+V+IV+Ind+Prs+Du1
gálle	gállat+V+IV+Ind+Prt+Pl3
gálle	gállit+V+IV+Ind+Prs+Du1
gálle	gállit+V+IV+Ind+Prt+Pl3
gálle	gállet+V+TV+VGen
gálle	gállet+V+TV+Imprt+Sg2
gálle	gállet+V+TV+Imprt+ConNeg
gálle	gállet+V+TV+Ind+Prs+Sg3
gálle	gállet+V+TV+Ind+Prs+ConNeg

2. pairs where we can clearly distinguish syntactically
e.g. noun vs. verb Imprt
or Sg1 vs ConNeg
by means of valency

2.1.
2.2.
2.3.
etc.
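
A first, crude way to pick out category 2 candidates might be to compare the part-of-speech sets of the two forms in the analyser output. The sketch below assumes the tab-separated `form<TAB>lemma+tags` format shown in these notes; the function names and the `POS_TAGS` inventory are hypothetical, and disjoint POS sets are only a hint that a syntactic rule could work, not a guarantee.

```python
# Tags treated as part-of-speech tags (an assumed, incomplete inventory).
POS_TAGS = {"N", "V", "A", "Adv", "Pron", "Num", "Po", "Pr", "CC", "CS", "Pcle"}


def parse_analyses(text):
    """Map each word form to a list of (lemma, tags) readings."""
    readings = {}
    for line in text.splitlines():
        if "\t" not in line:
            continue  # skip blank lines and bare headword lines
        form, analysis = line.split("\t", 1)
        lemma, *tags = analysis.strip().split("+")
        readings.setdefault(form, []).append((lemma, tags))
    return readings


def final_pos(tags):
    """Return the last POS tag, i.e. the POS after any derivation."""
    for tag in reversed(tags):
        if tag in POS_TAGS:
            return tag
    return None


def pos_set(readings, form):
    return {final_pos(tags) for _, tags in readings.get(form, [])}


def pos_disjoint(readings, form_a, form_b):
    """True if both forms have readings and share no final POS."""
    a, b = pos_set(readings, form_a), pos_set(readings, form_b)
    return bool(a) and bool(b) and a.isdisjoint(b)


SAMPLE = (
    "dohko\tdohko+Adv\n"
    "dohkko\tdoahkkut+V+IV+Der/PassS+V+Ind+Prs+Sg3\n"
    "dohkko\tdoahkkut+V+IV+Ind+Prs+Du1\n"
    "ihtin\tihtit+V+IV+Actio+Nom\n"
    "ihtin\tihtin+Adv\n"
    "ihttin\tihttin+Adv\n"
)
readings = parse_analyses(SAMPLE)
print(pos_disjoint(readings, "dohko", "dohkko"))
print(pos_disjoint(readings, "ihtin", "ihttin"))
```

Pairs that share a POS, such as ihtin and ihttin (both have an Adv reading), would need a finer comparison of the remaining tags or fall into category 3.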

3. pairs which we can clearly distinguish semantically

e.g. appear only in a particular semantic context


LIST of words for which we need specific real word rules:
čalmmástuvve                    #MID
dáhppohalle                     #MID
gáskkahalle                     #MID
lássehalle                      #MID
bággehalle                      #DAB

These are words with two á vs two a:
mánástadde                      #MID - mánástádde manastadde
mánástalle                      #MID
mánástahtte                     #MID
njálggástadde                   #MID
njálggástalle                   #MID
gáskkáhalle                     #MID
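
Sets of forms that differ only in a vs. á, like the ones listed above, could be collected from a word list by folding á to a and grouping on the folded key. A minimal sketch, with a hypothetical function name and a hand-picked sample list:

```python
from collections import defaultdict


def group_by_a_folding(forms):
    """Group forms that become identical once every á is replaced by a.
    Only groups with at least two distinct members are returned."""
    groups = defaultdict(set)
    for form in forms:
        groups[form.replace("á", "a")].add(form)
    return {key: sorted(members)
            for key, members in groups.items() if len(members) > 1}


# Sample forms taken from the lists in these notes.
FORMS = [
    "mánástadde", "mánástádde", "manastadde",
    "gáskkahalle", "gáskkáhalle",
    "lássehalle",
]
for key, members in group_by_a_folding(FORMS).items():
    print(key, members)
```

Each group still has to be checked against the analyser, since some members may not be valid forms at all.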


LIST VARIANTS = "guođáhallat" "guođahallat" "oainnahallat" "oainnáhallat" "vuoittáhallat" "vuoittahallat" "govssahallat" "govssáhallat" "gáskkahallat" "gáskkáhallat" "časkkahallat" "časkkáhallat" "njuorddahallat" "njuorddáhallat" "cakkastallat" "cakkástallat" "doamahallat" "doamáhallat"   "boalbalastit" "boalbálastit" "gierastallat" "gierástallat" "gierastaddat" "gierástaddat"  ;

LIST ALMOST-VARIANTS =
"buorranaddat" "buorránaddat" "buorranastit" "buorránastit" "buorraneastit" "buorráneastit" "jierpmastallat" "jierpmastaddat" "jierpmástallat" "jierpmástaddat" ;

