Finland

The empirical proving ground.

Why Finland?

Finland combines difficult replay dynamics with unusually open source material and a legally non-authoritative consolidation surface. Finlex consolidated texts are informational. The operative source set is the original statute plus all amendment acts published in Säädöskokoelma.

This makes replay-vs-oracle comparison scientifically meaningful: when LawVM and Finlex disagree, the disagreement can be independently adjudicated against primary sources. A divergence might be a LawVM bug, a source gap, an editorial convention, or a candidate issue in the official consolidation surface.

Corpus

The current Finnish alpha corpus contains hundreds of fully replayable amended statutes, spanning from the 1920s to the 2020s. They are replayed from raw amendment acts in Säädöskokoelma XML and curated from a larger set of amended statutes where source availability varies.

The table below describes the frozen 2026-04-16 benchmark snapshot, not a permanent corpus definition.

DecadeStatutesAvg amendmentsMax
1920s–1960s101567
1970s2013.761
1980s2514.464
1990s10720.6238
2000s17313.3166
2010s2375.352
2020s1182.518

v0.1 alpha evidence snapshot. Benchmark figures are provisional and depend on the frozen source/oracle archive.

Benchmark

Text distance

0.65%
Provisional mean Levenshtein distance from the archived Finlex consolidation comparison surface. Most of this distance reflects editorial conventions (punctuation, synonyms, notes), not substantive differences.

Structural error

4.25%
Mean section-level structural divergence. 367 perfect structural match. 490 at ≥95%.

<90% match

104
Provisional investigation frontier in the alpha corpus.

Benchmark snapshot: 2026-04-16, mode: finlex_oracle. Two metrics: Levenshtein measures character-level text distance; structural error measures section-level divergence. See Artifacts for corpus definition and reproducibility.

How to read this: The benchmark measures structural agreement with Finlex. Legal correctness requires source review. Some divergences become high-confidence candidate findings when primary sources support LawVM over the Finlex consolidation. The verification loop is manual residual review against primary sources.

The golden dataset

We systematically investigate divergences between LawVM and Finlex. So far, 22 high-confidence meaningful candidate findings have been reported to Finlex for external review. Until Finlex or another competent authority confirms them, they should be described as candidate findings rather than verified official errors.

Each finding is documented in a structured internal entry with Finnish prose, root cause classification, affected sections, and source evidence. The public page below includes a compact bilingual digest; the full Finnish evidence viewer is not linked as part of the public website surface yet.

Root cause taxonomy

Root causeCountMeaning
finlex_missing_content10Finlex omits content published in Säädöskokoelma
source_pathology9Source XML itself is defective
finlex_missing_otsikko≥9Finlex omits section headings
oracle_stale_cutoff≥6Finlex has not applied recent amendments
finlex_editorial_addition4Finlex added content without amendment basis
high_uncovered_body3Structural amendment coverage gap
finlex_dual_element_residual3Finlex has dual current/pending versions
corrigendum_misapplied1Published correction not applied
+ 7 more categories

22 high-confidence meaningful candidate findings reported to Finlex as of 2026-04-25. Target: full classification of all divergences.

Reported candidate findings

The following 22 high-confidence meaningful candidate findings were sent to Finlex support under ticket RITM0644009 on 2026-04-23. They were reported as the author's best interpretation after checking Finlex XML, Finlex HTML, source acts, and relevant source artifacts. They remain candidate findings until confirmed by Finlex or another competent authority.

English statute titles are unofficial translations for orientation. The Finnish toggle preserves the original Finnish titles and wording.

  1. Tonnage Tax Act (Tonnistoverolaki, 476/2002). Amendment 1249/2009 appears not to be reflected in the current Finlex text. Candidate differences include 12 § 1 momentti changing from fully non-deductible acquisition cost to partial deductibility for income-taxable use, 18 § changing the threshold from less than half of equity to less than 1/9, and 22 § plus 24 § remaining visible even though the amendment repealed them.
  2. Fuel Fee Act (Polttoainemaksulaki, 1280/2003). Amendment 135/2019 replaced 1 § with three subsections. Finlex appears to show only two; the third subsection concerning T1-class light vehicles is missing.
  3. Decree on Mire Conservation Areas (Asetus soidensuojelualueista, 852/1988). The current Finlex text appears to contain five consolidation/text defects: a full sentence missing from 2 § 2 momentti, "metsästys" rendered as "metsäsätys", "vähentäminen" as "vähentämien", a missing comma in "toimenpiteet, jotka", and "noudatettaviksi" as "neudatettaviksi". No amendment was found changing these passages.
  4. Decree on Development Cooperation Assignments (Asetus kehitysyhteistyötehtävistä, 383/1989). In 4 § 1 momentti, "Kehitysyhteistyötehtävää" appears as "Kehitysyhteistyötehtäväää" in the Finlex consolidation.
  5. Act on the Autonomy of Åland (Ahvenanmaan itsehallintolaki, 1144/1991). Candidate text defects include missing "vähintään" in 69 § 2 momentti, missing "sellaisten ylioikeuksien ja muiden" in 38 § 1 momentti, and "kauppa-aluksilla" split as "kauppa aluksilla" in 18 § 3 kohta.
  6. Forest Improvement Decree (Metsänparannusasetus, 437/1987). Amendment 1563/1992 changed 2 § items 1 and 2. The original section had four items, but Finlex appears to show only items 1 and 2; unchanged items 3 and 4 are missing.
  7. Mental Health Decree (Mielenterveysasetus, 1247/1990). Amendment 1446/1992 changed the reference in 2 § to "mielenterveyslain 8 §:n 3 momentissa". Finlex appears to retain the old "8 §:n 2 momentissa" reference.
  8. Act on the Coordination of Social Security Systems (Laki sosiaaliturvajärjestelmien yhteensovittamisesta, 352/2010). In 11 § 2 momentti, Finlex appears to show only items 1-3; items 4-6 are missing.
  9. Act on Enforcement Fees (Laki ulosottomaksuista, 34/1995). In 5 § 3 momentti, the source text has three sentences. Finlex appears to show only the first sentence, omitting sentences concerning the bailiff's discretion and appeal bar.
  10. Act on Restrictions on the Use of Arava Rental Dwellings (Aravarajoituslaki, 1190/1993). Candidate typographic defects: "10 §: n" in 12 § 1 momentti, "vuokra asunnon" in 12 § 2 momentti, and "asunto osakeyhtiötaloksi" in the 13 § heading.
  11. Rules of Procedure of Parliament (Eduskunnan työjärjestys, 40/2000). Candidate spacing/word-fusion defects: "keskustelua,jollei" in 16 § 1 momentti, "lisätalousarvioesityksenja" in 20 § 2 momentti, and "asioidenkäsittelystä" in 69 § 1 momentti.
  12. Decree on Protection Areas (Asetus suoja-alueista, 1125/2000). In the 1 § table, "Kirkonmaan suoja-alue" appears as "Kirkkomaan suoja-alue". No amendment was found changing the place name.
  13. Act on Promoting the Use of Renewable Fuels in Transport (Laki uusiutuvien polttoaineiden käytön edistämisestä liikenteessä, 446/2007). In 1 § 2 momentti, "ReFuelEU Aviation ‑aloite" appears to be corrupted as "ReFuelEU Aviation âaloite".
  14. Antarctic Environmental Protection Act (Etelämantereen ympäristönsuojelulaki, 28/1998). Amendment 1020/2010 repealed 42 § as of 2011-01-01. Finlex appears to show both the repeal note and the old 42 § text as an active structure.
  15. Railway Tracks Act (Ratalaki, 110/2007). Amendment 741/2018 repealed 89 § 6 momentti from 2020-06-01. Finlex appears to continue showing the repealed subsection as active text, while its own change history records the repeal.
  16. Decree on the Right to Transfer State-Owned Real Property (Asetus oikeudesta luovuttaa valtion maaomaisuutta, 693/1978). In 3 §, Finlex appears to show both the old ministry references and the later environment-ministry wording in parallel, even though amendment 787/1988 replaced the section.
  17. Act on Farmers' Early Retirement Compensation (Laki maatalousyrittäjien luopumiskorvauksesta, 1330/1992). Amendment 42/1997 added 38 a § from 1997-09-01. The Finnish Finlex current text appears to skip from 38 § to 39 §, while the Swedish page shows 38 a §.
  18. Decree on Information to Be Given to Consumers on Credit Agreements (Asetus luottosopimuksesta kuluttajalle annettavista tiedoista, 789/2010). In 7 § 4 momentti 4 kohta, the reference "kuluttajansuojalain 7 lukua" appears on the Finlex page as "kuluttajansuojalain 7 luku a.", apparently due to hyperlink text splitting.
  19. Ministry of Education and Culture Decree on the Boards of Certain State Educational Institutions (OKM asetus eräiden valtion oppilaitosten johtokunnista, 410/2011). Temporary amendment 434/2011 added one sentence to 2 § 2 momentti and one sentence to 3 § 1 momentti. Both appear to be missing from the Finlex current page for the temporary validity interval 2011-05-13 to 2011-10-01.
  20. Government Decree on Grants to Non-Profit Communities and Foundations (Valtioneuvoston asetus valtionavustuksista yleishyödyllisille yhteisöille ja säätiöille, 716/2014). Amendment 697/2020 added temporary 8 b § on aid for small enterprises in difficulty. Finlex appears to skip from 8 a § to 9 §, although Finlex's own validity note mentions 8 b §.
  21. Decree on the Structures of Degrees under the Universities Act (Asetus yliopistolain mukaisten tutkintojen rakenteista, 869/2017). Amendment 613/2020 added temporary 3 a § for 2020-09-01 to 2021-07-31. Finlex appears to skip from 3 § to 4 §.
  22. Decree on Chargeable Services of the Finnish Transport Infrastructure Agency (Asetus Väyläviraston maksullisista suoritteista, 1465/2019). Amendment 382/2020 added temporary 3 a § for 2020-06-01 to 2020-08-31. Finlex appears to skip from 3 § to 4 §.

The forward goal

Current evidence shows repeated classes of cases where replay outperforms the official consolidation. Of the remaining divergence, a substantial share is attributable to the editorial surface rather than the compiler — with deterministic compilation as the native pipeline, the theoretical ceiling approaches 100% for well-drafted legislation with clean source data. The v0.1 goal is the zero-to-one construction: compile ordinary amendment law at high coverage against a real national corpus, bring Finland close to complete text-state replay, and leave enough architecture, evidence, and tooling for others to carry the remaining work forward.

The next Finland work is concrete: freeze a citable benchmark, reproduce the source archive, classify the residual frontier, verify or reject more candidate findings, and turn remaining LawVM defects into named interaction families with tests. If your institution wants to maintain part of that work, see the handoff map.