Papers
Publication pipeline and artifact releases.
Preprints and artifact bundles will be posted here as they become citable.
Research papers
P1: Finland replay compiler in preparation
Working title: Reconstructing Point-in-Time Statutory State from Amendment Streams: A Replay Compiler for Finnish Law
Target venue: JURIX 2026 / ICAIL 2027
LawVM replays Finnish amendment streams into point-in-time legal state, compares the result with official consolidation surfaces, and classifies residual mismatches into operationally meaningful types rather than collapsing them into one error bucket. Evaluated on 690 Finnish statutes (0.65% mean text distance from Finlex), the system identifies 77+ cases where the official consolidation is demonstrably incorrect. The remaining frontier is structured rather than random.
Status: Corpus evidence collected. Golden dataset growing (77+ verified entries). Evaluation framework defined. Drafting in progress.
P2: Typed residuals as evidence planned
Working title: From Diff Noise to Typed Evidence: Evaluating Statutory Replay under Heterogeneous Truth Surfaces
Raw similarity between replayed text and official consolidation routinely misleads. Replay defects, source pathologies, editorial artifacts, and non-commensurable surfaces need distinct treatment. A single similarity score collapses operationally different failure modes and hides cases where the replay system is more correct than the oracle.
Status: May split from P1 or publish standalone.
P3: Corrigenda as higher-order operations planned
Working title: Corrigenda as Operations over Operations in Statutory Replay
Corrigenda are better modeled as higher-order operations (operations over prior amendment operations) than as flat editorial notes. Finnish corrigendum corpus: 471 entries, 180 statutes. In 44 statutes, corrigendum application makes LawVM more legally accurate than Finlex.
Status: Corpus exists. Formalization needed.
P4: Target resolution future
Working title: Proof-Bearing Target Resolution for Statutory Amendment Replay
Status: Implementation exists. Paper formalization pending.
Software publication
JOSS paper planned
A Journal of Open Source Software paper for LawVM. Software credit and citation stability. Pending install/test workflow stability.
Resource publication
Finland benchmark + verification ledger planned
The frozen corpus snapshot, golden dataset, and benchmark methodology as a citable dataset publication. Target venue: Scientific Data or Data in Brief. Pending golden dataset scale (200+ entries) and distribution rights.
Artifact releases
DOI-backed Zenodo releases are planned for:
- Frozen corpus snapshot
- Software release archive
- Golden dataset (verified divergence entries)
- Publication database (SQLite)
Status: Pending corpus freeze before first paper submission.
Citation
Until a paper is published, cite the software:
Kunnas, E. (2026). LawVM: An open-source replay compiler
for amendment-driven law. https://github.com/eliask/lawvm
A CITATION.cff file is included in the repository.