Papers
Publication pipeline and artifact releases.
Preprints and artifact bundles will be posted here as they become citable.
Research papers
P1: Finland replay compiler in preparation
Working title: Reconstructing Point-in-Time Statutory State from Amendment Streams: A Replay Compiler for Finnish Law
Target venue: JURIX 2026 / ICAIL 2027
LawVM replays Finnish amendment streams into point-in-time legal state, compares the result with official consolidation surfaces, and classifies residual mismatches into operationally meaningful types rather than collapsing them into one error bucket. The v0.1 alpha evaluation is framed as a zero-to-one construction proof: ordinary human-written amendment law can be compiled to high coverage against a real national corpus. It identifies hundreds of replay-vs-Finlex divergences and a smaller set of 22 high-confidence meaningful candidate findings reported to Finlex. These remain candidate findings pending authority confirmation.
Status: Corpus evidence collected. Candidate-finding packet reported to Finlex. Evaluation framework defined. Drafting in progress. The intended contribution is to make the remaining work concrete enough for others to extend, verify, and institutionalize.
P2: Typed residuals as evidence planned
Working title: From Diff Noise to Typed Evidence: Evaluating Statutory Replay under Heterogeneous Truth Surfaces
Raw similarity between replayed text and official consolidation routinely misleads. Replay defects, source pathologies, editorial artifacts, and non-commensurable surfaces need distinct treatment. A single similarity score collapses operationally different failure modes and hides cases where the replay system is more correct than the oracle.
Status: May split from P1 or publish standalone.
P3: Corrigenda as higher-order operations planned
Working title: Corrigenda as Operations over Operations in Statutory Replay
Corrigenda are better modeled as higher-order operations (operations over prior amendment operations) than as flat editorial notes. Finnish corrigendum corpus: 471 entries, 180 statutes. In 44 statutes, corrigendum application makes LawVM more legally accurate than Finlex.
Status: Corpus exists. Formalization needed.
P4: Target resolution future
Working title: Proof-Bearing Target Resolution for Statutory Amendment Replay
Status: Implementation exists. Paper formalization pending.
Software publication
JOSS paper planned
A Journal of Open Source Software paper for LawVM. Software credit and citation stability. Pending install/test workflow stability.
Resource publication
Finland benchmark + verification ledger planned
The frozen corpus snapshot, golden dataset, and benchmark methodology as a citable dataset publication. Target venue: Scientific Data or Data in Brief. Pending golden dataset scale (200+ entries) and distribution rights.
Artifact releases
DOI-backed Zenodo releases are planned for:
- Frozen corpus snapshot
- Software release archive
- Golden dataset (candidate divergence findings and adjudicated residuals)
- Publication database (SQLite)
Status: Pending corpus freeze before first paper submission.
Citation
Until a paper is published, cite the software:
Kunnas, E. (2026). LawVM: An open-source replay compiler
for amendment-driven law. https://github.com/eliask/lawvm
A CITATION.cff file is included in the repository.