SMT-COMP

The International Satisfiability Modulo Theories (SMT) Competition.

GitHub

Home
Introduction
Benchmark Submission
Publications
SMT-LIB
Previous Editions

SMT-COMP 2012

Rules
Benchmarks
Tools
Specs
Participants
Results
Report

Benchmarks

Note: The benchmark archives that were included on this page are lost and not available anymore. The following information is kept mainly for archival purposes.

These benchmarks and tools are subject to change until their respective freeze dates, as documented in the SMT-COMP rules.

Application Track Benchmarks

2012 Additions

Benchmarks from 2011

Unsat Core Track Benchmarks

The unsat core benchmarks are the subset of the benchmarks from the indicated logics that are unsatisfiable; the benchmarks themselves have been modified to include names for the assertions. Per the SMTLIB standard, the benchmarks are allowed to contain a mix of named and unmaned formulae, though ordinarily, all top-level formulae will have names. The names may be scrambled by the benchmark scrambler.

The competition benchmarks were selected from these divisions:

Format for Results Files

Note: The Results archives that were provided with every benchmarks archive are lost and not available anymore. The following information is obsolete and only kept for archival purposes.

Each tarball contains one or two status files for each benchmark:

These files contain one status per line, corresponding to the series of (check-sat) commands in the benchmarks.

Each status line (where different from "unknown") has been determined by running at least 3 different SMT solvers on the set of instances resulted from "unfolding" each incremental benchmark using the scrambler. For *.results.txt, all the results different from "unknown" have been reported by at least 2 different solvers, whereas all the status lines generated by a single solver only (because e.g. the others timed out) have been replaced with "unknown". For *.results_untrusted.txt, the single-solver answer is also included. However, they are marked with an "# untrusted" comment.