Back to siegfried

Siegfried development benchmarks

Fri, 27 Jul 2018 05:54:19 UTC

Environment

These benchmarks were automatically run on a t1.small.x86 machine provisioned from https://www.packet.net/.

Specs for the t1.small.x86: 4 Physical Cores @ 2.4 GHz; 8 GB DDR3 RAM; 80 GB SSD.

You can inspect the commands that were run to generate these benchmarks here.

iPRES Systems Showcase

A corpus created for the 2014 iPRES conference comprising 2,206 files (5GB). Represents a range of formats, including AV and some uncommon types. Sourced from http://www.webarchive.org.uk/datasets/ipres.ds.1/

Results

Tool Description Duration
master Master branch of github.com/richardlehane/siegfried. Corresponds to latest production release. 31.160217951s
develop Develop branch of github.com/richardlehane/siegfried. Tip of development and potentially unstable. 30.963125721s

The tools differed in output for 30 files in the corpus.

filemasterdevelop
/root/corpora/ipres-systems-showcase-files/ACCLINK.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/ANALYSF.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/ANALYSIS.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/ATPVBAEN.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/BSHXL.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/Balance Sheet.xltfmt/61fmt/62
/root/corpora/ipres-systems-showcase-files/EUROTOOL.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/EXPENSES.XLSfmt/473fmt/56
/root/corpora/ipres-systems-showcase-files/EXPTOOWS.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/ExpenseStatement.xltfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/FUNCRES.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/HTML.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/INVOICE.XLTfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/LABEL.WDBfmt/233fmt/111
/root/corpora/ipres-systems-showcase-files/LOOKUP.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/Loan Amortization.xltfmt/61fmt/62
/root/corpora/ipres-systems-showcase-files/PROCDB.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/QE.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/SALES.XLTfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/SOLVER.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/SUMIF.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/Sales Invoice.xltfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/TMPLTNUM.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/Timecard.xltfmt/61fmt/62
/root/corpora/ipres-systems-showcase-files/UPDTLINK.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/Village Software.xltfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/WEBFORM.XLAfmt/61fmt/111
/root/corpora/ipres-systems-showcase-files/WZTEMPLT.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/XLODBC.XLAfmt/59fmt/111
/root/corpora/ipres-systems-showcase-files/XLQUERY.XLAfmt/59fmt/111

Raw output

PRONOM files

A corpus created by Greg Lepore and comprising 1,205 files (2.1GB). Includes a single sample of as many of the PRONOM IDs (PUIDs) that Greg could find.

Results

Tool Description Duration
master Master branch of github.com/richardlehane/siegfried. Corresponds to latest production release. 3.857836972s
develop Develop branch of github.com/richardlehane/siegfried. Tip of development and potentially unstable. 3.940533185s

The tools differed in output for 4 files in the corpus.

filemasterdevelop
/root/corpora/pronom-files/fmt_128_OpenDocument Text_de.qwerkop.www_projects_mspace_doku.sdwfmt/136fmt/128
/root/corpora/pronom-files/fmt_129_OpenDocument Spreadsheet_Sauvetage_PL_TP.sxcfmt/137fmt/129
/root/corpora/pronom-files/fmt_137_OpenDocument_Spreadsheet_chart-range-import.sxcfmt/137x-fmt/263
/root/corpora/pronom-files/x-fmt_84_Microsoft_Powerpoint_Design_Template.POTfmt/126fmt/111

Raw output

Govdocs (Selected)

A selection from the Govdocs1 corpus comprising 26,124 files (31.4GB). Represents typical office formats, including approx. 15,000 PDFs. Originally sourced from http://openpreservation.org/blog/2012/07/26/1-million-21000-reducing-govdocs-significantly/

Results

Tool Description Duration
master Master branch of github.com/richardlehane/siegfried. Corresponds to latest production release. 3m11.649016359s
develop Develop branch of github.com/richardlehane/siegfried. Tip of development and potentially unstable. 9m20.895460128s

The tools differed in output for 126 files in the corpus.

filemasterdevelop
/root/corpora/govdocs-selected/CSV_16/096379.csvfmt/61fmt/111
/root/corpora/govdocs-selected/CSV_16/616072.csvfmt/61fmt/111
/root/corpora/govdocs-selected/CSV_16/675163.csvfmt/61fmt/62
/root/corpora/govdocs-selected/CSV_17/113892.csvfmt/61fmt/62
/root/corpora/govdocs-selected/CSV_17/477637.csvfmt/61fmt/111
/root/corpora/govdocs-selected/CSV_17/483360.csvfmt/61fmt/62
/root/corpora/govdocs-selected/CSV_17/609313.csvfmt/61fmt/62
/root/corpora/govdocs-selected/CSV_17/778548.csvfmt/61fmt/62
/root/corpora/govdocs-selected/CSV_17/922895.csvfmt/61fmt/62
/root/corpora/govdocs-selected/DOC_135/862565.zipfmt/523x-fmt/263
/root/corpora/govdocs-selected/HTML_124/462908.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_143/558860.htmlx-fmt/394fmt/99
/root/corpora/govdocs-selected/HTML_29/125709.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/137485.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/228989.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/231543.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/260625.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/266760.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/304937.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/376530.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/408767.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_29/455959.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_61/109253.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_68/143214.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_68/276772.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_72/164860.htmlfmt/61fmt/62
/root/corpora/govdocs-selected/HTML_72/247864.htmlfmt/61fmt/62
/root/corpora/govdocs-selected/HTML_72/338953.htmlfmt/61fmt/62
/root/corpora/govdocs-selected/HTML_72/564203.htmlfmt/61fmt/111
/root/corpora/govdocs-selected/HTML_72/579101.htmlfmt/61fmt/62
/root/corpora/govdocs-selected/HTML_74/164869.htmlfmt/126fmt/111
/root/corpora/govdocs-selected/HTML_74/971338.htmlfmt/126fmt/111
/root/corpora/govdocs-selected/HTML_76/173826.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/215336.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/238780.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/276768.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/324608.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/400269.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/559657.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_76/560715.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_81/194038.htmlfmt/61fmt/111
/root/corpora/govdocs-selected/HTML_86/247863.htmlfmt/61fmt/111
/root/corpora/govdocs-selected/HTML_86/879975.htmlfmt/61fmt/111
/root/corpora/govdocs-selected/HTML_86/880332.htmlfmt/61fmt/111
/root/corpora/govdocs-selected/HTML_87/249088.htmlfmt/126fmt/111
/root/corpora/govdocs-selected/HTML_91/297609.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_91/862543.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_93/304942.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_94/310021.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/HTML_95/310778.htmlfmt/40fmt/111
/root/corpora/govdocs-selected/KML_8/486133.kmlfmt/724x-fmt/263
/root/corpora/govdocs-selected/PDF_1562/661753.pdffmt/134fmt/15
/root/corpora/govdocs-selected/PDF_1616/553738.pdffmt/134fmt/18
/root/corpora/govdocs-selected/PDF_1631/825237.pdffmt/134fmt/17
/root/corpora/govdocs-selected/PDF_169/915424.pdffmt/134fmt/17
/root/corpora/govdocs-selected/PDF_246/960747.pdffmt/134fmt/16
/root/corpora/govdocs-selected/PDF_3230/900709.pdffmt/134fmt/18
/root/corpora/govdocs-selected/PDF_608/113057.pdffmt/134fmt/19
/root/corpora/govdocs-selected/PPS_1/424289.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/444773.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/468159.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/595101.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/694512.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/724719.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/727240.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/883180.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/887669.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_1/925160.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_10/318062.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_11/370208.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/219264.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/246872.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/249164.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/427081.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/690540.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/691123.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/692615.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/752637.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/760806.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_2/764092.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/042949.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/262754.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/343072.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/375593.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/613430.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/656831.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/664695.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/666705.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/673464.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_3/760665.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/021626.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/281474.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/284610.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/353108.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/377873.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/616855.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/660218.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/675338.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/728413.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_4/922892.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/250540.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/317841.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/334478.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/411334.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/421710.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/431713.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/675924.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/681118.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/727242.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_6/926200.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_7/122904.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_7/234520.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_7/369635.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_7/915553.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPS_8/226216.ppsfmt/126fmt/111
/root/corpora/govdocs-selected/PPT_45/405374.zipfmt/215x-fmt/263
/root/corpora/govdocs-selected/PPT_45/947671.zipfmt/215x-fmt/263
/root/corpora/govdocs-selected/PPT_46/472071.zipfmt/487x-fmt/263
/root/corpora/govdocs-selected/PPT_46/973194.zipfmt/215x-fmt/263
/root/corpora/govdocs-selected/SWF_12/554614.swffmt/134fmt/505
/root/corpora/govdocs-selected/SWF_12/628689.swffmt/134fmt/505
/root/corpora/govdocs-selected/XLS_162/508651.zipfmt/214x-fmt/263
/root/corpora/govdocs-selected/XLS_162/606177.zipfmt/214x-fmt/263
/root/corpora/govdocs-selected/XLS_187/654115.zipfmt/214x-fmt/263
/root/corpora/govdocs-selected/XLS_188/654117.zipfmt/214x-fmt/263
/root/corpora/govdocs-selected/XML_22/555096.xmlfmt/40fmt/111

Raw output

Profile

profiler information for siegfried development branch

History

2018-10-10 11:24:35 +0000 UTC

2018-09-19 01:50:01 +0000 UTC

2018-08-30 07:36:32 +0000 UTC

2018-08-27 06:10:52 +0000 UTC

2018-08-21 06:00:13 +0000 UTC

2018-07-27 05:54:19 +0000 UTC