My DCG is very slow to parse #1517

euanlacy · 2022-06-20T16:56:37Z

euanlacy
Jun 20, 2022

Hi, I've written a Prolog script to read in output from a C++ benchmarking library, but it takes tens of seconds to parse, which seems excessive. I was wondering if I've done anything obviously wrong that is affecting performance?

:- use_module(library(dcgs)).
:- use_module(library(lists)).
:- use_module(library(charsio)).
:- use_module(library(pio)).

ws --> [W], { char_type(W, whitespace) }, ws.
ws --> [].

string([]) --> [].
string([X|Xs]) --> [X], string(Xs), { char_type(X, alpha); memberchk(X, "_") }.

num([]) --> [].
num([X|Xs]) --> [X], num(Xs), {char_type(X, numeric)}.

decimal(N) --> num(A), ".", num(B), { append(A, ".", A0), append(A0, B, N) }.

unit(ms) --> "ms".
unit(ns) --> "ns".
unit(us) --> "us".
unit(s) --> "s".
unit(m) --> "m".

result(result(N, unit(U))) --> decimal(N), " ", unit(U).

benchmark_header(Tag, Type) --> string(Tag), " - ", string(Type).
benchmark_body(Name, Mean, Std_dev) --> 
    string(Name), ws, num(_Samples), ws, num(_Iterations), ws, result(_),
    ws, result(Mean), ws, result(_LowMean), ws, result(_HighMean),
    ws, result(Std_dev), ws, result(_LowStd_dev), ws, result(_HighStd_dev), ws, !.

benches([body(N, M , S)]) --> benchmark_body(N, M, S), ws.
benches([body(N, M , S)|Bs]) --> benchmark_body(N, M, S), ws, benches(Bs).

benchmark(header(Tag, Type), Bs) -->
    "-------------------------------------------------------------------------------\n",
    benchmark_header(Tag, Type), ws,
    "-------------------------------------------------------------------------------\n",
    string(_File), ".cpp:", num(_Line), ws,
    "...............................................................................\n\n",
    "benchmark name                       samples       iterations    estimated", ws,
    "                                     mean          low mean      high mean", ws,
    "                                     std dev       low std dev   high std dev", ws,
    "-------------------------------------------------------------------------------\n",
    benches(Bs).

Which parses output like this:

-------------------------------------------------------------------------------
Rewrite - Naive_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:27
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
aa                                             100          3430      5.831 ms
                                        17.4526 ns     17.422 ns    17.5074 ns
                                        0.20315 ns   0.127525 ns   0.313848 ns

bbabb                                          100           644     5.9892 ms
                                         100.01 ns    98.7541 ns    102.023 ns
                                        7.94613 ns    5.76743 ns    11.9486 ns

aabbb                                          100          3450      5.865 ms
                                        17.8225 ns    17.6039 ns    18.1568 ns
                                        1.36058 ns    0.96424 ns    1.77641 ns

zz                                             100          1019     6.0121 ms
                                        58.3541 ns    58.1912 ns    58.5919 ns
                                       0.999013 ns    0.74833 ns    1.34206 ns

cbbaab                                         100          1553     5.9014 ms
                                        39.0769 ns    38.8166 ns     39.469 ns
                                        1.61424 ns    1.19094 ns    2.22586 ns

abbaaba                                        100          3472     5.9024 ms
                                        17.3251 ns    17.2951 ns    17.3868 ns
                                        0.21103 ns   0.115268 ns   0.352663 ns

EricGT · 2022-06-20T17:06:11Z

EricGT
Jun 20, 2022

{ char_type(X, alpha); memberchk(X, "_") }.

Without actually running the code my guess would be that the ; is leaving a choice point.

Try this instead.

{ char_type(X, alpha), ! ; memberchk(X, "_") }.

After further looking at the rest of the predicates, most of the others also need cuts, !.

3 replies

euanlacy Jun 20, 2022
Author

Thank you for helping!
Making the first change appears to stop string from parsing anything longer than one character. If the goals are reordered to

string([X|Xs]) --> [X], { char_type(X, alpha), ! ; memberchk(X, "_") }, string(Xs).

it parses again, but takes just as long.

I am struggling to add cuts anywhere without also preventing it from parsing. Could you perhaps be more specific? I've never optimized Prolog code before, so I have no idea what I'm doing!

EricGT Jun 20, 2022

Can you give at least two different sample inputs that I can test against? I know the input looks simple, but having real-world input is better. Also, I work with SWI-Prolog and use open lists as opposed to closed lists with DCGs when possible, so any answer I post will be for SWI-Prolog but should be very close to what is needed for Scryer Prolog.

euanlacy Jun 20, 2022
Author

Here is the full output of the benchmark suite. The grammar is incomplete right now.

-------------------------------------------------------------------------------
Rewrite - Naive_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:27
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
aa                                             100          3469     5.8973 ms
                                        17.4145 ns    17.3583 ns     17.503 ns
                                       0.352022 ns   0.255109 ns   0.502213 ns

bbabb                                          100           651     6.1194 ms
                                        94.5881 ns      94.18 ns     95.612 ns
                                        3.04855 ns    1.12093 ns    5.72211 ns

aabbb                                          100          3532     6.0044 ms
                                        17.8783 ns    17.6207 ns    18.3697 ns
                                        1.75528 ns    1.02169 ns    2.65344 ns

zz                                             100          1003     6.1183 ms
                                        59.2874 ns    59.0872 ns     59.652 ns
                                          1.341 ns   0.866538 ns    2.13227 ns

cbbaab                                         100          1570      6.123 ms
                                         39.205 ns    39.0694 ns    39.4488 ns
                                       0.899215 ns   0.560321 ns    1.34959 ns

abbaaba                                        100          3527     5.9959 ms
                                        18.5495 ns    18.2273 ns    18.9702 ns
                                         1.8602 ns    1.51214 ns    2.19985 ns


-------------------------------------------------------------------------------
Rewrite - Ordered_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:27
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
aa                                             100          3771     6.0336 ms
                                        16.3393 ns    16.2753 ns     16.483 ns
                                       0.464891 ns   0.208647 ns   0.821375 ns

bbabb                                          100          1005     6.1305 ms
                                        60.6625 ns    60.4505 ns    60.9911 ns
                                        1.32875 ns   0.946219 ns    1.78052 ns

aabbb                                          100          3782     6.0512 ms
                                        16.3161 ns    16.2513 ns    16.4652 ns
                                       0.471539 ns   0.194508 ns   0.818178 ns

zz                                             100          1513      6.052 ms
                                        40.7396 ns    40.6229 ns    40.9417 ns
                                       0.763972 ns   0.500201 ns    1.10101 ns

cbbaab                                         100          2099     6.0871 ms
                                        28.9026 ns      28.81 ns    29.1426 ns
                                       0.705704 ns   0.331208 ns    1.44644 ns

abbaaba                                        100          3761     6.0176 ms
                                        16.3787 ns    16.3148 ns    16.5004 ns
                                       0.435521 ns   0.269352 ns    0.71123 ns


-------------------------------------------------------------------------------
Rewrite - Trie_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:27
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
aa                                             100          3732     5.9712 ms
                                        16.6483 ns    16.6047 ns    16.7429 ns
                                       0.311546 ns   0.165281 ns   0.523437 ns

bbabb                                          100          2632     6.0536 ms
                                        23.3966 ns     23.299 ns    23.6111 ns
                                       0.701911 ns   0.327419 ns    1.24484 ns

aabbb                                          100          3755      6.008 ms
                                        16.5762 ns    16.5116 ns    16.7398 ns
                                       0.473691 ns    0.12377 ns   0.855774 ns

zz                                             100          3361     6.0498 ms
                                         18.311 ns    18.2847 ns    18.3711 ns
                                       0.192634 ns   0.104879 ns   0.366017 ns

cbbaab                                         100          3550      6.035 ms
                                        17.7527 ns    17.6867 ns    17.8384 ns
                                       0.380023 ns   0.285608 ns   0.563568 ns

abbaaba                                        100          3759     6.0144 ms
                                        16.3686 ns    16.3253 ns    16.4373 ns
                                       0.272904 ns   0.190466 ns   0.381374 ns


-------------------------------------------------------------------------------
Old Knuth-Bendix
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:80
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
old kbs nbrute_force                           100             1     22.508 ms
                                        228.501 us    227.166 us    230.431 us
                                        8.10412 us    6.22534 us    12.0804 us


-------------------------------------------------------------------------------
Knuth-Bendix - Naive_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:97
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
"Example 5.4"                                  100             2     10.972 ms
                                        54.6235 us    54.4281 us    54.8977 us
                                        1.17733 us    897.693 ns    1.60009 us

"Libsemi-group test 010"                       100             1     2.59548 m
                                         1.58115 s     1.57788 s     1.58497 s
                                        17.9447 ms    15.2908 ms    21.3168 ms


-------------------------------------------------------------------------------
Knuth-Bendix - Ordered_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:97
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
"Example 5.4"                                  100             3     9.1551 ms
                                        30.3124 us    30.1639 us    30.5057 us
                                        859.394 ns    676.916 ns    1.10707 us

"Libsemi-group test 010"                       100             1     6.30673 s
                                        62.8143 ms    62.6804 ms    62.9694 ms
                                         738.62 us    635.483 us    864.688 us


-------------------------------------------------------------------------------
Knuth-Bendix - Trie_Wrap
-------------------------------------------------------------------------------
bench_knuth_bendix.cpp:97
...............................................................................

benchmark name                       samples       iterations    estimated
                                     mean          low mean      high mean
                                     std dev       low std dev   high std dev
-------------------------------------------------------------------------------
"Example 5.4"                                  100             2     7.1206 ms
                                        35.5352 us    35.2784 us    36.0645 us
                                        1.79458 us     1.0824 us    3.51272 us

"Libsemi-group test 010"                       100             1     2.05433 s
                                         20.359 ms    20.2873 ms    20.4541 ms
                                        420.949 us    332.669 us    537.058 us


===============================================================================

pmoura · 2022-06-20T17:33:48Z

pmoura
Jun 20, 2022

string([]) --> [].
string([X|Xs]) --> [X], string(Xs), { char_type(X, alpha); memberchk(X, "_") }.

num([]) --> [].
num([X|Xs]) --> [X], num(Xs), {char_type(X, numeric)}.

Try instead:

string([X|Xs]) --> [X], {once((char_type(X, alpha); memberchk(X, "_")))}, !, string(Xs).
string([]) --> [].

num([X|Xs]) --> [X], {char_type(X, numeric)}, !, num(Xs).
num([]) --> [].

I.e. do eager parsing instead of lazy parsing for those two non-terminals.
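
A minimal sketch of the difference, assuming Scryer Prolog with its default double_quotes = chars and char_type/2 from library(charsio). The names lazy_num//1 and eager_num//1 are made up for illustration and are not part of the original grammar:

```prolog
:- use_module(library(dcgs)).
:- use_module(library(charsio)).

% Lazy: the empty match is offered first; longer matches
% are only found on backtracking.
lazy_num([]) --> [].
lazy_num([X|Xs]) --> [X], { char_type(X, numeric) }, lazy_num(Xs).

% Eager: consume digits for as long as possible and commit
% to each one with a cut; the empty match is the fallback.
eager_num([X|Xs]) --> [X], { char_type(X, numeric) }, !, eager_num(Xs).
eager_num([]) --> [].
```

With these definitions, the first answer of phrase(lazy_num(Ds), "12x", Rest) is Ds = [] with nothing consumed, whereas phrase(eager_num(Ds), "12x", Rest) commits to Ds = "12" and Rest = "x".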

EricGT · 2022-06-20T17:38:40Z

EricGT
Jun 20, 2022

Paulo gave more of what needs changing.

Also, this needs some cuts:

unit(ms) --> "ms".  
unit(ns) --> "ns".  
unit(us) --> "us".  
unit(s) --> "s".  
unit(m) --> "m".

most likely

unit(ms) --> "ms", !.  
unit(ns) --> "ns", !.  
unit(us) --> "us", !.  
unit(s) --> "s", !.  
unit(m) --> "m".

euanlacy · 2022-06-20T17:52:03Z

euanlacy
Jun 20, 2022
Author

Thanks for the suggestions, it is much faster now!

This is what I have currently:

:- use_module(library(dcgs)).
:- use_module(library(lists)).
:- use_module(library(charsio)).
:- use_module(library(pio)).

ws --> [W], { char_type(W, whitespace) }, ws.
ws --> [].

string([X|Xs]) --> [X], { once((char_type(X, alpha) ; memberchk(X, "_"))) }, !, string(Xs).
string([]) --> [].

num([X|Xs]) --> [X], {char_type(X, numeric)}, !, num(Xs).
num([]) --> [].

decimal(N) --> num(A), ".", num(B), { append(A, ".", A0), append(A0, B, N) }.

unit(ms) --> "ms", !.
unit(ns) --> "ns", !.
unit(us) --> "us", !.
unit(s) --> "s", !.
unit(m) --> "m".

result(result(N, unit(U))) --> decimal(N), !, " ", unit(U), !.

benchmark_header(Tag, Type) --> string(Tag), " - ", string(Type).
benchmark_body(Name, Mean, Std_dev) --> 
    string(Name), !, ws, !, num(_Samples), !, ws, !, num(_Iterations), !, ws, !, result(_),
    !, ws, !, result(Mean), !, ws, !, result(_LowMean), !, ws, !, result(_HighMean),
    !, ws, !, result(Std_dev), !, ws, !, result(_LowStd_dev), !, ws, !, result(_HighStd_dev), !, ws, !.

benches([body(N, M , S)]) --> benchmark_body(N, M, S), ws.
benches([body(N, M , S)|Bs]) --> benchmark_body(N, M, S), ws, benches(Bs).

benchmark(header(Tag, Type), Bs) -->
    "-------------------------------------------------------------------------------\n",
    benchmark_header(Tag, Type), !, ws, !,
    "-------------------------------------------------------------------------------\n",
    string(_File), ".cpp:", num(_Line), !, ws, !,
    "...............................................................................\n\n",
    "benchmark name                       samples       iterations    estimated", !, ws,
    "                                     mean          low mean      high mean", !, ws,
    "                                     std dev       low std dev   high std dev", !, ws,
    "-------------------------------------------------------------------------------\n", !,
    benches(Bs).

I am not sure if all my cuts are in sensible places, so any comments would be appreciated, but I think I understand the idea of adding the cuts!.

2 replies

EricGT Jun 20, 2022

I am not sure if all my cuts are in sensible places

No, but it is nice to see you embracing cuts.

I am working on a complete example but it will take about an hour or two.

Also, thanks for the other examples.

Are they meant to be parsed together or should they be broken up? It should not be much harder to parse them as listed.

euanlacy Jun 20, 2022
Author

I'm not yet sure whether I want to parse them together or not. I'd rather not have a complete example as I'm writing it to help me benchmark something as part of my final year university project, and would like to talk about it in the software engineering section of my report! Any general advice is very much appreciated though!

pmoura · 2022-06-20T17:58:33Z

pmoura
Jun 20, 2022

For the units, you can also do a look-ahead to benefit from first-argument indexing:

result(result(N, unit(U))) --> decimal(N), " ", [C], unit(C, U).

unit(n, ns) --> "s".
unit(u, us) --> "s".
unit(s, s) --> [].
unit(m, ms) --> "s", !.
unit(m, m) --> [].
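
To see the look-ahead in isolation, here is a small sketch assuming Scryer Prolog (double_quotes = chars). time_unit//1 is a hypothetical wrapper that reads the first character and dispatches on it; the unit//2 clauses are repeated from above only so that the block runs on its own:

```prolog
:- use_module(library(dcgs)).

% unit(FirstChar, Unit): FirstChar has already been consumed by the caller.
unit(n, ns) --> "s".
unit(u, us) --> "s".
unit(s, s) --> [].
unit(m, ms) --> "s", !.   % commit to ms when an s follows the m
unit(m, m) --> [].        % otherwise a bare m

% Hypothetical wrapper: consume one character, then select clauses by it.
time_unit(U) --> [C], unit(C, U).
```

For example, phrase(time_unit(U), "ms") gives U = ms, phrase(time_unit(U), "m") gives U = m, and phrase(time_unit(U), "ns") gives U = ns.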

EricGT · 2022-06-20T18:02:44Z

EricGT
Jun 20, 2022

As I noted I work with SWI-Prolog.
For SWI-Prolog char_type/2 has white and space but no whitespace.

?- char_type(W,white).
W = '\t' ;
W = ' ' ;
false.

?- char_type(W,space).
W = '\t' ;
W = '\n' ;
W = '\v' ;
W = '\f' ;
W = '\r' ;
W = ' ' ;
W = '\u00A0' ;
W = '\u1680' ;
W = '\u180E' ;
W = '\u2000' ;
W = '\u2001' ;
W = '\u2002' ;
W = '\u2003' ;
W = '\u2004' ;
W = '\u2005' ;
W = '\u2006' ;
W = '\u2007' ;
W = '\u2008' ;
W = '\u2009' ;
W = '\u200A' ;
W = '\u2028' ;
W = '\u2029' ;
W = '\u202F' ;
W = '\u205F' ;
W = '\u3000' ;
false.

Can you list the similar results for Scryer? I am curious what it shows. (Sorry, I don't have Scryer currently installed.)

triska · 2022-06-20T19:01:30Z

triska
Jun 20, 2022

I think by far the most significant performance improvement you can make to this code is to test earlier whether a described character is of the desired type. In particular, if you change the two rules:

string([X|Xs]) --> [X], string(Xs), { char_type(X, alpha); memberchk(X, "_") }.

num([X|Xs]) --> [X], num(Xs), {char_type(X, numeric)}.

to, respectively:

string([X|Xs]) --> [X], { char_type(X, alpha); memberchk(X, "_") }, string(Xs).

num([X|Xs]) --> [X], {char_type(X, numeric)}, num(Xs).

then the time for parsing the file decreases dramatically. Note that adding !//0 to your grammar makes the description less general and no longer amenable to declarative reasoning with failure slices, generalizations etc., which are very severe drawbacks.

1 reply

triska Jun 20, 2022

In addition, note also that memberchk(X, "_") is equivalent to X = '_', and Prolog also tells us that:

?- memberchk(X, "_").
   X = '_'.
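
Putting the two points from this comment together, a cut-free version of the two non-terminals might look like the sketch below. It assumes Scryer Prolog (char_type/2 from library(charsio), double_quotes = chars); name_char/1 is a hypothetical helper introduced only to keep the rule body short:

```prolog
:- use_module(library(dcgs)).
:- use_module(library(charsio)).

% Hypothetical helper: a letter, or the underscore written as a plain
% clause head instead of memberchk(X, "_").
name_char(X) :- char_type(X, alpha).
name_char('_').

% The type test comes right after the character is read, before recursing.
string([]) --> [].
string([X|Xs]) --> [X], { name_char(X) }, string(Xs).

num([]) --> [].
num([X|Xs]) --> [X], { char_type(X, numeric) }, num(Xs).
```

With these rules, phrase(string(Cs), "Naive_Wrap") succeeds with Cs = "Naive_Wrap" and phrase(num(Ds), "3430") succeeds with Ds = "3430", while phrase(num(_), "12a") fails as soon as the non-digit is reached.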

EricGT · 2022-06-20T21:11:51Z

EricGT
Jun 20, 2022

As a follow up to what Markus is saying, see: https://stackoverflow.com/a/12942551/1243762

EricGT · 2022-06-21T10:12:51Z

EricGT
Jun 21, 2022

In your larger example input there is a problem:

Old Knuth-Bendix is not valid; it is a tag only.

Based on the DCG given, I would have expected something like

Old Knuth-Bendix - Naive_Wrap

which is a tag and type separated by -

Should the DCG be modified or should the input be changed?

EricGT · 2022-06-21T10:28:59Z

EricGT
Jun 21, 2022

Here is a much more complete example done using SWI-Prolog. Since this is a project for your course, you can learn from it and use it for your talk. Take from it what you like and pass on what you don't.

I know the expected comments will be

Use of cut when it should be pure. (If someone can show a working pure version that keeps the regular expression like predicates, e.g. * and +, I would be interested in it.)
Use of difference list (open list) should use A-B instead of A,B. (Old habits don't die.)
Where is the Scryer version? (At present I just wanted to get euanlacy some feedback on how to make the code faster, not do the assignment).
Why not use characters instead of codes. (Old habits don't die.)
Why the extra code for things like line counts and peek? (I left these in so others can learn from it.)

:- module(example,
    [
        check/1
    ]).

% -----------------------------------------------------------------------------

:- set_prolog_flag(double_quotes, codes).
:- set_prolog_flag(back_quotes, string).

% ----------------------------------------------------------------------------

:- nb_setval(line_count,0).

inc(line_count) :-
    nb_getval(line_count,Line_count0),
    Line_count is Line_count0 + 1,
    nb_setval(line_count,Line_count).

ws --> ws(_).
'ws+' --> 'ws+'(_).
'ws*' --> 'ws*'(_).

'ws+'([H|T]) -->
    ws(H),
    'ws*'(T).

'ws*'([H|T]) -->
    ws(H), !,
    'ws*'(T).
'ws*'([]) --> [].

ws(0x20) --> " ".
ws(0x0A) -->
     "\n",
    { inc(line_count) }.
ws(0x0D) --> "\r".
ws(0x09) --> "\t".

eol --> eol(_).

eol(eol(0x0D,0x0A)) -->
    "\r\n",
    !,
    { inc(line_count) }.
eol(eol(0x0D)) --> "\r", !.
eol(eol(0x0A)) -->
    "\n",
    { inc(line_count) }.

string --> string(_).

string(string(String)) -->
    'string_char+'(Codes),
    { string_codes(String,Codes) }.

'string_char+'([H|T]) -->
    string_char(H),
    'string_char*'(T).

'string_char*'([H|T]) -->
    string_char(H), !,
    'string_char*'(T).
'string_char*'([]) --> [].

string_char(C) -->
    [C],
    { char_type(C,alpha) }, !.
string_char(0'_) --> "_".

num --> num(_).

num(num(Number)) -->
    'digit+'(Digits),
    { number_codes(Number,Digits) }.

decimal(decimal(Decimal)) -->
    'digit+'(T0,T1),
    period(T1,T2),
    'digit+'(T2,T),
    {
        T = [],
        number_codes(Decimal,T0)
    }.

% closed list variation
'digit+'([H|T]) -->
    digit(H),
    'digit*'(T).

% closed list variation
'digit*'([H|T]) -->
    digit(H), !,
    'digit*'(T).
'digit*'([]) --> [].

% open list variation
'digit+'(T0,T) -->
    digit(T0,T1),
    'digit*'(T1,T).

% open list variation
'digit*'(T0,T) -->
    digit(T0,T1), !,
    'digit*'(T1,T).
'digit*'(T,T) --> [].

% open list variation
digit(T0,T) -->
    digit(C),
    { T0 = [C|T] }.

digit(C) -->
    [C],
    { between(0'0,0'9,C) }, !.

% open list variation
period(T0,T) -->
    period(C),
    { T0 = [C|T] }.

period(0'.) -->
    ".".

unit(0'n, unit(ns)) --> "s".
unit(0'u, unit(us)) --> "s".
unit(0's, unit(s)) --> [].
unit(0'm, unit(ms)) --> "s", !.
unit(0'm, unit(m)) --> [].

result --> result(_).

result(result(N, U)) -->
    decimal(decimal(N)),
    " ",
    [C], unit(C, unit(U)).

benchmark_detail(benchmark_detail(Name, mean(Mean_value,Mean_unit), std_dev(Std_dev_value,Std_dev_unit))) -->
    string(string(Name)), 'ws+', num, 'ws+', num, 'ws+', result, eol,
    'ws+', result(result(Mean_value,Mean_unit)), 'ws+', result, 'ws+', result, eol,
    'ws+', result(result(Std_dev_value,Std_dev_unit)), 'ws+', result, 'ws+', result, eol, !.

'benchmark_detail+'([H|T]) -->
    'ws*',
    benchmark_detail(H),
    'benchmark_detail*'(T).

'benchmark_detail*'([H|T]) -->
    'ws*',
    benchmark_detail(H), !,
    'benchmark_detail*'(T).
'benchmark_detail*'([]) -->
    'ws*'.

benchmark_header(benchmark_header(Tag, Type)) -->
    string(string(Tag)),
    " - ",
    string(string(Type)).

benchmark_detail_header(benchmark_detail_header(Tag,Type)) -->
    "-------------------------------------------------------------------------------",eol,
    benchmark_header(benchmark_header(Tag, Type)),eol,
    "-------------------------------------------------------------------------------",eol,
    string, ".cpp:", num, eol,
    "...............................................................................",eol,
    eol,
    "benchmark name                       samples       iterations    estimated",eol,
    "                                     mean          low mean      high mean",eol,
    "                                     std dev       low std dev   high std dev",eol,
    "-------------------------------------------------------------------------------",eol.

% closed list variation
'benchmark+'([H|T]) -->
    'ws*',
    benchmark(H),
    'benchmark*'(T).

% closed list variation
'benchmark*'([H|T]) -->
    'ws*',
    benchmark(H), !,
    'benchmark*'(T).
'benchmark*'([]) --> [].

benchmark(benchmark(Tag, Type, Details)) -->
    benchmark_detail_header(benchmark_detail_header(Tag,Type)),
    'benchmark_detail+'(Details).

benchmarks(Benchmarks) -->
    % { gtrace },
    'benchmark+'(Benchmarks),
    'ws*',
    (
        "===============================================================================", !
    ;
        []
    ),
    'ws*'.

lookahead(C1,C2,C3),[C1,C2,C3] --> [C1],[C2],[C3].

peek -->
    lookahead(C1,C2,C3),
    {
        format('~d,~d,~d~n',[C1,C2,C3]),
        char_code(Ch_1,C1),
        char_code(Ch_2,C2),
        char_code(Ch_3,C3),
        format('~p,~p,~p~n',[Ch_1,Ch_2,Ch_3])
    }.

input_file('benchmarks.txt').

check(1) :-
    input_file(Input_file),
    DCG = benchmarks(Benchmarks),
    phrase_from_file(DCG, Input_file),
    print_term(Benchmarks,[]).

check(2) :-
    setup_call_cleanup(
        (
            input_file(Input_file),
            open(Input_file,read,Input_stream)
        ),
        (
            read_stream_to_codes(Input_stream, Codes),
            DCG = benchmarks(Benchmarks),
            phrase(DCG,Codes,Rest)
        ),
        (
            assertion( Rest == [] ),
            close(Input_stream)
        )
    ),
    print_term(Benchmarks,[]).

Example run. It only demonstrates the first two benchmarks because parsing fails on Old Knuth-Bendix, which is an input problem, not a code problem.

?- check(1).
[ benchmark("Rewrite",
            "Naive_Wrap",
            [ benchmark_detail("aa",mean(17.4145,ns),std_dev(0.352022,ns)),
              benchmark_detail("bbabb",mean(94.5881,ns),std_dev(3.04855,ns)),
              benchmark_detail("aabbb",mean(17.8783,ns),std_dev(1.75528,ns)),
              benchmark_detail("zz",mean(59.2874,ns),std_dev(1.341,ns)),
              benchmark_detail("cbbaab",mean(39.205,ns),std_dev(0.899215,ns)),
              benchmark_detail("abbaaba",mean(18.5495,ns),std_dev(1.8602,ns))
            ]),
  benchmark("Rewrite",
            "Ordered_Wrap",
            [ benchmark_detail("aa",mean(16.3393,ns),std_dev(0.464891,ns)),
              benchmark_detail("bbabb",mean(60.6625,ns),std_dev(1.32875,ns)),
              benchmark_detail("aabbb",mean(16.3161,ns),std_dev(0.471539,ns)),
              benchmark_detail("zz",mean(40.7396,ns),std_dev(0.763972,ns)),
              benchmark_detail("cbbaab",mean(28.9026,ns),std_dev(0.705704,ns)),
              benchmark_detail("abbaaba",mean(16.3787,ns),std_dev(0.435521,ns))
            ])
]
true.

Hope you or others can learn something from it. Ask questions if you have any but I don't plan to do another complete example for any more questions. :)

UWN · 2022-07-06T10:23:41Z

UWN
Jul 6, 2022

@euanlacy : Could you post your current version? There could be some further improvements.
