bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-03-09 12:23:04 +00:00

Author	SHA1	Message	Date
Akim Demaille	03d33fd3a4	skeletons: better comments for some tables And also, remove the incorrect indentation of these comments: - /* YYR2[YYN] -- Number of symbols on the right hand side of rule YYN. */ +/* YYR2[RULE-NUM] -- Number of symbols on the right-hand side of rule RULE-NUM. */ static const yytype_int8 yyr2[] = { 0, 2, 4, 0, 2, 1, 1, 1, 3, 2, I don't remember why this indentation was added (in `0991e29b75`), but it seems wrong, at least for yacc.c. I suspect this was done with lalr1.cc (where this is embeded in the class definition, so it should be indented), but today lalr1.cc uses other routines to output these comments. * data/skeletons/bison.m4 (b4_integral_parser_tables_map): Improve the wording of the comments of some tables. * data/skeletons/c.m4 (b4_integral_parser_table_define): Remove indentation.	2020-12-20 14:54:46 +01:00
Akim Demaille	23472033ee	Merge branch 'maint' * maint: c++: shorten the assertions that check whether tokens are correct c++: don't glue functions together lalr1.cc: YY_ASSERT should use api.prefix c++: don't use YY_ASSERT at all if parse.assert is disabled c++: style: follow the Bison m4 quoting pattern yacc.c: provide the Bison version as an integral macro regen style: make conversion of version string to int public %require: accept version numbers with three parts ("3.7.4") yacc.c: fix #definition of YYEMPTY gnulib: update doc: fix incorrect section title doc: minor grammar fixes in counterexamples section	2020-11-13 07:01:19 +01:00
Akim Demaille	0264b4bca0	c++: don't glue functions together * data/skeletons/bison.m4 (b4_type_foreach): Accept a separator. * data/skeletons/c++.m4: Use it. And fix an incorrect comment.	2020-11-13 06:17:52 +01:00
Akim Demaille	21c147b6e5	yacc.c: provide the Bison version as an integral macro Suggested by Balazs Scheidler. https://github.com/akimd/bison/issues/55 * src/muscle-tab.c (muscle_init): Move/rename `b4_version` to/as... * src/output.c (prepare): `b4_version_string`. Also define `b4_version`. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/d.m4, * data/skeletons/java.m4: Adjust. * doc/bison.texi: Document it.	2020-11-11 09:08:57 +01:00
Akim Demaille	5a31cda4c3	style: avoid explicit symbol numbers This should have been part of commit "symbols: stop dealing with YYEMPTY as b4_symbol(-2, ...)" (`cd40ec9526`). Give names to all the special symbols: "eof", "error" and "undef". * data/skeletons/bison.m4 (b4_symbol): Let `b4_symbol(eof, ...)` mean `b4_symbol(0, ...)`, `b4_symbol(error, ...)` mean `b4_symbol(1, ...)`, and , `b4_symbol(undef, ...)` mean `b4_symbol(2, ...)`.. * data/skeletons/c.m4, data/skeletons/glr.c, data/skeletons/glr.cc, * data/skeletons/glr2.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, * data/skeletons/yacc.c: Prefer symbols to numbers.	2020-11-07 16:58:47 +01:00
Akim Demaille	e66673aa64	m4: have b4_percent_define_if_define apply default values lazily Currently `b4_percent_define_ifdef([foo])` assigns a default value to `foo` when invoked. As a consequence, skeletons such as lalr1.d cannot specify their specific default values: `foo` was defined in bison.m4. Instead, provide `foo` with a default value when `b4_foo_if` is invoked. I could not measure a runtime difference between both cases. * data/skeletons/bison.m4 (_b4_percent_define_define): New. Helps getting rid of spurious indentation that resulted in spurious white space in the output. (b4_percent_define_if_define): Move the definition to... (_b4_percent_define_if_define): when the defined macros is called.	2020-10-03 09:17:53 +02:00
Akim Demaille	cd40ec9526	symbols: stop dealing with YYEMPTY as b4_symbol(-2, ...) * data/skeletons/bison.m4 (b4_symbol): Redirect `b4_symbol(empty, ...)` to `b4_symbol(-2, ...)`. Change all uses of the latter to the former.	2020-09-29 06:49:31 +02:00
Akim Demaille	d9cf99b6a5	multistart: use b4_accept instead of action post-processing For each start symbol, generate a parsing function with a richer return value than the usual of yyparse. Reserve a place for the returned semantic value, in order to avoid having to pass a pointer as argument to "return" that value. This also makes the call to the parsing function independent of whether a given start-symbol is typed. For instance, if the grammar file contains: %type <int> expression %start input expression (so "input" is valueless) we get typedef struct { int yystatus; } yyparse_input_t; yyparse_input_t yyparse_input (void); typedef struct { int yyvalue; int yystatus; } yyparse_expression_t; yyparse_expression_t yyparse_expression (void); This commit also changes the implementation of the parser termination: when there are multiple start symbols, it is the initial rules that explicitly YYACCEPT. They do that after having exported the start-symbol's value (if it is typed): switch (yyn) { case 1: /* $accept: YY_EXPRESSION expression $end */ { ((*yyvalue).TOK_expression) = (yyvsp[-1].TOK_expression); YYACCEPT; } break; case 2: /* $accept: YY_INPUT input $end */ { YYACCEPT; } break; I have tried several ways to deal with termination, and this is the one that appears the best one to me. It is also the most natural. * src/scan-code.h, src/scan-code.l (obstack_for_actions): New. * src/reader.c (grammar_rule_check_and_complete): Generate the actions of the rules for each start symbol. * data/skeletons/bison.m4 (b4_symbol_slot): New, with safer semantics than type and type_tag. * data/skeletons/yacc.c (b4_accept): New. Generates the body of the action of the start rules. (_b4_declare_sub_yyparse): For each start symbol define a dedicated return type for its parsing function. Adjust the declaration of its parsing function. (_b4_define_sub_yyparse): Adjust the definition of the function. * examples/c/lexcalc/parse.y: Check the case of valueless symbols. * examples/c/lexcalc/lexcalc.test: Check start symbols.	2020-09-27 09:44:18 +02:00
Akim Demaille	75c3746ce2	options: rename --defines as --header The name "defines" is incorrect, the generated file contains far more than just #defines. * src/getargs.h, src/getargs.c (-H, --header): New option. With optional argument, just like --defines, --xml, etc. (defines_flag): Rename as... (header_flag): this. Adjust dependencies. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/glr.c, * data/skeletons/glr.cc, data/skeletons/glr2.cc, data/skeletons/lalr1.cc, * data/skeletons/yacc.c: Adjust. * examples, doc/bison.texi: Adjust. * tests/headers.at, tests/local.at, tests/output.at: Convert most tests from using --defines to using --header.	2020-09-19 08:31:49 +02:00
Valentin Tolmer	ef09bf065a	glr2.cc: fork glr.cc to a c++ version This is a fork of glr.cc to be c++-first instead of a wrapper around glr.c. * data/skeletons/glr2.cc: New. * data/skeletons/bison.m4, data/skeletons/c++.m4: Adjust. * data/skeletons/c.m4 (b4_user_args_no_comma): New. * src/reader.c (grammar_rule_check_and_complete): glr2.cc is C++. * tests/actions.at, tests/c++.at, tests/calc.at, tests/conflicts.at, * tests/input.at, tests/local.at, tests/regression.at, tests/scanner.at, * tests/synclines.at, tests/types.at: Also check glr2.cc.	2020-08-30 10:45:21 +02:00
Akim Demaille	79bae4d6bb	kinds: use the symbol kinds where applicable Instead of generating switch statements with numbers, let's use the symbol kinds. Not only is this more readable, it also makes reading diff easier, as a change in symbol numbers won't have such a large effect on the implementation of symbol actions. * data/skeletons/bison.m4 (_b4_symbol_case): Use the symbol kind rather than its number.	2020-05-23 15:41:29 +02:00
Akim Demaille	23f7554188	kinds: also define the possibly qualified symbol kinds * data/skeletons/bison.m4 (b4_symbol_kind): Rename as... (b4_symbol_kind_base): this. (b4_symbol_kind): New, for fully qualified kind name. * data/skeletons/lalr1.cc (b4_symbol_kind): New. Adjust to use b4_symbol_kind where appropriate. * src/parse-gram.h, src/parse-gram.c: regen.	2020-05-23 15:41:29 +02:00
Akim Demaille	3ff248ebfe	m4: simplify useless quotation * data/skeletons/bison.m4: The result of b4_symbol is "quoted" already, no need for m4_expand.	2020-05-23 15:40:48 +02:00
Akim Demaille	4e1a5aae96	m4: use m4_shift2 etc. * data/skeletons/bison.m4 (m4_shift4): New. Use them where applicable.	2020-05-23 14:59:18 +02:00
Akim Demaille	e7aff57122	style: rename user_token_number as code This should have been done in 3.6, but I wanted to avoid introducing conflicts into Vincent's work on counterexamples. It turns out it's completely orthogonal. * data/README.md, data/skeletons/bison.m4, data/skeletons/c++.m4, * data/skeletons/c.m4, data/skeletons/glr.c, data/skeletons/java.m4, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, * data/skeletons/variant.hh, data/skeletons/yacc.c, src/conflicts.c, * src/derives.c, src/gram.c, src/gram.h, src/output.c, * src/parse-gram.c, src/parse-gram.y, src/print-xml.c, src/print.c, * src/reader.c, src/symtab.c, src/symtab.h, tests/input.at, * tests/types.at: s/user_token_number/code/g. Plus minor changes.	2020-05-23 08:43:58 +02:00
Akim Demaille	f80890c4bb	fix generated comments In Bison 3.6.2, the comments with brackets lose their brackets, for improper m4 quotation. * data/skeletons/bison.m4 (b4_gsub): New. * data/skeletons/c-like.m4 (_b4_comment): Use it. * tests/m4.at: Check b4_gsub.	2020-05-21 20:14:59 +02:00
Akim Demaille	cd4e799da4	error: rename the error token from YYERRCODE to YYerror See https://lists.gnu.org/r/bison-patches/2020-04/msg00162.html. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/glr.cc, * data/skeletons/lalr1.java, doc/bison.texi, * examples/c/bistromathic/parse.y, src/scan-gram.l, src/symtab.c (YYERRCODE): Rename as... (YYerror): this. Adjust dependencies.	2020-04-28 07:54:07 +02:00
Akim Demaille	286d0755f8	all: prefer YYERRCODE to YYERROR We will not keep YYERRCODE anyway, it causes backward compatibility issues. So as a first step, let all the skeletons use that name, until we have a better one. * data/skeletons/bison.m4, data/skeletons/glr.c, * data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, * data/skeletons/yacc.c, doc/bison.texi, tests/headers.at, * tests/input.at: here.	2020-04-26 15:09:51 +02:00
Akim Demaille	c4dbc1776c	skeletons: make the warning about implementation details clearer * data/skeletons/bison.m4 (b4_disclaimer): Here. * data/skeletons/lalr1.d, data/skeletons/lalr1.java: Use it.	2020-04-26 10:57:02 +02:00
Akim Demaille	bb7c4a5508	style: minor fixes * data/skeletons/bison.m4, doc/bison.texi: Spell check. * examples/c/bistromathic/parse.y (N_): Remove, now useless.	2020-04-25 08:00:08 +02:00
Akim Demaille	81334eb5a0	c, c++: provide a default definition for N_ In C/C++, N_ is a no-op. Define it if the user didn't. Suggested by Frank Heckenbach. https://lists.gnu.org/r/bug-bison/2020-04/msg00010.html * src/output.c (prepare_symbol_names): Rename has_translations as has_translations_flag. * data/skeletons/bison.m4 (b4_has_translations_if): New. * data/skeletons/java.m4 (b4_trans): Use it. * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/yacc.c (N_): Provide a default definition.	2020-04-20 07:37:45 +02:00
Akim Demaille	758172a8b9	doc: spell check * doc/bison.texi, NEWS, README-hacking.md: here. And elsewhere.	2020-04-13 18:50:05 +02:00
Akim Demaille	7a226860ef	doc: promote yytoken_kind_t, not yytokentype * data/skeletons/c.m4 (yytoken_kind_t): New. * data/skeletons/c++.m4, data/skeletons/lalr1.cc (yysymbol_kind_type): New. * examples/c/lexcalc/parse.y, examples/c/reccalc/parse.y, * tests/regression.at: Use them. * doc/bison.texi: Replace "enum yytokentype" by "yytoken_kind_t". (api.token.raw): Explain that it forces "yytoken_kind_t" to coincide with "yysymbol_kind_t". (Calling Convention): Mention YYEOF. (Table of Symbols): Add entries for "yytoken_kind_t" and "yysymbol_kind_t". (Glossary): Add entries for "Kind", "Token kind" and "Symbol kind".	2020-04-12 19:24:12 +02:00
Akim Demaille	e50de09886	tokens: properly define the YYEOF token kind Currently EOF is handled in an adhoc way, with a #define YYEOF 0 in the implementation file. As a result, the user has to define her own EOF token if she wants to use it, which is a pity. Give the $end token a visible kind name, YYEOF. Except that in C, where enums are not scoped, we would have collisions between all the definitions of YYEOFs in the header files, so in C, make it <api.PREFIX>EOF. * data/skeletons/c.m4 (YYEOF): Override its name to avoid collisions. Unless the user already gave it a different name. * data/skeletons/glr.c (YYEOF): Remove. Use ]b4_symbol(0, [id])[ instead. Add support for "pre_epilogue", for glr.cc. * data/skeletons/glr.cc: Remove dead code (never emitted #undefs). * data/skeletons/yacc.c * src/parse-gram.c * src/reader.c * src/symtab.c * tests/actions.at * tests/input.at	2020-04-12 13:56:44 +02:00
Akim Demaille	95421df67b	tokens: define the "$undefined" token kind * data/skeletons/bison.m4 (b4_symbol_token_kind): Give a definition to $undefined. (b4_token_visible_if): $undefined has an id. * src/output.c (prepare_symbol_definitions): Stop lying: $undefined _is_ a token. * tests/input.at: Adjust.	2020-04-12 13:56:43 +02:00
Akim Demaille	a4ed94bc13	tokens: properly define the "error" token kind There are people out there that do use YYERRCODE (the token kind of the error token). See for instance `3812012bb7/unixODBC-2.3.2/Drivers/nn/yylex.c`. Currently, YYERRCODE is defined by yacc.c in an adhoc way as a #define in the .c file only. It belongs with the other token kinds. YYERRCODE is not a nice name, it does not fit in our naming scheme. YYERROR would be more logical, but it collides with the YYERROR macro. Shall we keep the same name in all the skeletons? Besides, to avoid collisions in C, we need to apply the api prefix: YYERRCODE is actually <PREFIX>ERRCODE. This is not needed in the other languages. * data/skeletons/bison.m4 (b4_symbol_token_kind): New. Map the error token to "YYERRCODE". * data/skeletons/yacc.c (YYERRCODE): Don't define it, it's handled by... * src/output.c (prepare_symbol_definitions): this. * tests/input.at (Redefining the error token): Check it.	2020-04-12 13:56:43 +02:00
Akim Demaille	07726f1178	tokens: style: minor fixes * data/skeletons/bison.m4 (b4_symbol_kind): Dispatch on the UNDEF token number rather than its name. * data/skeletons/c++.m4, data/skeletons/c.m4, data/skeletons/java.m4: Comment changes.	2020-04-12 13:56:43 +02:00
Akim Demaille	007e1b5f0a	symbols: minor fixes * data/skeletons/bison.m4 (b4_symbol_kind): Series of _ are useless, one is enough. * data/skeletons/c.m4 (b4_token_enum): Fix overquoting.	2020-04-10 18:33:02 +02:00
Akim Demaille	bbb9750b3e	skeletons: introduce api.symbol.prefix * data/skeletons/bison.m4 (b4_symbol_prefix): New. (b4_symbol_kind): Use it. * data/skeletons/c++.m4, data/skeletons/c.m4, data/skeletons/d.m4 * data/skeletons/java.m4 (api.symbol.prefix): Provide a default value. * data/skeletons/glr.c, data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, data/skeletons/yacc.c: Adjust: use b4_symbol_prefix instead of YYSYMBOL_.	2020-04-07 08:40:16 +02:00
Akim Demaille	e657f04b62	c: make the symbol kind definition nicer to read From enum yysymbol_kind_t { YYSYMBOL_YYEMPTY = -2, YYSYMBOL_YYEOF = 0, YYSYMBOL_YYERROR = 1, YYSYMBOL_YYUNDEF = 2, to enum yysymbol_kind_t { YYSYMBOL_YYEMPTY = -2, YYSYMBOL_YYEOF = 0, /* "end of file" */ YYSYMBOL_YYERROR = 1, /* error */ YYSYMBOL_YYUNDEF = 2, /* $undefined */ * data/skeletons/bison.m4 (b4_last_symbol): New. (b4_symbol_enum, b4_symbol_enums): Reformat the output. * data/skeletons/c.m4	2020-04-06 18:43:34 +02:00
Akim Demaille	10e61eec6d	c: make the token kind definition nicer to read From enum gram_tokentype { GRAM_EOF = 0, STRING = 3, TSTRING = 4, PERCENT_TOKEN = 5, To enum gram_tokentype { GRAM_EOF = 0, /* "end of file" */ STRING = 3, /* "string" */ TSTRING = 4, /* "translatable string" */ PERCENT_TOKEN = 5, /* "%token" */ * data/skeletons/bison.m4 (b4_last_enum_token): New. * data/skeletons/c.m4 (b4_token_enum, b4_token_enums): Show the corresponding symbol.	2020-04-06 18:43:34 +02:00
Akim Demaille	f0bb82ae9e	skeletons: use consistently "kind" instead of "type" in the code * data/skeletons/bison.m4, data/skeletons/c++.m4, data/skeletons/c.m4, * data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java: Refer to the "kind" of a symbol, not its "type", where appropriate.	2020-04-05 19:14:39 +02:00
Akim Demaille	2b7bde9d13	m4: rename b4_symbol_sid as b4_symbol_kind * data/skeletons/bison.m4, data/skeletons/c++.m4, data/skeletons/c.m4, * data/skeletons/d.m4, data/skeletons/java.m4 (b4_symbol_sid): Rename as... (b4_symbol_kind): this. Adjust dependencies. * data/README.md: Document the kind.	2020-04-05 14:56:19 +02:00
Akim Demaille	fd37eb057e	yysymbol_type_t: always assign an enumerator Currently we define enumerators only for symbols that have an identifier. That rules out tokens such as '+', and nonterminals such as foo-bar and foo.bar. As a consequence we are taking chances: the compiler might compile yysymbol_type_t as too small an integral type for some symbol codes. * data/skeletons/bison.m4 (b4_symbol_sid): Forge a unique symbol identifier for symbols that don't have an ID.	2020-04-01 08:31:48 +02:00
Akim Demaille	75a605454d	yacc.c: prefer YYSYMBOL_YYERROR to YYSYMBOL_error * data/skeletons/bison.m4 (b4_symbol_sid): Map "error" to YYSYMBOL_YYERROR. * data/skeletons/yacc.c: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	f3c18c8e80	yacc.c: also define a symbol number for the empty token This is not only cleaner, it also protects us from mixing signed values (YYEMPTY is #defined as -2) with unsigned types (the yysymbol_type_t enum is typically compiled as a small unsigned). For instance GCC 9: input.c: In function 'yyparse': input.c:1107:7: error: conversion to 'unsigned int' from 'int' may change the sign of the result [-Werror=sign-conversion] 1107 | yyn += yytoken; | ^~ input.c:1107:10: error: conversion to 'int' from 'unsigned int' may change the sign of the result [-Werror=sign-conversion] 1107 | yyn += yytoken; | ^~~~~~~ input.c:1108:47: error: comparison of integer expressions of different signedness: 'yytype_int8' {aka 'const signed char'} and 'yysymbol_type_t' {aka 'enum yysymbol_type_t'} [-Werror=sign-compare] 1108 | if (yyn < 0 || YYLAST < yyn || yycheck[yyn] != yytoken) | ^~ input.c:702:25: error: operand of ?: changes signedness from 'int' to 'unsigned int' due to unsignedness of other operand [-Werror=sign-compare] 702 | #define YYEMPTY (-2) | ^~~~ input.c:1220:33: note: in expansion of macro 'YYEMPTY' 1220 | yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); | ^~~~~~~ input.c:1220:41: error: unsigned conversion from 'int' to 'unsigned int' changes value from '-2' to '4294967294' [-Werror=sign-conversion] 1220 | yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); | ^ Eventually, it might be interesting to move away from -2 (which is the only possible negative symbol number) and use the next available number, to save bits. We could actually even simply use "0" and shift the rest, which would allow to write "!yytoken" to mean really "yytoken != YYEMPTY". * data/skeletons/c.m4 (b4_declare_symbol_enum): Define YYSYMBOL_YYEMPTY. * data/skeletons/yacc.c: Use it. * src/parse-gram.y (yyreport_syntax_error): Use YYSYMBOL_YYEMPTY, not YYEMPTY, when dealing with a symbol. * tests/regression.at: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	3ba001baac	yacc.c: introduce an enum that defines the symbol's number There's a number of advantage in exposing the symbol (internal) numbers: - custom error messages can use them to decide how to represent a given symbol, or a set of symbols. - we need something similar in uses of yyexpected_tokens. For instance, currently, bistromathic's completion() reads: int ntokens = expected_tokens (line, tokens, YYNTOKENS); [...] for (int i = 0; i < ntokens; ++i) if (tokens[i] == YYTRANSLATE (TOK_VAR)) [...] else if (tokens[i] == YYTRANSLATE (TOK_FUN)) [...] else [...] - now that it's a compile-time expression, we can easily build static tables, switch, etc. - some users depended on the ability to get the token number from a symbol to write test cases for their scanners. But Bison 3.5 removed the table this feature depended upon (a reverse yytranslate). Now they can check against the actual symbol number, without having pay (space and time) a conversion. See https://lists.gnu.org/r/bug-bison/2020-01/msg00001.html, and https://lists.gnu.org/archive/html/bug-bison/2020-03/msg00015.html. - it helps us clearly separate the internal symbol numbers from the external token numbers, whose difference is sometimes blurred in the code when values coincide (e.g. "yychar = yytoken = YYEOF"). - it allows us to get rid of ugly macros with inconsistent names such as YYUNDEFTOK and YYTERROR, and to group related definitions together. - similarly it provides a clean access to the $accept symbol (which proves convenient in a current experimentation of mine with several %start symbols). Let's declare this type as a private type (in the .c file, not the .h one). So it does not need to be influenced by the api prefix. * data/skeletons/bison.m4 (b4_symbol_sid): New. (b4_symbol): Use it. * data/skeletons/c.m4 (b4_symbol_enum, b4_declare_symbol_enum): New. * data/skeletons/yacc.c: Use b4_declare_symbol_enum. (YYUNDEFTOK, YYTERROR): Remove. Use the corresponding symbol enum instead.	2020-04-01 08:31:33 +02:00
Akim Demaille	4140320a0a	style: comment changes about token numbers * data/skeletons/bison.m4, data/skeletons/c.m4: here.	2020-03-30 08:41:12 +02:00
Akim Demaille	77bdcc6f0c	parse.error: document and diagnose the incompatibility with %token-table * doc/bison.texi (Tokens from Literals): Move to code using %token-table to... (Decl Summary: %token-table): here. * data/skeletons/bison.m4: Implement mutual exclusion. * tests/input.at: Check it. * doc/local.mk: Be robust to the removal of doc/.	2020-02-10 20:15:46 +01:00
Akim Demaille	bc74b4b15a	skeletons: avoid b4_error_verbose_if, which is confusing parse.error has more than two possible values. * data/skeletons/bison.m4 (b4_error_verbose_if, b4_error_verbose_flag): Remove. (b4_parse_error_case, b4_parse_error_bmatch): New. Adjust dependencies.	2020-02-10 07:24:38 +01:00
Akim Demaille	8dd8137c38	skeletons: decorelate %token-table from verbose error messages Reported by Adrian Vogelsgesang. * data/skeletons/bison.m4: Here. * data/skeletons/lalr1.cc: Adjust.	2020-02-10 07:24:38 +01:00
Akim Demaille	650b253843	m4: fix b4_token_format We used to emit: /** Token number,to be returned by the scanner. */ static final int NUM = 258; /** Token number,to be returned by the scanner. */ static final int NEG = 259; with no space after the comma. Fix that. * data/skeletons/bison.m4 (b4_token_format): Quote where appropriate.	2020-02-08 11:24:53 +01:00
Akim Demaille	f443673450	yacc.c: add support for parse.error detailed "detailed" error messages are almost like "verbose", except that we don't double escape them, they don't get inner quotes, we don't use yytnamerr, and we hide the table. "custom" is exposed with the "detailed" tokens, not the "verbose" ones: they are not double-quoted. Because there's a risk that some people use yytname even without "verbose", let's keep yytname (instead of yys_name) in "simple" parse.error. * src/output.c (prepare_symbol_names): Be ready to output symbol names unquoted. (prepare_symbol_names): Output both the old tname table, and the new symbol_names one. * data/skeletons/bison.m4: Accept 'detailed'. * data/skeletons/yacc.c: When parse.error is 'detailed', don't emit yytname and yytnamerr, just yysymbol_name with the table inside. * tests/calc.at: Adjust.	2020-01-19 14:51:14 +01:00
Akim Demaille	cda1934606	yacc.c: add custom error message generation When parse.error is custom, let users define a yyreport_syntax_error function, and use it. * data/skeletons/bison.m4 (b4_error_verbose_if): Accept 'custom'. * data/skeletons/yacc.c: Implement it. * examples/c/calc/calc.y: Experiment with it.	2020-01-17 06:49:59 +01:00
Akim Demaille	8036635251	package: bump copyrights to 2020 Run 'make update-copyright'.	2020-01-05 10:26:35 +01:00
Akim Demaille	fc2040a750	c++: fix comments for %code blocks In a project of mine, vcsn, this commit fixes the following comments. --- /tmp/parse.hh 2019-12-08 15:51:24.792934703 +0100 +++ lib/vcsn/rat/parse.hh 2019-12-08 16:00:59.137107503 +0100 @@ -43,7 +43,7 @@ #ifndef YY_YY_USERS_AKIM_SRC_LRDE_2_LIB_VCSN_RAT_PARSE_HH_INCLUDED # define YY_YY_USERS_AKIM_SRC_LRDE_2_LIB_VCSN_RAT_PARSE_HH_INCLUDED -// // "%code requires" blocks. +// "%code requires" blocks. #line 20 "/Users/akim/src/lrde/2/lib/vcsn/rat/parse.yy" #include <iostream> @@ -1851,7 +1851,7 @@ -// // "%code provides" blocks. +// "%code provides" blocks. #line 60 "/Users/akim/src/lrde/2/lib/vcsn/rat/parse.yy" #define YY_DECL_(Class) \ * data/skeletons/bison.m4 (b4_percent_code_get): Pass an expanded string to b4_comment.	2019-12-08 16:03:36 +01:00
Akim Demaille	9e9e49224f	diagnostics: style changes * src/complain.h, src/complain.c: Comment changes. * src/scan-skel.l: Reduce scopes. * data/skeletons/bison.m4: Factor diagnostic functions.	2019-12-02 19:35:01 +01:00
Akim Demaille	9861bcc540	api.token.raw: implement Bison used to feature %raw, documented as follows: @item %raw The output file @file{@var{name}.h} normally defines the tokens with Yacc-compatible token numbers. If this option is specified, the internal Bison numbers are used instead. (Yacc-compatible numbers start at 257 except for single character tokens; Bison assigns token numbers sequentially for all tokens starting at 3.) Unfortunately, as far as I can tell, it never worked: token numbers are indeed changed in the generated tables (from external token number to internal), yet the code was still applying the mapping from external token numbers to internal token numbers. This commit reintroduces the feature as it was expected to be. * data/skeletons/bison.m4 (b4_token_format): When api.token.raw is enabled, use the internal token number. * data/skeletons/yacc.c (yytranslate): Don't emit if api.token.raw is enabled. (YYTRANSLATE): Adjust.	2019-09-14 09:55:17 +02:00
Akim Demaille	1161649446	preserve the indentation in the ouput Preserve the actions' initial indentation. For instance, on | %define api.value.type {int} | %% | exp: exp '/' exp { if ($3) | $$ = $1 + $3; | else | $$ = 0; } we used to generate | { if (yyvsp[0]) | yyval = yyvsp[-2] + yyvsp[0]; | else | yyval = 0; } now we produce | { if (yyvsp[0]) | yyval = yyvsp[-2] + yyvsp[0]; | else | yyval = 0; } See https://lists.gnu.org/archive/html/bison-patches/2019-06/msg00012.html. * data/skeletons/bison.m4 (b4_symbol_action): Output the code in column 0, leave indentation matters to the C code. * src/output.c (user_actions_output): Preserve the incoming indentation in the output. (prepare_symbol_definitions): Likewise for %printer/%destructor. * tests/synclines.at (Output columns): New.	2019-07-02 07:38:52 +02:00
Akim Demaille	9260e5ca4f	api.location.type: support it in C Reported by Balázs Scheidler. * data/skeletons/c.m4 (b4_location_type_define): Use api.location.type if defined. * doc/bison.texi: Document it. * tests/local.at (AT_C_IF, AT_LANG_CASE): New. Support Span in C. * tests/calc.at (Span): Convert it to be usable in C and C++. Check api.location.type with yacc.c and glr.c.	2019-04-25 20:20:59 +02:00


58 Commits