bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-03-09 04:13:03 +00:00

Author	SHA1	Message	Date
Akim Demaille	d9a9b054ae	all: fix the interface of yyexpected_tokens The user gives yyexpected_tokens a limit: the max number of tokens she wants to hear about. That's because an error message that reports a bazillion of possible tokens is useless. In that case yyexpected_tokens returned 0, so the user would not know if there are too many expected tokens or none (yes, that's possible). There are several ways to tell the user in which situation she's in: - return some E2MANY, a negative value. Then it makes the pattern int argsize = yypcontext_expected_tokens (ctx, arg, ARGS_MAX); if (argsize < 0) return argsize; no longer valid, as for E2MANY (i) the user must generate the error message anyway, and (ii) she should not return E2MANY - return ARGS_MAX + 1. Then it makes it dangerous for the user, as she has to iterate update `min (ARGS_MAX, argsize)`. Returning 0 is definitely simpler and safer for the user, as it tells her "this is not an error, just generate your message without a list of expecting tokens". So let's still return 0, but set arg[0] to the empty token when the list is really empty. * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/lalr1.java * data/skeletons/yacc.c (yyexpected_tokens): Put the empty symbol first if there are no possible tokens at all. * examples/c/bistromathic/parse.y: Demonstrate how to use that.	2020-05-06 08:11:52 +02:00
Akim Demaille	4b85b969d0	glr.c: beware of portability issues with PTRDIFF_MAX For instance test 386, "glr.cc api.value.type={double}": types.at:366: $CXX $CXXFLAGS $CPPFLAGS $LDFLAGS -o test test.cc $LIBS stderr: test.cc: In function 'ptrdiff_t yysplitStack(yyGLRStack, ptrdiff_t)': test.cc:490:4: error: 'PTRDIFF_MAX' was not declared in this scope (PTRDIFF_MAX < SIZE_MAX ? PTRDIFF_MAX : YY_CAST (ptrdiff_t, SIZE_MAX)) ^ test.cc:1805:37: note: in expansion of macro 'YYSIZEMAX' ptrdiff_t half_max_capacity = YYSIZEMAX / 2 / state_size; ^~~~~~~~~ test.cc:490:4: note: suggested alternative: '__PTRDIFF_MAX__' (PTRDIFF_MAX < SIZE_MAX ? PTRDIFF_MAX : YY_CAST (ptrdiff_t, SIZE_MAX)) ^ test.cc:1805:37: note: in expansion of macro 'YYSIZEMAX' ptrdiff_t half_max_capacity = YYSIZEMAX / 2 / state_size; ^~~~~~~~~ The failing tests are using glr.cc only, which I don't understand, the problem is rather in glr.c, so I would expect glr.c tests to also fail. Reported by Bruno Haible. https://lists.gnu.org/archive/html/bug-bison/2020-05/msg00053.html data/skeletons/yacc.c: Move the block that defines YYPTRDIFF_T/YYPTRDIFF_MAXIMUM, YYSIZE_T/YYSIZE_MAXIMUM, and YYSIZEOF to... * data/skeletons/c.m4 (b4_sizes_types_define): Here. (b4_c99_int_type): Also take care of the #undefinition of short. * data/skeletons/yacc.c, data/skeletons/glr.c: Use b4_sizes_types_define. * data/skeletons/glr.c: Adjust to use YYPTRDIFF_T/YYPTRDIFF_MAXIMUM, YYSIZE_T/YYSIZE_MAXIMUM.	2020-05-04 07:44:42 +02:00
Akim Demaille	76c3bccf40	yacc.c: improve formatting of the generated code * data/skeletons/yacc.c (yy_reduce_print): here.	2020-05-02 10:17:01 +02:00
Akim Demaille	fb1d76d9a9	yacc.c: avoid the use of a temporary * data/skeletons/yacc.c: Use YYLLOC_DEFAULT directly with the final destination.	2020-04-30 08:07:55 +02:00
Akim Demaille	3b05de2d05	yacc.c: install backward compatibility for YYERRCODE Some people have been using that symbol. Some even have #defined it themselves. https://lists.gnu.org/r/bison-patches/2020-04/msg00138.html Let's provide backward compatibility, having it point to YYUNDEF, so that an error message is generated. * data/skeletons/yacc.c (YYERRCODE): New, at the exact same location it was defined before.	2020-04-28 08:26:49 +02:00
Akim Demaille	b254b36db8	all: don't emit an error message when the scanner returns YYERRCODE I'm quite pleased to see that the tricky case of glr.c was already prepared by the changes to support syntax_error exceptions. Better yet, it is actually syntax_error that becomes a special case of the general pattern: make yytoken be YYERRCODE. * data/skeletons/glr.c (YYFAULTYTOK): Remove the now useless (Basil) Faulty token. Instead, use the error token. * data/skeletons/lalr1.d, data/skeletons/lalr1.java: When computing the action, first check the case of the error token. * tests/calc.at: Check cases for the error token symbols before and after it.	2020-04-26 19:55:52 +02:00
Akim Demaille	58e79539fc	c: don't emit an error message when the scanner returns YYERRCODE * data/skeletons/yacc.c (yyparse): When the scanner returns YYERRCODE, go directly to error recovery (yyerrlab1). However, don't keep the error token as lookahead, that token is too special. * data/skeletons/lalr1.cc: Likewise. * examples/c/bistromathic/parse.y (yylex): Use that feature to report nicely invalid characters. * examples/c/bistromathic/bistromathic.test: Check that. * examples/test: Neutralize gratuitous differences such as rule position. * tests/calc.at: Check that case in C only. The other case seem to be working, but that's an illusion that the next commit will address (in fact, they can enter endless loops, and report the error several times anyway).	2020-04-26 18:05:30 +02:00
Akim Demaille	286d0755f8	all: prefer YYERRCODE to YYERROR We will not keep YYERRCODE anyway, it causes backward compatibility issues. So as a first step, let all the skeletons use that name, until we have a better one. * data/skeletons/bison.m4, data/skeletons/glr.c, * data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, * data/skeletons/yacc.c, doc/bison.texi, tests/headers.at, * tests/input.at: here.	2020-04-26 15:09:51 +02:00
Akim Demaille	9c72d3c5a8	style: prefer b4_has_translations_if * data/skeletons/glr.c, data/skeletons/yacc.c: here.	2020-04-26 13:34:30 +02:00
Akim Demaille	22eeb1ab8a	style: fix a few remaining 'type' instead of 'kind' * data/skeletons/glr.c, data/skeletons/yacc.c (YY_SYMBOL_PRINT): Here.	2020-04-26 10:57:22 +02:00
Akim Demaille	c4dbc1776c	skeletons: make the warning about implementation details clearer * data/skeletons/bison.m4 (b4_disclaimer): Here. * data/skeletons/lalr1.d, data/skeletons/lalr1.java: Use it.	2020-04-26 10:57:02 +02:00
Akim Demaille	b74fc07d21	style: c: fix a few minor issues about indentation of cpp directives * README-hacking.md: More about cpp. * data/skeletons/c.m4, data/skeletons/yacc.c: Style changes.	2020-04-25 12:16:57 +02:00
Akim Demaille	150dc95395	style: clarify #endif We could try to avoid the weird "#if 1", but then the indentation of the inner #if would be wrong. Let' keep it this way. * data/skeletons/yacc.c: here. Also, avoid sticking the comment to the directive.	2020-04-25 11:06:16 +02:00
Akim Demaille	81334eb5a0	c, c++: provide a default definition for N_ In C/C++, N_ is a no-op. Define it if the user didn't. Suggested by Frank Heckenbach. https://lists.gnu.org/r/bug-bison/2020-04/msg00010.html * src/output.c (prepare_symbol_names): Rename has_translations as has_translations_flag. * data/skeletons/bison.m4 (b4_has_translations_if): New. * data/skeletons/java.m4 (b4_trans): Use it. * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/yacc.c (N_): Provide a default definition.	2020-04-20 07:37:45 +02:00
Akim Demaille	9b7e7077dd	style: fix comments * data/skeletons/glr.c, data/skeletons/lalr1.cc, * data/skeletons/yacc.c: here.	2020-04-19 15:40:12 +02:00
Akim Demaille	caadfc552b	skeletons: use symbol(-2, kind) Not all the symbols have a fixed symbol code. UNDEF's one is fixed: -2. * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/lalr1.d, * data/skeletons/yacc.c: here.	2020-04-16 07:35:06 +02:00
Akim Demaille	c4c25e091c	style: comments changes about error handling * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/lalr1.d, * data/skeletons/lalr1.java, data/skeletons/yacc.c: here. * data/skeletons/lalr1.cc: Reduce scope.	2020-04-16 07:34:37 +02:00
Akim Demaille	64aec0a8d8	c, c++: also define YYEMPTY in yytoken_kind_t I have been hesitating a lot before doing it ---after all the user must not use this kind, so what's the point of showing it in yytoken_kind_t. And eventually I chose to play it safe with the typing system and make it possible to use yytoken_kind_t for all the tokens, even the "empty token". * data/skeletons/c.m4: Give an id and a tag to YYEMPTY. (b4_token_enums): Define YYEMPTY. * data/skeletons/c++.m4 (b4_token_enums): Define YYEMPTY. * data/skeletons/glr.c, data/skeletons/glr.cc, data/skeletons/yacc.c: (YYEMPTY): Remove. Use b4_symbol(-2, id) instead.	2020-04-13 16:49:48 +02:00
Akim Demaille	5839f4d289	c: rename yyexpected_tokens as yypcontext_expected_tokens The user should think of yypcontext fields as accessible only via yypcontext_* functions. So let's rename yyexpected_tokens to reflect that. Let's _not_ rename yyreport_syntax_error, as the user may define this function, and is not allowed to access directly the fields of yypcontext_t: she must use the "accessors". This is comparable to the case of C++/Java where the user defines parser::report_syntax_error, not parser::context::report_syntax_error. * data/skeletons/glr.c, data/skeletons/yacc.c (yyexpected_tokens): Rename as... (yypcontext_expected_tokens): this. Adjust dependencies.	2020-04-12 19:23:40 +02:00
Akim Demaille	e50de09886	tokens: properly define the YYEOF token kind Currently EOF is handled in an adhoc way, with a #define YYEOF 0 in the implementation file. As a result, the user has to define her own EOF token if she wants to use it, which is a pity. Give the $end token a visible kind name, YYEOF. Except that in C, where enums are not scoped, we would have collisions between all the definitions of YYEOFs in the header files, so in C, make it <api.PREFIX>EOF. * data/skeletons/c.m4 (YYEOF): Override its name to avoid collisions. Unless the user already gave it a different name. * data/skeletons/glr.c (YYEOF): Remove. Use ]b4_symbol(0, [id])[ instead. Add support for "pre_epilogue", for glr.cc. * data/skeletons/glr.cc: Remove dead code (never emitted #undefs). * data/skeletons/yacc.c * src/parse-gram.c * src/reader.c * src/symtab.c * tests/actions.at * tests/input.at	2020-04-12 13:56:44 +02:00
Akim Demaille	a4ed94bc13	tokens: properly define the "error" token kind There are people out there that do use YYERRCODE (the token kind of the error token). See for instance `3812012bb7/unixODBC-2.3.2/Drivers/nn/yylex.c`. Currently, YYERRCODE is defined by yacc.c in an adhoc way as a #define in the .c file only. It belongs with the other token kinds. YYERRCODE is not a nice name, it does not fit in our naming scheme. YYERROR would be more logical, but it collides with the YYERROR macro. Shall we keep the same name in all the skeletons? Besides, to avoid collisions in C, we need to apply the api prefix: YYERRCODE is actually <PREFIX>ERRCODE. This is not needed in the other languages. data/skeletons/bison.m4 (b4_symbol_token_kind): New. Map the error token to "YYERRCODE". * data/skeletons/yacc.c (YYERRCODE): Don't define it, it's handled by... * src/output.c (prepare_symbol_definitions): this. * tests/input.at (Redefining the error token): Check it.	2020-04-12 13:56:43 +02:00
Akim Demaille	8dcc25a1e4	style: rename YYNOMEM as YYENOMEM This is clearer. * data/skeletons/glr.c, data/skeletons/yacc.c (YYNOMEM): Rename as... (YYENOMEM): here.	2020-04-10 18:35:29 +02:00
Akim Demaille	bbb9750b3e	skeletons: introduce api.symbol.prefix * data/skeletons/bison.m4 (b4_symbol_prefix): New. (b4_symbol_kind): Use it. * data/skeletons/c++.m4, data/skeletons/c.m4, data/skeletons/d.m4 * data/skeletons/java.m4 (api.symbol.prefix): Provide a default value. * data/skeletons/glr.c, data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, data/skeletons/yacc.c: Adjust: use b4_symbol_prefix instead of YYSYMBOL_.	2020-04-07 08:40:16 +02:00
Akim Demaille	87579e03e0	skeletons: beware not to use yyarg when it's null Reported by Adrian Vogelsgesang. * data/skeletons/glr.c, data/skeletons/lalr1.cc, * data/skeletons/lalr1.java, data/skeletons/yacc.c: Here.	2020-04-06 19:14:11 +02:00
Akim Demaille	f0bb82ae9e	skeletons: use consistently "kind" instead of "type" in the code * data/skeletons/bison.m4, data/skeletons/c++.m4, data/skeletons/c.m4, * data/skeletons/glr.cc, data/skeletons/lalr1.cc, * data/skeletons/lalr1.d, data/skeletons/lalr1.java: Refer to the "kind" of a symbol, not its "type", where appropriate.	2020-04-05 19:14:39 +02:00
Akim Demaille	2c05fc750a	c, c++: rename yysymbol_type_t as yysymbol_kind_t See https://lists.gnu.org/r/bison-patches/2020-04/msg00031.html * data/skeletons/c.m4, data/skeletons/glr.c, data/skeletons/yacc.c (yysymbol_type_t): Rename as... (yysymbol_kind_t): this. Adjust dependencies. * data/skeletons/c++.m4, data/skeletons/glr.cc, data/skeletons/lalr1.cc (symbol_type_type): Rename as... (symbol_kind_type): this. Adjust dependencies.	2020-04-05 14:56:18 +02:00
Akim Demaille	4e26809ab9	style: rename yysyntax_error_arguments as yy_syntax_error_arguments It's a private implementation detail. * NEWS, data/skeletons/glr.c, data/skeletons/lalr1.cc, * data/skeletons/yacc.c, doc/bison.texi: here.	2020-04-05 08:56:21 +02:00
Akim Demaille	76e11b5a3e	c: rename yyparse_context_t as yypcontext_t The first name is too long. We already have `yypstate`, so `yypcontext` is ok. We are also migrating to using `_t` for our types. NEWS, data/skeletons/glr.c, data/skeletons/yacc.c, doc/bison.texi, * examples/c/bistromathic/parse.y, src/parse-gram.y, tests/local.at: (yyparse_context_t, yyparse_context_location, yyparse_context_token): Rename as... (yypcontext_t, yypcontext_location, yypcontext_token): these.	2020-04-04 19:20:29 +02:00
Akim Demaille	086506bf23	glr.c, yacc.c: propagate yysymbol_type_t Now that yacc.c and glr.c both know yysymbol_type_t, convert the common routines. * data/skeletons/c.m4 (yydestruct, yy_symbol_value_print) (yy_symbol_print): Use yysymbol_type_t instead of int. * data/skeletons/glr.c: Use yySymbol where appropriate. * data/skeletons/yacc.c (YY_ACCESSING_SYMBOL): New wrapper around yystos. Use it. * tests/local.at (yyreport_syntax_error): Use yysymbol_type_t where appropriate.	2020-04-01 08:31:48 +02:00
Akim Demaille	9039c571f4	yacc.c: fix more errors from make maintainer-check-g++ * data/skeletons/yacc.c (yyexpected_tokens): Use casts where needed.	2020-04-01 08:31:48 +02:00
Akim Demaille	9434571f95	yacc.c: revert to not using yysymbol_type_t in the yytranslate table This triggers warnings with several compilers. For instance ICC fills the logs with pages and pages of input.c(477): error: a value of type "int" cannot be used to initialize an entity of type "const yysymbol_type_t={yysymbol_type_t}" 0, 2, 2, 2, 2, 2, 2, 2, 2, 2, ^ input.c(477): error: a value of type "int" cannot be used to initialize an entity of type "const yysymbol_type_t={yysymbol_type_t}" 0, 2, 2, 2, 2, 2, 2, 2, 2, 2, ^ And so does G++9 when compiling yacc.c's (C) output input.c:545:8: error: invalid conversion from 'int' to 'yysymbol_type_t' [-fpermissive] 545 \| 0, 5, 9, 2, 2, 2, 2, 2, 2, 2, \| ^ \| \| \| int input.c:545:15: error: invalid conversion from 'int' to 'yysymbol_type_t' [-fpermissive] 545 \| 0, 5, 9, 2, 2, 2, 2, 2, 2, 2, \| ^ \| \| \| int Clang++ is no exception input.c:545:8: error: cannot initialize an array element of type 'const yysymbol_type_t' with an rvalue of type 'int' 0, 5, 9, 2, 2, 2, 2, 2, 2, 2, ^ input.c:545:15: error: cannot initialize an array element of type 'const yysymbol_type_t' with an rvalue of type 'int' 0, 5, 9, 2, 2, 2, 2, 2, 2, 2, ^ At some point we could use yysymbol_type_t's enumerators to define yytranslate. Meanwhile... * data/skeletons/yacc.c (yytranslate): Use the original integral type to define it. (YYTRANSLATE): Cast the result into yysymbol_type_t.	2020-04-01 08:31:48 +02:00
Akim Demaille	75a605454d	yacc.c: prefer YYSYMBOL_YYERROR to YYSYMBOL_error * data/skeletons/bison.m4 (b4_symbol_sid): Map "error" to YYSYMBOL_YYERROR. * data/skeletons/yacc.c: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	f3c18c8e80	yacc.c: also define a symbol number for the empty token This is not only cleaner, it also protects us from mixing signed values (YYEMPTY is #defined as -2) with unsigned types (the yysymbol_type_t enum is typically compiled as a small unsigned). For instance GCC 9: input.c: In function 'yyparse': input.c:1107:7: error: conversion to 'unsigned int' from 'int' may change the sign of the result [-Werror=sign-conversion] 1107 \| yyn += yytoken; \| ^~ input.c:1107:10: error: conversion to 'int' from 'unsigned int' may change the sign of the result [-Werror=sign-conversion] 1107 \| yyn += yytoken; \| ^~~~~~~ input.c:1108:47: error: comparison of integer expressions of different signedness: 'yytype_int8' {aka 'const signed char'} and 'yysymbol_type_t' {aka 'enum yysymbol_type_t'} [-Werror=sign-compare] 1108 \| if (yyn < 0 \|\| YYLAST < yyn \|\| yycheck[yyn] != yytoken) \| ^~ input.c:702:25: error: operand of ?: changes signedness from 'int' to 'unsigned int' due to unsignedness of other operand [-Werror=sign-compare] 702 \| #define YYEMPTY (-2) \| ^~~~ input.c:1220:33: note: in expansion of macro 'YYEMPTY' 1220 \| yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); \| ^~~~~~~ input.c:1220:41: error: unsigned conversion from 'int' to 'unsigned int' changes value from '-2' to '4294967294' [-Werror=sign-conversion] 1220 \| yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); \| ^ Eventually, it might be interesting to move away from -2 (which is the only possible negative symbol number) and use the next available number, to save bits. We could actually even simply use "0" and shift the rest, which would allow to write "!yytoken" to mean really "yytoken != YYEMPTY". * data/skeletons/c.m4 (b4_declare_symbol_enum): Define YYSYMBOL_YYEMPTY. * data/skeletons/yacc.c: Use it. * src/parse-gram.y (yyreport_syntax_error): Use YYSYMBOL_YYEMPTY, not YYEMPTY, when dealing with a symbol. * tests/regression.at: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	00c80bc96c	yacc.c: use yysymbol_type_t instead of int for yytoken Now that we have a proper type for internal symbol numbers, let's use it. More code needs conversion, e.g., printers and destructors, but they are shared with glr.c, which is not ready yet for this change. It will also help us deal with warnings such as (GCC9 on GNU/Linux): input.c: In function 'int yyparse()': input.c:475:37: error: enumeral and non-enumeral type in conditional expression [-Werror=extra] 475 \| (0 <= (YYX) && (YYX) <= YYMAXUTOK ? yytranslate[YYX] : YYSYMBOL_YYUNDEF) \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ input.c:1024:17: note: in expansion of macro 'YYTRANSLATE' 1024 \| yytoken = YYTRANSLATE (yychar); \| ^~~~~~~~~~~ * data/skeletons/yacc.c (yytranslate, yysymbol_name) (yyparse_context_t, yyexpected_tokens, yypstate_expected_tokens) (yysyntax_error_arguments): Use yysymbol_type_t instead of int.	2020-04-01 08:31:48 +02:00
Akim Demaille	3ba001baac	yacc.c: introduce an enum that defines the symbol's number There's a number of advantage in exposing the symbol (internal) numbers: - custom error messages can use them to decide how to represent a given symbol, or a set of symbols. - we need something similar in uses of yyexpected_tokens. For instance, currently, bistromathic's completion() reads: int ntokens = expected_tokens (line, tokens, YYNTOKENS); [...] for (int i = 0; i < ntokens; ++i) if (tokens[i] == YYTRANSLATE (TOK_VAR)) [...] else if (tokens[i] == YYTRANSLATE (TOK_FUN)) [...] else [...] - now that it's a compile-time expression, we can easily build static tables, switch, etc. - some users depended on the ability to get the token number from a symbol to write test cases for their scanners. But Bison 3.5 removed the table this feature depended upon (a reverse yytranslate). Now they can check against the actual symbol number, without having pay (space and time) a conversion. See https://lists.gnu.org/r/bug-bison/2020-01/msg00001.html, and https://lists.gnu.org/archive/html/bug-bison/2020-03/msg00015.html. - it helps us clearly separate the internal symbol numbers from the external token numbers, whose difference is sometimes blurred in the code when values coincide (e.g. "yychar = yytoken = YYEOF"). - it allows us to get rid of ugly macros with inconsistent names such as YYUNDEFTOK and YYTERROR, and to group related definitions together. - similarly it provides a clean access to the $accept symbol (which proves convenient in a current experimentation of mine with several %start symbols). Let's declare this type as a private type (in the .c file, not the .h one). So it does not need to be influenced by the api prefix. * data/skeletons/bison.m4 (b4_symbol_sid): New. (b4_symbol): Use it. * data/skeletons/c.m4 (b4_symbol_enum, b4_declare_symbol_enum): New. * data/skeletons/yacc.c: Use b4_declare_symbol_enum. (YYUNDEFTOK, YYTERROR): Remove. Use the corresponding symbol enum instead.	2020-04-01 08:31:33 +02:00
Akim Demaille	59d820d1ef	c: use YYNOMEM instead of -2 See `84b1972c96`. * data/skeletons/glr.c, data/skeletons/yacc.c (YYNOMEM): New. Use it.	2020-03-28 15:13:27 +01:00
Akim Demaille	ee56b6e0f2	skeletons: make yysyntax_error_arguments a private detail We could just "inline yysyntax_error_arguments back" in the routines it was originally extracted from, but I think the code is nicer to read this way. * data/skeletons/glr.c (yysyntax_error_arguments): Generate only for detailed and verbose error messages. * data/skeletons/yacc.c: Likewise. * data/skeletons/lalr1.cc (parser::context::yysyntax_error_arguments): Move as... (parser::yysyntax_error_arguments_): this. And only for detailed and verbose error messages.	2020-03-28 15:13:27 +01:00
Akim Demaille	1045c8d0ef	examples: don't use yysyntax_error_arguments Suggested by Adrian Vogelsgesang. https://lists.gnu.org/archive/html/bison-patches/2020-02/msg00069.html * data/skeletons/lalr1.java (Context.EMPTY, Context.getToken): New. (Context.yyntokens): Rename as... (Context.NTOKENS): this. Because (i) all the Java coding styles recommend upper case for constants, and (ii) the Java Skeleton exposes Lexer.EOF, not Lexer.YYEOF. * data/skeletons/yacc.c (yyparse_context_token): New. * examples/c/bistromathic/parse.y (yyreport_syntax_error): Don't use yysyntax_error_arguments. * examples/java/calc/Calc.y (yyreportSyntaxError): Likewise.	2020-03-28 15:13:27 +01:00
Akim Demaille	ef8965b5f5	skeletons: fix incorrect type for translatable tokens * data/skeletons/glr.c, data/skeletons/lalr1.c, data/skeletons/yacc.c: Fix confusion between the "translatable" and the "translate" tables.	2020-03-28 15:13:27 +01:00
Akim Demaille	84b1972c96	yacc.c: use negative numbers for errors in auxiliary functions yyparse returns 0, 1, 2 since ages (accept, reject, memory exhausted). Some of our auxiliary functions such as yy_lac and yyreport_syntax_error also need to return error codes and also use 0, 1, 2. Because it uses yy_lac, yyexpected_tokens also needs to return "problem", "memory exhausted", but in case of success, it needs to return the number of tokens, so it cannot use 1 and 2 as error code. Currently it uses -1 and -2, which is later converted into 1 and 2 as yacc.c expects it. Let's simplify this and use consistently -1 and -2 for auxiliary functions that are not exposed (or not yet exposed) to the user. In particular this will save the user from having to convert yyexpected_tokens's -2 into yyreport_syntax_error's 2: both return -1 or -2. * data/skeletons/yacc.c (yy_lac, yyreport_syntax_error) (yy_lac_stack_realloc): Return -1, -2 for errors instead of 1, 2. Adjust callers. * examples/c/bistromathic/parse.y (yyreport_syntax_error): Do take error codes into account. Issue a syntax error message even if we ran out of memory. * src/parse-gram.y, tests/local.at (yyreport_syntax_error): Adjust.	2020-03-23 07:02:36 +01:00
Akim Demaille	1079595b2a	style: reduce length of private constant * data/skeletons/glr.c, data/skeletons/lalr1.cc, data/skeletons/yacc.c (YYERROR_VERBOSE_ARGS_MAXIMUM): Rename as... (YYARGS_MAX): this. * src/parse-gram.y (YYERROR_VERBOSE_ARGS_MAXIMUM): Rename as... (ARGS_MAX): this.	2020-03-23 07:02:34 +01:00
Akim Demaille	cbb967dbad	yacc.c: style: prefer switch to if * data/skeletons/yacc.c: Prefer switch to decode yy_lac's return value.	2020-03-17 19:21:07 +01:00
Akim Demaille	44ac18d136	yacc.c: yypstate_expected_tokens In push parsers, when asking for the list of expected tokens at some point, it makes no sense to build a yyparse_context_t: the yypstate alone suffices (the only difference being the lookahead). Instead of forcing the user to build a useless shell around yypstate, let's offer yypstate_expected_tokens. See https://lists.gnu.org/r/bison-patches/2020-03/msg00025.html. * data/skeletons/yacc.c (yypstate): Declare earlier, so that we can use it for... (yypstate_expected_tokens): this new function, when in push parsers. Adjust dependencies. * examples/c/bistromathic/parse.y: Simplify: use yypstate_expected_tokens. Style fixes. Reduce scopes (reported by Joel E. Denny).	2020-03-17 19:20:13 +01:00
Akim Demaille	951da960e6	merge branch 'maint' * upstream/maint: maint: post-release administrivia version 3.5.3 news: update for 3.5.3 yacc.c: make sure we properly propagated the user's number for error diagnostics: don't crash because of repeated definitions of error style: initialize some struct members diagnostics: beware of zero-width characters diagnostics: be sure to close the styling when lines are too short muscles: fix incorrect decoding of $ code: be robust to reference with invalid tags build: fix typo doc: update recommandation for libtextstyle style: comment changes examples: use consistently the GFDL header for readmes style: remove useless declarations typo: succesful -> successful README: point to tests/bison, and document --trace gnulib: update maint: post-release administrivia	2020-03-08 10:13:16 +01:00
Akim Demaille	e3812bb8c3	yacc.c: make sure we properly propagated the user's number for error * data/skeletons/yacc.c (YYERRCODE): Be truthful. * tests/input.at (Redefining the error token): Check that.	2020-03-08 08:10:11 +01:00
Akim Demaille	2353ce7216	yacc.c: simplify yyparse_context_t member names * data/skeletons/yacc.c (yyparse_context_t): Rename yyes_p and yyes_capacity_p as... (yyes, yyes_capacity): These.	2020-03-05 07:26:50 +01:00
Akim Demaille	9cc76ee62c	yacc.c: yyerror_range does not need to be preserved accross calls * data/skeletons/yacc.c (b4_parse_state_variable_macros): Don't define yyerror_range. (yyparse): Add yyerror_range as local variable.	2020-03-05 07:26:49 +01:00
Akim Demaille	2f83ef57f3	yacc.c: push: undefine the pstate macros for the epilogue * data/skeletons/yacc.c (b4_macro_define, b4_macro_undef) (b4_pstate_macro_define, b4_parse_state_variable_macros): New. Use them. * examples/c/bistromathic/parse.y: Remove now useless undefs.	2020-03-05 07:26:49 +01:00
Akim Demaille	744171ddbf	yacc.c: push: initialize the pstate variables in pstate_new Currently pstate_new does not set up its variables, this task is left to yypush_parse. This was probably to share more code with usual pull parsers, where these (local) variables are indeed initialized by yyparse. But as a consequence yyexpected_tokens crashes at the very beginning of the parse, since, for instance, the stacks are not even set up. See https://lists.gnu.org/r/bison-patches/2020-03/msg00001.html. The fix could have very simple, but the documentation actually makes it very clear that we can reuse a pstate for several parses: After yypush_parse returns a status other than YYPUSH_MORE, the parser instance yyps may be reused for a new parse. so we need to restore the parser to its pristine state so that (i) it is ready to run the next parse, (ii) it properly supports yyexpected_tokens for the next run. * data/skeletons/yacc.c (b4_initialize_parser_state_variables): New, extracted from the top of yyparse/yypush_parse. (yypstate_clear): New. (yypstate_new): Use it when push parsers are enabled. Define after the yyps macros so that we can use the same code as the regular pull parsers. (yyparse): Use it when push parsers are _not_ enabled. * examples/c/bistromathic/bistromathic.test: Check the completion on the beginning of the line.	2020-03-05 07:13:23 +01:00
Akim Demaille	4fd3282dd7	style: formatting changes * data/skeletons/yacc.c, tests/torture.at: here.	2020-03-04 08:24:36 +01:00

1 2 3

116 Commits