bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-06-10 09:42:36 +00:00

Author	SHA1	Message	Date
Akim Demaille	75a605454d	yacc.c: prefer YYSYMBOL_YYERROR to YYSYMBOL_error * data/skeletons/bison.m4 (b4_symbol_sid): Map "error" to YYSYMBOL_YYERROR. * data/skeletons/yacc.c: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	f3c18c8e80	yacc.c: also define a symbol number for the empty token This is not only cleaner, it also protects us from mixing signed values (YYEMPTY is #defined as -2) with unsigned types (the yysymbol_type_t enum is typically compiled as a small unsigned). For instance GCC 9: input.c: In function 'yyparse': input.c:1107:7: error: conversion to 'unsigned int' from 'int' may change the sign of the result [-Werror=sign-conversion] 1107 \| yyn += yytoken; \| ^~ input.c:1107:10: error: conversion to 'int' from 'unsigned int' may change the sign of the result [-Werror=sign-conversion] 1107 \| yyn += yytoken; \| ^~~~~~~ input.c:1108:47: error: comparison of integer expressions of different signedness: 'yytype_int8' {aka 'const signed char'} and 'yysymbol_type_t' {aka 'enum yysymbol_type_t'} [-Werror=sign-compare] 1108 \| if (yyn < 0 \|\| YYLAST < yyn \|\| yycheck[yyn] != yytoken) \| ^~ input.c:702:25: error: operand of ?: changes signedness from 'int' to 'unsigned int' due to unsignedness of other operand [-Werror=sign-compare] 702 \| #define YYEMPTY (-2) \| ^~~~ input.c:1220:33: note: in expansion of macro 'YYEMPTY' 1220 \| yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); \| ^~~~~~~ input.c:1220:41: error: unsigned conversion from 'int' to 'unsigned int' changes value from '-2' to '4294967294' [-Werror=sign-conversion] 1220 \| yytoken = yychar == YYEMPTY ? YYEMPTY : YYTRANSLATE (yychar); \| ^ Eventually, it might be interesting to move away from -2 (which is the only possible negative symbol number) and use the next available number, to save bits. We could actually even simply use "0" and shift the rest, which would allow to write "!yytoken" to mean really "yytoken != YYEMPTY". * data/skeletons/c.m4 (b4_declare_symbol_enum): Define YYSYMBOL_YYEMPTY. * data/skeletons/yacc.c: Use it. * src/parse-gram.y (yyreport_syntax_error): Use YYSYMBOL_YYEMPTY, not YYEMPTY, when dealing with a symbol. * tests/regression.at: Adjust.	2020-04-01 08:31:48 +02:00
Akim Demaille	3ba001baac	yacc.c: introduce an enum that defines the symbol's number There's a number of advantage in exposing the symbol (internal) numbers: - custom error messages can use them to decide how to represent a given symbol, or a set of symbols. - we need something similar in uses of yyexpected_tokens. For instance, currently, bistromathic's completion() reads: int ntokens = expected_tokens (line, tokens, YYNTOKENS); [...] for (int i = 0; i < ntokens; ++i) if (tokens[i] == YYTRANSLATE (TOK_VAR)) [...] else if (tokens[i] == YYTRANSLATE (TOK_FUN)) [...] else [...] - now that it's a compile-time expression, we can easily build static tables, switch, etc. - some users depended on the ability to get the token number from a symbol to write test cases for their scanners. But Bison 3.5 removed the table this feature depended upon (a reverse yytranslate). Now they can check against the actual symbol number, without having pay (space and time) a conversion. See https://lists.gnu.org/r/bug-bison/2020-01/msg00001.html, and https://lists.gnu.org/archive/html/bug-bison/2020-03/msg00015.html. - it helps us clearly separate the internal symbol numbers from the external token numbers, whose difference is sometimes blurred in the code when values coincide (e.g. "yychar = yytoken = YYEOF"). - it allows us to get rid of ugly macros with inconsistent names such as YYUNDEFTOK and YYTERROR, and to group related definitions together. - similarly it provides a clean access to the $accept symbol (which proves convenient in a current experimentation of mine with several %start symbols). Let's declare this type as a private type (in the .c file, not the .h one). So it does not need to be influenced by the api prefix. * data/skeletons/bison.m4 (b4_symbol_sid): New. (b4_symbol): Use it. * data/skeletons/c.m4 (b4_symbol_enum, b4_declare_symbol_enum): New. * data/skeletons/yacc.c: Use b4_declare_symbol_enum. (YYUNDEFTOK, YYTERROR): Remove. Use the corresponding symbol enum instead.	2020-04-01 08:31:33 +02:00
Akim Demaille	2c74872991	java: move away from _ for internationalization The "_" is becoming a keyword in Java, which causes tons of warnings currently in our test suite. GNU Gettext is now using "i18n" instead of "_" (https://git.savannah.gnu.org/gitweb/?p=gettext.git;a=commitdiff;h=e89fea36545f27487d9652a13e6a0adbea1117d0). * data/skeletons/java.m4: Use "i18n", not "_". * examples/java/calc/Calc.y, tests/calc.at: Adjust.	2020-03-30 08:03:10 +02:00
Akim Demaille	59d820d1ef	c: use YYNOMEM instead of -2 See `84b1972c96`. * data/skeletons/glr.c, data/skeletons/yacc.c (YYNOMEM): New. Use it.	2020-03-28 15:13:27 +01:00
Akim Demaille	90f0500ef8	todo: update * TODO (Token Number): We have to clean this. (Naming conventions, Symbol numbers): New. (Bad styling): Addressed in `e21ff47f5d`.	2020-03-28 15:13:27 +01:00
Akim Demaille	951da960e6	merge branch 'maint' * upstream/maint: maint: post-release administrivia version 3.5.3 news: update for 3.5.3 yacc.c: make sure we properly propagated the user's number for error diagnostics: don't crash because of repeated definitions of error style: initialize some struct members diagnostics: beware of zero-width characters diagnostics: be sure to close the styling when lines are too short muscles: fix incorrect decoding of $ code: be robust to reference with invalid tags build: fix typo doc: update recommandation for libtextstyle style: comment changes examples: use consistently the GFDL header for readmes style: remove useless declarations typo: succesful -> successful README: point to tests/bison, and document --trace gnulib: update maint: post-release administrivia	2020-03-08 10:13:16 +01:00
Akim Demaille	e3812bb8c3	yacc.c: make sure we properly propagated the user's number for error * data/skeletons/yacc.c (YYERRCODE): Be truthful. * tests/input.at (Redefining the error token): Check that.	2020-03-08 08:10:11 +01:00
Akim Demaille	9cc76ee62c	yacc.c: yyerror_range does not need to be preserved accross calls * data/skeletons/yacc.c (b4_parse_state_variable_macros): Don't define yyerror_range. (yyparse): Add yyerror_range as local variable.	2020-03-05 07:26:49 +01:00
Akim Demaille	744171ddbf	yacc.c: push: initialize the pstate variables in pstate_new Currently pstate_new does not set up its variables, this task is left to yypush_parse. This was probably to share more code with usual pull parsers, where these (local) variables are indeed initialized by yyparse. But as a consequence yyexpected_tokens crashes at the very beginning of the parse, since, for instance, the stacks are not even set up. See https://lists.gnu.org/r/bison-patches/2020-03/msg00001.html. The fix could have very simple, but the documentation actually makes it very clear that we can reuse a pstate for several parses: After yypush_parse returns a status other than YYPUSH_MORE, the parser instance yyps may be reused for a new parse. so we need to restore the parser to its pristine state so that (i) it is ready to run the next parse, (ii) it properly supports yyexpected_tokens for the next run. * data/skeletons/yacc.c (b4_initialize_parser_state_variables): New, extracted from the top of yyparse/yypush_parse. (yypstate_clear): New. (yypstate_new): Use it when push parsers are enabled. Define after the yyps macros so that we can use the same code as the regular pull parsers. (yyparse): Use it when push parsers are _not_ enabled. * examples/c/bistromathic/bistromathic.test: Check the completion on the beginning of the line.	2020-03-05 07:13:23 +01:00
Akim Demaille	4cca30d2e6	m4: decommission function generating macro These macros have been extremely useful when we had to support K&R C, which we dropped long ago. Now, they merely make the code uselessly hard to read. * data/skeletons/c.m4, data/skeletons/glr.c, data/skeletons/glr.cc, * data/skeletons/yacc.c: Stop using b4_function_define.	2020-03-02 06:57:50 +01:00
Akim Demaille	30ba94081c	todo: update	2020-02-19 14:57:17 +01:00
Victor Morales Cayuela	e09a72eeb0	diagnostics: modernize the display of submessages Since Bison 2.7, output was indented four spaces for explanatory statements. For example: input.y:2.7-13: error: %type redeclaration for exp input.y:1.7-11: previous declaration Since the introduction of caret-diagnostics, it became less clear. Remove the indentation and display submessages as in GCC: input.y:2.7-13: error: %type redeclaration for exp 2 \| %type <float> exp \| ^~~~~~~ input.y:1.7-11: note: previous declaration 1 \| %type <int> exp \| ^~~~~ * src/complain.h (SUB_INDENT): Remove. (warnings): Add "note" to the enum. * src/complain.h, src/complain.c (complain_indent): Replace by... (subcomplain): this. Adjust all dependencies. * tests/actions.at, tests/diagnostics.at, tests/glr-regression.at, * tests/input.at, tests/named-refs.at, tests/regression.at: Adjust expectations.	2020-02-15 08:28:40 +01:00
Akim Demaille	11e2b755f0	c++: simplify * data/skeletons/stack.hh (ssize): Remove, same as size.	2020-02-12 00:09:15 +01:00
Akim Demaille	f3d33c3613	tests: check calls to yyerror from the user actions This revealed a number of things I had not realized: - the Java location tracking was aliasing the same pair of positions for all the symbols (see previous commit). - in impure parsers, it's quite easy to use incorrect locations for diagnostics, since yyerror uses yylloc, which is the location of the lookahead, not that of the current lhs. So we need something like { YYLTYPE old_yylloc = yylloc; yylloc = @$; yyerror (]AT_PARAM_IF([result, count, nerrs, ])[buf); yylloc = old_yylloc; } Maybe we should do that little yylloc dance in the skeleton instead of leaving it to the user? It might be costly... But that's only for users of the impure parsers, which are asking for trouble anyway. - in glr.cc invoking yyerror is somewhat cumbersome: the C++ interface is not available as we are in yyparse (which in C), and yyerror is used by glr.cc itself to bind it to the user's parser::error. If we call yyerror, we need: yyerror (]AT_LOCATION_IF([[&@$, ]])[yyparser, ]AT_PARAM_IF([result, count, nerrs, ])[msg); However calling yy::parser::error is easier, once we know that the current parser object is available as 'yyparser'. Which also saves us from having to pass the parse-params ourselves: yyparser.error (]AT_LOCATION_IF([[@$, ]])[msg); * tests/calc.at: Invoke yyerror by hand, instead of using fprintf etc. Adjust expectations.	2020-02-12 00:00:05 +01:00
Akim Demaille	80a4389377	java: provide Context with a more OO interface * data/skeletons/lalr1.java (yyexpectedTokens) (yysyntaxErrorArguments): Make them methods of Context. (Context.yysymbolName): New. * tests/local.at: Adjust.	2020-02-08 16:17:53 +01:00
Akim Demaille	ef097719ea	java: add support for parse.error custom * data/skeletons/lalr1.java: Add support for custom parse errors. (yyntokens_): Make it public. Under... (yyntokens): this name. (Context): Capture the location too. * examples/c/bistromathic/parse.y, * examples/c/bistromathic/bistromathic.test: Improve error message. * examples/java/calc/Calc.test, examples/java/calc/Calc.y: Use custom error messages. * tests/calc.at, tests/local.at: Check custom error messages.	2020-02-08 16:03:50 +01:00
Akim Demaille	2d97fe86fd	java: tests: check location tracking in the calculator Unfortunately in the Java skeleton the user cannot override the way locations are displayed, and locations don't know the structure of the positions. So they cannot implement the tricks used in the C/C++ skeletons to display "1.1" instead of "1.1-1.2". * tests/local.at (Java): Add support for column tracking in the locations, as we did in examples/java/calc. * tests/calc.at: Use AT_CALC_YYLEX.	2020-02-05 13:17:00 +01:00
Akim Demaille	b62e063df5	todo: update	2020-01-26 13:29:19 +01:00
Akim Demaille	8426663631	todo: update	2020-01-11 08:59:51 +01:00
Akim Demaille	c67daa9a97	package: bump copyrights to 2020 Run 'make update-copyright'.	2020-01-10 19:16:23 +01:00
Akim Demaille	8036635251	package: bump copyrights to 2020 Run 'make update-copyright'.	2020-01-05 10:26:35 +01:00
Akim Demaille	ac203e6c3c	todo: update * TODO: Schedule some features for 3.6. Remove obsolete stuff.	2019-12-08 10:12:02 +01:00
Akim Demaille	20107b77c0	doc: clearly deprecate YYPRINT * doc/bison.texi (Prologue): Stop using YYPRINT as an example. (The YYPRINT Macro): Clearly show this macro is deprecated.	2019-12-07 15:29:43 +01:00
Akim Demaille	046f238826	d: obey parse.error * data/skeletons/lalr1.d (yysyntax_error): Let the dispatch be bison-time, not runtime.	2019-12-07 13:23:45 +01:00
Akim Demaille	357336d254	glr.c: obey the parse.assert %define variable * data/skeletons/glr.c (YYASSERT): Rename as... (YY_ASSERT): this, for consistency with yacc.c, and also to emphasize the fact that this is not for the end user (YY_ prefix). * tests/glr-regression.at: Define parse.assert.	2019-12-07 13:23:45 +01:00
Akim Demaille	f8d82ff039	warnings: enable -Wuseless-cast, and eliminate warnings Prompted by Frank Heckenbach. https://lists.gnu.org/archive/html/bug-bison/2019-11/msg00016.html. * configure.ac (warn_cxx): Add -Wuseless-cast. * data/skeletons/c.m4 (b4_attribute_define): Define YY_IGNORE_USELESS_CAST_BEGIN and YY_IGNORE_USELESS_CAST_END. * data/skeletons/glr.c (YY_FPRINTF): New, replaces YYFPRINTF, wrapped with YY_IGNORE_USELESS_CAST_BEGIN and YY_IGNORE_USELESS_CAST_END. (YY_DPRINTF): Likewise. * tests/actions.at: Remove useless cast. * tests/headers.at: Adjust.	2019-12-06 08:27:55 +01:00
Akim Demaille	8c87a62308	c++: get rid of symbol_type::token () It is not used. And its implementation was wrong when api.token.raw was defined, as it was still mapping to the external token numbers, instead of the internal ones. Besides it was provided only when api.token.constructor is defined, yet always declared. * data/skeletons/c++.m4 (by_type::token): Remove, useless.	2019-12-01 10:05:48 +01:00
Akim Demaille	869028a66d	d, java: get rid of a useless table * data/skeletons/lalr1.d, data/skeletons/lalr1.java (yytoken_number_): Remove, useless. Was used in ancient C skeletons to support YYPRINT, long obsoleted by %printer.	2019-12-01 07:38:31 +01:00
Akim Demaille	28f1e1546c	C++: finish propagating the unsigned->signed conversion in locations * data/skeletons/location.cc: Remove the u (for unsigned) suffix from the initial line and column. * NEWS: AFAICT, only C++ backends have their location types changed.	2019-10-29 09:15:25 +01:00
Yuichiro Kaneko	3945beb1d2	style: update comment in reader.c rrhs and rlhs were removed by `b2ed6e5826`. * src/reader.c (packgram): Update comment.	2019-10-23 08:32:06 +02:00
Akim Demaille	b47340982b	TODO: more updates	2019-10-15 08:40:50 +02:00
Akim Demaille	ee35055b49	TODO: update	2019-10-15 07:28:33 +02:00
Paul Eggert	6373b90fc8	Port better to C++ platforms * data/skeletons/yacc.c (YYPTRDIFF_T, YYPTRDIFF_MAXIMUM): Default to long, not int. (yy_lac_stack_realloc, yy_lac, yytnamerr, yyparse): Avoid casts to YYPTRDIFF_T that were masking the problem.	2019-10-06 11:59:16 -07:00
Paul Eggert	beceb2fa93	Work around GCC 4.8 false alarms without casts * data/skeletons/yacc.c (yyparse): Initialize yyes_capacity with a signed expression. * tests/local.at (AT_YYLEX_DEFINE(c)): Use enum to avoid cast.	2019-10-06 11:59:16 -07:00
Akim Demaille	2713e7c4ff	TODO: update I no longer agree with that item, there are indeed two things to report: lack of definition, and being useless. We could have either one without the other, they are not directly related.	2019-10-06 08:07:57 +02:00
Akim Demaille	32e5a91a91	yacc.c: work around warnings from G++ 4.8 input.c: In function 'int yyparse()': input.c: error: conversion to 'long int' from 'long unsigned int' may change the sign of the result [-Werror=sign-conversion] yyes_capacity = sizeof yyesa / sizeof yyes; ^ cc1plus: all warnings being treated as errors data/skeletons/yacc.c: here.	2019-10-06 08:07:40 +02:00
Akim Demaille	3ca713abd0	api.token.raw: document it * doc/bison.texi: here.	2019-09-14 10:09:08 +02:00
Akim Demaille	29c9cb3188	lr0: more debug traces * src/lr0.c (kernel_check): New. (new_itemsets, save_reductions): Add traces.	2019-06-09 11:11:12 +02:00
Akim Demaille	ec4d49e129	traces: add some colors This is an experiment. Maybe more styles will be used (in which case a short-hand function will be useful), maybe it will be just reverted. * data/bison-default.css (.traces0): New. * src/lalr.c (lalr): Use it.	2019-06-09 08:36:01 +02:00
Akim Demaille	57290d63fd	package: various fixes for syntax-check * cfg.mk: Disable checks where needed (e.g., we do want to check the behavior with tabs). (sc_at_parser_check): Remove. Unfortunately since `a11c144609` we no longer use the './' prefix to run programs in the current directory. That was so that we could run Java programs like the other, although they are no run with the `./` prefix (see `967a59d2c0`). As a consequence this sc check no longer makes sense. However, since now AT_PARSER_CHECK passes the `./` prefix itself, this sc-check was superfluous. * examples/c/reccalc/scan.l: Use memcpy, not strncpy. * src/ielr.c, src/reader.c: Obfuscate "lr(0)" so that the sc-check for "space before paren" does not fire. * tests/diagnostics.at: Avoid space-tab, use tab-tab.	2019-04-28 08:24:31 +02:00
Akim Demaille	f5a4e279bc	build: use gettext-h We were using the gnulib's gettext module with tricks in bootstrap.conf to avoid useless files. Instead, use gnulib's gettext-h module. * .travis.yml: Force Gettext 0.18.3 on Trusty. * bootstrap.conf: Use gettext-h instead of gettext. (excluded_files): Remove. * configure.ac (AM_GNU_GETTEXT_VERSION): Bump to 0.19.	2019-04-25 22:09:41 +02:00
Akim Demaille	a4d33cdf48	gnulib: let it use its own PO domain See https://www.gnu.org/software/gnulib/manual/html_node/Localization.html. * bootstrap.conf: Create gnulib-po. * Makefile.am, configure.ac: Use it. * po/POTFILES.in: Remove files now in gnulib. * src/main.c: Open the bison-gnulib domain.	2019-04-23 19:28:08 +02:00
Akim Demaille	deec7ca65c	TODO: update Let's prepare 3.4 with more or less what we have. Schedule some features for 3.5 and 3.6. Remove obsolete stuff.	2019-04-23 18:25:30 +02:00
Akim Demaille	2ab70cf0c6	style: comment changes * src/closure.h, src/closure.c, src/lalr.c: here.	2019-04-12 08:38:30 +02:00
Akim Demaille	40fc688765	examples: add a simple infix calculator in C Currently we have no simple example: rpcalc in reverse Polish, mfcalc has functions, and lexcalc is using lex. * examples/c/calc/Makefile, examples/c/calc/calc.y, * examples/c/calc/calc.test, examples/c/calc/local.mk: New.	2019-02-10 17:44:23 +01:00
Akim Demaille	83463dfbee	style: rename LR0.* as lr0.* Let's stick to lower case for file names. * src/LR0.h, src/LR0.c: Rename as... * src/lr0.h, src/lr0.c: these.	2019-01-26 16:21:35 +01:00
Akim Demaille	bb5e4b659b	NEWS: update	2019-01-26 11:31:01 +01:00
Akim Demaille	7d545fd23f	po: remove bitset/stats.c * po/POTFILES.in: here.	2019-01-12 09:38:57 +01:00
Akim Demaille	f0d7f71a64	NEWS: update	2019-01-12 08:19:46 +01:00

1 2 3 4 5

204 Commits