bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-06-09 09:12:34 +00:00

Author	SHA1	Message	Date
Paul Eggert	133edcd248	Prefer signed to unsigned integers This patch contains more fixes to prefer signed to unsigned integer types, as modern tools like 'gcc -fsanitize=undefined' can check for signed integer overflow but not unsigned overflow. * NEWS: Document the API change. * boostrap.conf (gnulib_modules): Add intprops. * data/skeletons/glr.c: Include stddef.h and stdint.h, since this skeleton can assume C99 or later. (YYSIZEMAX): Now signed, and the minimum of SIZE_MAX and PTRDIFF_MAX. (yybool) [!__cplusplus]: Now signed (which is how bool behaves). (YYTRANSLATE): Avoid use of unsigned, and make the macro safe even for values greater than UINT_MAX. (yytnamerr, struct yyGLRState, struct yyGLRStateSet, struct yyGLRStack) (yyaddDeferredAction, yyinitStateSet, yyinitGLRStack) (yyexpandGLRStack, yymarkStackDeleted, yyremoveDeletes) (yyglrShift, yyglrShiftDefer, yy_reduce_print, yydoAction) (yyglrReduce, yysplitStack, yyreportTree, yycompressStack) (yyprocessOneStack, yyreportSyntaxError, yyrecoverSyntaxError) (yyparse, yy_yypstack, yypstack, yypdumpstack): * tests/input.at (Torturing the Scanner): Prefer ptrdiff_t to size_t. * data/skeletons/c++.m4 (b4_yytranslate_define): * src/AnnotationList.c (AnnotationList__computePredecessorAnnotations): * src/AnnotationList.h (AnnotationIndex): * src/InadequacyList.h (InadequacyListNodeCount): * src/closure.c (closure_new): * src/complain.c (error_message, complains, complain_indent) (complain_args, duplicate_directive, duplicate_rule_directive): * src/gram.c (nritems, ritem_print, grammar_dump): * src/ielr.c (ielr_compute_ritem_sees_lookahead_set) (ielr_item_has_lookahead, ielr_compute_annotation_lists) (ielr_compute_lookaheads): * src/location.c (columns, boundary_print, location_print): * src/muscle-tab.c (muscle_percent_define_insert) (muscle_percent_define_check_values): * src/output.c (prepare_rules, prepare_actions): * src/parse-gram.y (id, handle_require): * src/reader.c (record_merge_function_type, packgram): * src/reduce.c (nuseless_productions, nuseless_nonterminals) (inaccessable_symbols): * src/relation.c (relation_print): * src/scan-code.l (variant, variant_table_size, variant_count) (variant_add, get_at_spec, show_sub_message, show_sub_messages) (parse_ref): * src/scan-gram.l (<SC_ESCAPED_STRING,SC_ESCAPED_CHARACTER>) (scan_integer, convert_ucn_to_byte, handle_syncline): * src/scan-skel.l (at_complain): * src/symtab.c (complain_symbol_redeclared) (complain_semantic_type_redeclared, complain_class_redeclared) (symbol_class_set, complain_user_token_number_redeclared): * src/tables.c (conflict_tos, conflrow, conflict_table) (conflict_list, save_row, pack_vector): * tests/local.at (AT_YYLEX_DEFINE(c)): Prefer signed to unsigned integer. * data/skeletons/lalr1.cc (yy_lac_check_): * tests/actions.at (_AT_CHECK_PRINTER_AND_DESTRUCTOR): * tests/local.at (AT_YYLEX_DEFINE(c)): Omit now-unnecessary casts. * data/skeletons/location.cc (b4_location_define): * doc/bison.texi (Mfcalc Lexer, C++ position, C++ location): Prefer int to unsigned for line and column numbers. Change example to abort explicitly on memory exhaustion, and fix an off-by-one bug that led to undefined behavior. * data/skeletons/stack.hh (stack::operator[]): Also allow ptrdiff_t indexes. (stack::pop, slice::slice, slice::operator[]): Index arg is now ptrdiff_t, not int. (stack::ssize): New method. (slice::range_): Now ptrdiff_t, not int. * data/skeletons/yacc.c (b4_state_num_type): Remove. All uses replaced by b4_int_type. (YY_CONVERT_INT_BEGIN, YY_CONVERT_INT_END): New macros. (yylac, yyparse): Use them around conversions that -Wconversion would give false alarms about. Omit unnecessary casts. (yy_stack_print): Use int rather than unsigned, and omit a cast that doesn’t seem to be needed here any more. * examples/c++/variant.yy (yylex): * examples/c++/variant-11.yy (yylex): Omit no-longer-needed conversions to unsigned. * src/InadequacyList.c (InadequacyList__new_conflict): Don’t assume node_count is unsigned. src/output.c (muscle_insert_unsigned_table): Remove; no longer used.	2019-10-02 17:11:33 -07:00
Paul Eggert	4d9ff272cf	Prefer signed types for indexes in skeletons * NEWS: Mention this. * data/skeletons/c.m4 (b4_int_type): Prefer char if it will do, and prefer signed types to unsigned if either will do. * data/skeletons/glr.c (yy_reduce_print): No need to convert rule line to unsigned long. (yyrecoverSyntaxError): Put action into an int to avoid GCC warning of using a char subscript. * data/skeletons/lalr1.cc (yy_lac_check_, yysyntax_error_): Prefer ptrdiff_t to size_t. * data/skeletons/yacc.c (b4_int_type): Prefer signed types to unsigned if either will do. * data/skeletons/yacc.c (b4_declare_parser_state_variables): (YYSTACK_RELOCATE, YYCOPY, yy_lac_stack_realloc, yy_lac) (yytnamerr, yysyntax_error, yyparse): Prefer ptrdiff_t to size_t. (YYPTRDIFF_T, YYPTRDIFF_MAXIMUM): New macros. (YYSIZE_T): Fix "! defined YYSIZE_T" typo. (YYSIZE_MAXIMUM): Take the minimum of PTRDIFF_MAX and SIZE_MAX. (YYSIZEOF): New macro. (YYSTACK_GAP_MAXIMUM, YYSTACK_BYTES, YYSTACK_RELOCATE) (yy_lac_stack_realloc, yyparse): Use it. (YYCOPY, yy_lac_stack_realloc): Cast to YYSIZE_T to pacify GCC. (yy_reduce_print): Use int instead of unsigned long when int will do. (yy_lac_stack_realloc): Prefer long to unsigned long when either will do. * tests/regression.at: Adjust to these changes.	2019-10-02 07:10:03 +02:00
Akim Demaille	2ca6b71967	yacc: use the most appropriate integral type for state numbers Currently we properly use the "best" integral type for tables, including those storing state numbers. However the variables for state numbers used in yyparse (and its dependencies such as yy_stack_print) still use int16_t invariably. As a consequence, very large models overflow these variables. Let's use the "best" type for these variables too. It turns out that we can still use 16 bits for twice larger automata: stick to unsigned types. However using 'unsigned' when 16 bits are not enough is troublesome and generates tons of warnings about signedness issues. Instead, let's use 'int'. Reported by Tom Kramer. https://lists.gnu.org/archive/html/bug-bison/2019-09/msg00018.html * data/skeletons/yacc.c (b4_state_num_type): New. (yy_state_num): Be computed from YYNSTATES. * tests/linear: New. * tests/torture.at (State number type): New. Use it.	2019-09-30 18:31:55 +02:00
Akim Demaille	871c02b327	yacc: introduce a type for states * data/skeletons/yacc.c (yy_state_num): New. Use it for arrays of states.	2019-09-30 07:26:17 +02:00
Akim Demaille	a57e74a5bf	style: prefer symbolic values rather than litterals Instead of #define YYPACT_NINF -130 #define yypact_value_is_default(Yystate) \ (!!((Yystate) == (-130))) generate #define YYPACT_NINF (-130) #define yypact_value_is_default(Yyn) \ ((Yyn) == YYPACT_NINF) * data/skeletons/c.m4 (b4_table_value_equals): Add support for $4. * data/skeletons/glr.c, data/skeletons/yacc.c: Use it. Also, use shorter macro argument names, the name of the macro is clear enough.	2019-09-30 07:25:56 +02:00
Akim Demaille	4971409e39	style: change misleading macro argument name * data/skeletons/glr.c, data/skeletons/yacc.c (yypact_value_is_default): It does not take a rule number as argument.	2019-09-30 07:25:48 +02:00
Akim Demaille	b772baef24	Merge remote-tracking branch 'upstream/maint' * upstream/maint: c++: add copy ctors for compatibility with the IAR compiler CI: show git status CI: disable ICC tests: pass -jN from Make to the test suite quotearg: avoid leaks maint: post-release administrivia	2019-09-28 08:09:33 +02:00
Akim Demaille	406e8c7c02	c++: add copy ctors for compatibility with the IAR compiler Reported by Andreas Damm. https://savannah.gnu.org/support/?110032 * data/skeletons/lalr1.cc (stack_symbol_type::operator=): New overload, const, to please the IAR C++ compiler (version ca 2013).	2019-09-27 08:40:52 +02:00
Akim Demaille	97c4169f23	CI: show git status	2019-09-23 06:06:35 +02:00
Akim Demaille	8add45dbd9	CI: disable ICC It seems that Intel changed something in their license management. https://github.com/nemequ/icc-travis/issues/15	2019-09-23 06:06:26 +02:00
Akim Demaille	b3e9c20227	tests: pass -jN from Make to the test suite I am sooooo tired of typing "make -j5 TESTSUITEFLAGS=-j5"... Should have done this years ago. * cfg.mk (TESTSUITEFLAGS): here.	2019-09-23 06:05:39 +02:00
Akim Demaille	b2c381cd25	quotearg: avoid leaks Reported by Tomasz Kłoczko. https://lists.gnu.org/archive/html/bug-bison/2019-09/msg00008.html * src/main.c (main): Free quotearg's memory later.	2019-09-22 18:15:36 +02:00
Akim Demaille	67bff62e31	diagnostics: get the screen width from the terminal * bootstrap.conf: We need winsz-ioctl and winsz-termios. * src/location.c (columns): Use winsize to get the number of columns. Code taken from the GNU Coreutils. * src/location.h, src/location.c (caret_init): New. * src/complain.c (complain_init): Call it. * tests/bison.in: Export COLUMNS so that users of tests/bison can enjoy proper line truncation.	2019-09-22 09:12:08 +02:00
Akim Demaille	5f45cb05f1	diagnostics: don't print ellipsis on the caret line From 9 \| ...TUVWXYZ ABCDEFGHIJKLMNOPQRSTUVWXYZ ABCDEFGHIJKL \| ... ^~~~~~~~~~~~~~~~~~~~~~~~~~ to 9 \| ...TUVWXYZ ABCDEFGHIJKLMNOPQRSTUVWXYZ ABCDEFGHI... \| ^~~~~~~~~~~~~~~~~~~~~~~~~~ * src/location.c (location_caret): here. * tests/diagnostics.at: Adjust expectations.	2019-09-22 09:12:08 +02:00
Akim Demaille	b61b0eb9ac	diagnostics: also show truncation at the end of line with "..." From 9 \| ...TUVWXYZ ABCDEFGHIJKLMNOPQRSTUVWXYZ ABCDEFGHIJKL \| ... ^~~~~~~~~~~~~~~~~~~~~~~~~~ to 9 \| ...TUVWXYZ ABCDEFGHIJKLMNOPQRSTUVWXYZ ABCDEFGHI... \| ... ^~~~~~~~~~~~~~~~~~~~~~~~~~ * src/location.c (location_caret): here. * tests/diagnostics.at: Adjust expectations.	2019-09-22 09:12:08 +02:00
Akim Demaille	69277e109a	diagnostics: check that quoted lines are truncated * tests/diagnostics.at (Screen width: 60 columns, Screen width: 80 columns, Screen width: 200 columns): New tests.	2019-09-22 09:12:08 +02:00
Akim Demaille	f716484627	diagnostics: truncate quoted sources to fit the screen * src/location.c (min_int, columns): New. (location_caret): Compute the line width. Based on it, compute how many columns must be skipped before the quoted location and truncated after, to fit the sceen width. * tests/local.at (AT_QUELL_VALGRIND): Transform into... (AT_SET_ENV_IF, AT_SET_ENV): these. Define COLUMNS to protect the test suite from the user's environment.	2019-09-22 09:12:08 +02:00
Akim Demaille	945b917da2	diagnostics: learn how to count column number with multibyte chars So far diagnostics were cheating: in addition to the 'column' field of locations (based on actual screen width per multibyte characters and on tabulation expansion), the scanner sets the 'byte' field. Diagnostics used this byte count to decide where to insert (color) style. We want to be able to truncate the quoted lines when there are too wide to fit the screen. This requires that the diagnostics learn how to count columns, the byte-in-boundary trick no longer works. Bytes are still used for fix-its. * bootstrap.conf: We need mbfile for mbf_getc. * src/location.c (caret_info): We need an mbfile. (caret_set_file): Initialize it. (caret_getc): Convert to mbfile. (location_caret): Instead of relying on the byte position to decide where to insert the color style, count the current column using boundary_compute.	2019-09-22 09:12:08 +02:00
Akim Demaille	1ef407d923	diagnostics: style: rename member for clariy * src/location.c (caret_info): Now that we no longer have a 'file' member (see previous commit), rename 'source' as 'file'.	2019-09-22 09:12:08 +02:00
Akim Demaille	576b863e91	diagnostics: style: use a boundary to track the caret_info * src/location.c (caret_info): Replace file and line with pos, a boundary. This will allow us to use features of the boundary type, such as boundary_compute.	2019-09-22 09:12:08 +02:00
Akim Demaille	2274c34e91	diagnostics: extract boundary_compute from location_compute The handling of the contributions of the tabulations in the columns is burried inside location_compute. We will soon be willing to use the boundary part of the computation (to compute the current column number each time we read a multibyte char). * src/location.c (boundary_compute): New, extracted from... (location_compute): here.	2019-09-22 09:12:08 +02:00
Akim Demaille	fccab9bc40	diagnostics: style: add caret_set_file To make the following commits easier to read. * src/location.c (caret_set_file): New.	2019-09-22 09:12:08 +02:00
Akim Demaille	488607534a	diagnostics: style: minor changes * src/location.c (location_caret): Factor two branches of an if.	2019-09-22 09:12:08 +02:00
Akim Demaille	4db572dd21	CI: show git status	2019-09-22 09:12:08 +02:00
Akim Demaille	8faf075fd7	git: update ignores	2019-09-22 09:12:08 +02:00
Akim Demaille	453639dfac	git: update ignores	2019-09-22 07:48:10 +02:00
Akim Demaille	4901ee115b	quotearg: avoid leaks Reported by Tomasz Kłoczko. https://lists.gnu.org/archive/html/bug-bison/2019-09/msg00008.html * src/main.c (main): Free quotearg's memory later.	2019-09-21 15:01:45 +02:00
Akim Demaille	6c7b2dfe51	tests: pass -jN from Make to the test suite I am sooooo tired of typing "make -j5 TESTSUITEFLAGS=-j5"... Should have done this years ago. * cfg.mk (TESTSUITEFLAGS): here.	2019-09-14 10:19:13 +02:00
Akim Demaille	a3e201de02	java: handle eof in yytranslate * data/skeletons/lalr1.java (yytranslate_): Handle eof here, as is done in lalr1.cc. * tests/javapush.at: Adjust.	2019-09-14 10:09:08 +02:00
Akim Demaille	5e95bb6251	d: handle eof in yytranslate This changes the traces from Reading a token: Now at end of input. to Reading a token: Next token is token $end (7FFEE56E6474) which is ok. Actually it is even better, as it gives the location when locations are enabled, and is clearer when rules explicitly use the EOF token. * data/skeletons/lalr1.d (yytranslate_): Handle eof here, as is done in lalr1.cc.	2019-09-14 10:09:08 +02:00
Akim Demaille	569125a6bf	regen	2019-09-14 10:09:08 +02:00
Akim Demaille	8ac28ba1f0	parser: use api.token.raw * src/parse-gram.y: Here.	2019-09-14 10:09:08 +02:00
Akim Demaille	3ca713abd0	api.token.raw: document it * doc/bison.texi: here.	2019-09-14 10:09:08 +02:00
Akim Demaille	8c18e3f18c	api.token.raw: cannot be used with character literals * src/parse-gram.y (CHAR): api.token.raw and character literals are mutually exclusive. * tests/input.at (Character literals and api.token.raw): New.	2019-09-14 10:09:08 +02:00
Akim Demaille	1e5e274972	api.token.raw: apply to the other skeletons * data/skeletons/c++.m4, data/skeletons/glr.c, * data/skeletons/lalr1.c, data/skeletons/lalr1.java: Add support for api.token.raw. * tests/scanner.at: Check them.	2019-09-14 09:55:17 +02:00
Akim Demaille	b1679f8346	api.token.raw: check it * tests/local.at (AT_TOKEN_RAW_IF): New. * tests/local.mk: New. Use it.	2019-09-14 09:55:17 +02:00
Akim Demaille	9861bcc540	api.token.raw: implement Bison used to feature %raw, documented as follows: @item %raw The output file @file{@var{name}.h} normally defines the tokens with Yacc-compatible token numbers. If this option is specified, the internal Bison numbers are used instead. (Yacc-compatible numbers start at 257 except for single character tokens; Bison assigns token numbers sequentially for all tokens starting at 3.) Unfortunately, as far as I can tell, it never worked: token numbers are indeed changed in the generated tables (from external token number to internal), yet the code was still applying the mapping from external token numbers to internal token numbers. This commit reintroduces the feature as it was expected to be. * data/skeletons/bison.m4 (b4_token_format): When api.token.raw is enabled, use the internal token number. * data/skeletons/yacc.c (yytranslate): Don't emit if api.token.raw is enabled. (YYTRANSLATE): Adjust.	2019-09-14 09:55:17 +02:00
Akim Demaille	d94d83e10b	style: tidy yacc.c * data/skeletons/yacc.c: Include 'c.m4' first. Then sort the handling of %define variables. * tests/input.at: Adjust.	2019-09-14 09:55:17 +02:00
Akim Demaille	2f6e377953	CI: disable ICC It seems that Intel changed something in their license management. https://github.com/nemequ/icc-travis/issues/15	2019-09-14 09:55:17 +02:00
Akim Demaille	32dff87c1d	diagnostics: fix use of complain_indent * src/symtab.c (symbol_class_set): Here. * tests/diagnostics.at, tests/input.at, tests/regression.at: Adjust expectations.	2019-09-14 09:47:49 +02:00
Akim Demaille	19da501e06	input: stop treating lone CRs as end-of-lines We used to treat lone CRs (\r, aka ^M) as regular NLs (\n), probably to please Classic MacOS. As of today, it makes more sense to treat \r like a plain white space character. https://lists.gnu.org/archive/html/bison-patches/2019-09/msg00027.html * src/scan-gram.l (no_cr_read): Remove. Instead, use... (eol): this new abbreviation denoting end-of-line. * src/location.c (caret_getc): New. (location_caret): Use it. * tests/diagnostics.at (Carriage return): Adjust expectations. (CR NL): New.	2019-09-14 09:23:47 +02:00
Akim Demaille	5e4133175d	Merge tag 'v3.4.2' into HEAD bison 3.4.2 * tag 'v3.4.2': (24 commits) version 3.4.2 CI: always uninstall icc news: more bug fixes thanks to Marc Schönefeld diagnostics: beware of unexpected EOF when quoting the source file gnulib: update build: fix distcheck tests: add noexcept to please GCC 9 news: update fix: don't die when EOF token is defined twice tests: check token redeclaration yacc.c: beware of GCC's -Wmaybe-uninitialized glr.c: initialize vector of bools gnulib: update check for memory exhaustion diagnostics: avoid global variables diagnostics: fix invalid error message indentation git: ignore files generated in gnulib-po c++: avoid duplicate definition of YYUSE gnulib: update CI: more compilers ...	2019-09-12 19:12:24 +02:00
Akim Demaille	0b093ac4d9	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2019-09-12 18:09:26 +02:00
Akim Demaille	69b22b49d4	version 3.4.2 * NEWS: Record release date. v3.4.2	2019-09-12 17:41:12 +02:00
Akim Demaille	ec11f08fb3	CI: always uninstall icc	2019-09-12 13:16:30 +02:00
Akim Demaille	67444a6f0d	news: more bug fixes thanks to Marc Schönefeld	2019-09-12 09:07:48 +02:00
Akim Demaille	4eed3a0f0c	diagnostics: beware of unexpected EOF when quoting the source file When the input file contains lone CRs (aka, ^M, \r), the locations see a new line. Diagnostics look only at \n as end-of-line, so sometimes there is an offset in diagnostics. Worse yet: sometimes we loop endlessly waiting for \n to come from a continuous stream of EOF. Fix that: - check for EOF - beware not to call end_use_class if begin_use_class was not called (which would abort). This could happen if the actual line is shorter that the expected one. Prompted by a (private) report from Marc Schönefeld. * src/location.c (location_caret): here. * tests/diagnostics.at (Carriage return): New.	2019-09-12 07:02:46 +02:00
Akim Demaille	84a6621c78	gnulib: update Contains the creation of the xhash module. https://lists.gnu.org/archive/html/bug-gnulib/2019-09/msg00046.html * src/muscle-tab.c, src/state.c, src/symtab.c, src/uniqstr.c: Use hash_xinitialize.	2019-09-11 09:07:27 +02:00
Akim Demaille	06a273625b	build: fix distcheck * configure.ac (gl_LIBOBJS): Adjust so that the generated files are indeed the expected ones.	2019-09-11 08:27:27 +02:00
Akim Demaille	d120a07e6b	diagnostics: beware of unexpected EOF when quoting the source file When the input file contains lone CRs (aka, ^M, \r), the locations see a new line. Diagnostics look only at \n as end-of-line, so sometimes there is an offset in diagnostics. Worse yet: sometimes we loop endlessly waiting for \n to come from a continuous stream of EOF. Fix that: - check for EOF - beware not to call end_use_class if begin_use_class was not called (which would abort). This could happen if the actual line is shorter that the expected one. Prompted by a (private) report from Marc Schönefeld. * src/location.c (location_caret): here. * tests/diagnostics.at (Carriage return): New.	2019-09-10 19:15:18 +02:00

... 3 4 5 6 7 ...

6726 Commits