bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-03-09 12:23:04 +00:00

Author	SHA1	Message	Date
Akim Demaille	fe1b448ada	Instead of using make_symbol<TOK_FOO>, generate make_FOO for each token type. Using template buys us nothing, and makes it uselessly complex to construct a symbol. Besides, it could not be generalized to other languages, while make_FOO would work in C/Java etc. * data/lalr1.cc (b4_symbol_): New. (b4_symbol): Use it. (b4_symbol_constructor_declaration_) (b4_symbol_constructor_definition_): Instead of generating specializations of an overloaded template function, just generate several functions whose names are forged from the token names without the token.prefix. (b4_symbol_constructor_declarations): Generate them for all the symbols, not just by class of symbol type, now that instead of specializing a function template by the token, we generate a function named after the token. (b4_symbol_constructor_specialization_) (b4_symbol_constructor_specializations): Remove. * etc/bench.pl.in: Adjust to this new API.	2008-11-15 10:20:02 +01:00
Akim Demaille	5679f31101	%define token.prefix. Provide a means to add a prefix to the name of the tokens as output in the generated files. Because of name clashes, it is good to have such a prefix such as TOK_ that protects from names such as EOF, FILE etc. But it clutters the grammar itself. * data/bison.m4 (token.prefix): Empty by default. * data/c.m4 (b4_token_enum, b4_token_define): Use it. * data/lalr1.cc (b4_symbol): Ditto.	2008-11-13 07:08:24 +01:00
Akim Demaille	3204049e31	Compute at M4 time some of the subtractions. * data/lalr1.cc (b4_substract): New. (b4_rhs_data): Use it.	2008-11-13 07:04:47 +01:00
Akim Demaille	202598d3ab	symbol::token. This allows the user to get the type of a token returned by yylex. * data/lalr1.cc (symbol::token): New. (yytoknum_): Define when %define lex_symbol, independently of %debug. (yytoken_number_): Move into... (symbol::token): here, since that's the only use. The other one is YYPRINT which was not officially supported by lalr1.cc, and anyway it did not work since YYPRINT uses this array under a different name (yytoknum).	2008-11-13 07:01:41 +01:00
Akim Demaille	cb0b136a63	Comment changes. * data/lalr1.cc, data/yacc.c: Fix the description of the yytranslate and yytoknum tables.	2008-11-13 06:52:05 +01:00
Akim Demaille	2c086d2959	Define make_symbol in the header. To reach good performance these functions should be inlined (though this remains to be measured precisely). To this end they must be available to the caller. * data/lalr1.cc (b4_symbol_constructor_definition_): Qualify location_type with the class name. Since it will now be output in the header, declare it "inline". No longer use b4_symbol_constructor_specializations, but b4_symbol_constructor_definitions in the header. Don't call it in the *.cc file.	2008-11-13 06:48:22 +01:00
Akim Demaille	1c4af3813e	Define yytranslate in the header for lex_symbol. * data/lalr1.cc: Move the invocation of b4_yytranslate_definition into the header file when using %define lex_symbol. (yytranslate_): Declare inline.	2008-11-13 06:44:50 +01:00
Akim Demaille	e51b0a82be	Define the constructors of symbol_type in b4_symbol_constructor_definitions. The constructors are called by the make_symbol functions, which a forthcoming patch will move elsewhere. Hence the interest of putting them together. The stack_symbol_type does not need to be moved, it is used only by the parser. * data/lalr1.cc: Move symbol_type and symbol_base_type constructors into... (b4_symbol_constructor_definitions): here. Adjust.	2008-11-13 06:41:42 +01:00
Akim Demaille	788355718f	Make it easier to move the definition of yytranslate_. Forthcoming changes will make it possible to use yytranslate_ from outside the parser implementation file. * data/lalr1.cc (b4_yytranslate_definition): New. Use it.	2008-11-13 06:36:51 +01:00
Akim Demaille	c1e6c88ca3	Remove useless class specification. * data/lalr1.cc (b4_symbol_constructor_specialization_): No need to refer to the class name to use a type defined by the class for arguments of member functions.	2008-11-13 06:33:51 +01:00
Akim Demaille	4654b0c0a8	Finer input type for yytranslate. This patch is debatable: the tradition expects yylex to return an int which happens to correspond to token_number (which is an enum). This allows, for instance, returning characters (such as '' etc.). But this goes against the stronger typing I am trying to have with the new lex interface, which returns a symbol_type. So in this case, feed yytranslate_ with a token_type. * data/lalr1.cc (yytranslate_): When in %define lex-symbol, expect a token_type.	2008-11-13 06:30:35 +01:00
Akim Demaille	dd735e4ee6	Honor lex-params in %define lex_symbol mode. * data/lalr1.cc: Use b4_lex_param.	2008-11-13 06:27:15 +01:00
Akim Demaille	6659366cda	Simplify names. * src/output.c (symbol_definitions_output): Rename symbol attributes type_name and has_type_name as type and has_type. * data/lalr1.cc: Adjust uses.	2008-11-13 06:24:01 +01:00
Akim Demaille	e9805e5743	Use b4_type_names for the union type. The union used to compute the size of the variant used to iterate over the type of all the symbols, with a lot of redundancy. Now iterate over the lists of symbols having the same type-name. * data/lalr1.cc (b4_char_sizeof_): New. (b4_char_sizeof): Use it. Adjust to be called with a list of numbers instead of a single number. Adjust its caller for new-line issues.	2008-11-13 06:20:59 +01:00
Akim Demaille	aea10ef46f	Define the "identifier" of a symbol. Symbols may have several string representations, for instance if they have an alias. What I call its "id" is a string that can be used as an identifier. May not exist. Currently the symbols which have the "tag_is_id" flag set are those that don't have an alias. Look harder for the id. * src/output.c (is_identifier): Move to... * src/symtab.c (is_identifier): here. * src/symtab.h, src/symtab.c (symbol_id_get): New. * src/output.c (symbol_definitions_output): Use it to define "id" and "has_id". Remove the definition of "tag_is_id". * data/lalr1.cc: Use the "id" and "has_id" wherever "tag" and "tag_is_id" were used to produce code. We still use "tag" for documentation.	2008-11-13 06:17:09 +01:00
Akim Demaille	2ea7730c56	Locations are no longer required by lalr1.cc. * data/lalr1.cc (_b4_args, b4_args): New. Adjust all uses of locations to make them optional. * tests/c++.at (AT_CHECK_VARIANTS): No longer use the locations. (AT_CHECK_NAMESPACE): Check the use of locations. * tests/calc.at (_AT_DATA_CALC_Y): Adjust to be usable with or without locations with lalr1.cc. Test these cases. * tests/output.at: Check lalr1.cc with and without location support. * tests/regression.at (_AT_DATA_EXPECT2_Y, _AT_DATA_DANCER_Y): Don't use locations.	2008-11-11 16:38:10 +01:00
Akim Demaille	c944f7f22d	Simplify lalr1.cc since %defines is mandatory. * data/lalr1.cc: Remove useless calls to b4_defines_if.	2008-11-11 16:05:29 +01:00
Akim Demaille	422c18f48d	Prefer M4 to CPP. * data/lalr1.cc: Use b4_error_verbose_if instead of #if YYERROR_VERBOSE.	2008-11-11 15:59:05 +01:00
Akim Demaille	a0ffc1751e	Support i18n of the parse error messages. * TODO (lalr1.cc/I18n): Remove. * data/lalr1.cc (yysyntax_error_): Support the translation of the error messages, as done in yacc.c. Stay within the yy* pseudo namespace.	2008-11-11 15:55:54 +01:00
Akim Demaille	0927787504	Make it possible to return a symbol_type from yylex. * data/lalr1.cc (b4_lex_symbol_if): New. (parse): When lex_symbol is defined, expect yylex to return the complete lookahead. * etc/bench.pl.in (generate_grammar_list): Extend to support this yylex interface. (bench_variant_parser): Exercise it.	2008-11-11 15:48:52 +01:00
Akim Demaille	39be90223b	Replace yychar with a Boolean. * data/lalr1.cc (parse::yychar): Replace by... (parse::yyempty): this.	2008-11-11 15:36:23 +01:00
Akim Demaille	aba12ad162	Let yytranslate handle the eof case. * data/lalr1.cc (yytranslate_): Handle the EOF case. Adjust callers. No longer expect yychar to be equal to yyeof_, rather, test the lookahead's (translated) kind.	2008-11-11 15:29:39 +01:00
Akim Demaille	27cb5b5901	yychar cannot be empty in yyerrlab. * TODO (yychar == yyempty_): New. * data/lalr1.cc: Remove the handling of this case. This eases forthcoming changes related to yychar and yytranslate.	2008-11-11 15:26:17 +01:00
Akim Demaille	2873fdf8b1	Introduce make_symbol. make_symbol provides a means to construct a full symbol (kind, value, location) in a single shot. It is meant to be a Symbol constructor, parameterized by the symbol kind so that overloading would prevent incorrect kind/value pairs. Unfortunately parameterized constructors do not work well in C++ (unless the parameter also appears as an argument, which is not acceptable), hence the use of a function instead of a constructor. * data/lalr1.cc (b4_symbol_constructor_declaration_) (b4_symbol_constructor_declarations) (b4_symbol_constructor_specialization_) (b4_symbol_constructor_specializations) (b4_symbol_constructor_definition_) (b4_symbol_constructor_definitions): New. Use them where appropriate to generate declaration, declaration of the specializations, and implementations of the templated overloaded function "make_symbol". (variant::variant): Always define a default ctor. Also provide a copy ctor. (symbol_base_type, symbol_type): New ctor overloads for value-less symbols. (symbol_type): Now public, so that functions such as yylex can use it.	2008-11-11 15:16:53 +01:00
Akim Demaille	247efe346c	Formatting changes.	2008-11-10 12:01:19 +01:00
Akim Demaille	5d73144067	More information about the symbols. * src/output.c (type_names_output): Document all the symbols, including those that don't have a type-name. (symbol_definitions_output): Define "is_token" and "has_type_name". * data/lalr1.cc (b4_type_action_): Skip symbols that have an empty type-name, now that they are defined too in b4_type_names.	2008-11-10 11:58:01 +01:00
Akim Demaille	6ed15cde29	Make parser::yytranslate static. Small speedup (1%) on the list grammar. It also makes yytranslate_ available in non-member functions. * data/lalr1.cc (yytranslate_): Does not need to be an instance function.	2008-11-10 11:50:57 +01:00
Akim Demaille	30bb2edccf	Avoid trailing spaces. * data/c.m4: b4_comment(TEXT): Don't indent empty lines. * data/lalr1.cc: Don't indent before rule and symbol actions, as they can be empty, and anyway this incorrectly indents the first action.	2008-11-10 11:47:49 +01:00
Akim Demaille	914202bdac	Use "enum" for integral constants. This is just nicer to read, I observed no speedup. * data/lalr1.cc (yyeof_, yylast_, yynnts_, yyempty_, yyfinal_) (yterror_, yyerrcode_, yyntokens_): Define as members of an enum. (yyuser_token_number_max_, yyundef_token_): Move into... (yytranslate_): here.	2008-11-10 11:41:00 +01:00
Akim Demaille	b9855ea55b	Formatting changes. * data/lalr1.cc: here.	2008-11-10 11:32:12 +01:00
Akim Demaille	4c3cc7da5d	Classify symbols by type-name. * src/uniqstr.h (UNIQSTR_CMP): New. * src/output.c (symbol_type_name_cmp, symbols_by_type_name) (type_names_output): New. (muscles_output): Use it. * data/lalr1.cc (b4_symbol_action_): Remove. (b4_symbol_case_, b4_type_action_): New. Adjust uses of b4_symbol_action_ to use b4_type_action_.	2008-11-10 11:25:36 +01:00
Akim Demaille	d69c9694a7	Change the handling of the symbols in the skeletons. Before we were using tables whose lines were the symbols and whose columns were things like number, tag, type-name etc. It was difficult to extend: each time a column was added, all the numbers had to be updated (you asked for column $2, not for "tag"). Also, it was hard to filter these tables when only a subset of the symbols (say the tokens, or the nterms, or the tokens that have an external number and a type-name) was of interest. Now instead of monolithic tables, we define one macro per cell. For instance "b4_symbol(0, tag)" is a macro name whose contents are self-descriptive. The macro "b4_symbol" provides easier access to these cells. * src/output.c (type_names_output): Remove. (symbol_numbers_output, symbol_definitions_output): New. (muscles_output): Call them. (prepare_symbols): Define b4_symbols_number.	2008-11-10 11:21:50 +01:00
Akim Demaille	e5eb92e794	Support constructor with an argument. This improves the "list" bench by 2%. * data/lalr1.cc (variant::build): Add an overloaded version with an argument. * tests/c++.at (AT_CHECK_VARIANT): Check it.	2008-11-10 11:04:31 +01:00
Akim Demaille	5de9c59301	Use a static hierarchy for symbols in the C++ parser. * data/lalr1.cc (symbol_base_type, symbol_type) (stack_symbol_type): Make it a static hierarchy. Adjust dependencies.	2008-11-09 19:57:30 +01:00
Akim Demaille	d3be4f6d42	Use inline for small operations. * data/lalr1.cc (symbol_base_type, symbol_type) (stack_symbol_type): Declare constructor and other operations as inline. (yy_destroy_): Inline.	2008-11-09 19:51:28 +01:00
Akim Demaille	1f7d007bf6	Introduce a hierarchy for symbols. * data/lalr1.cc (symbol_base_type, symbol_type): New. (data_type): Rename as... (stack_symbol_type): this. Derive from symbol_base_type. (yy_symbol_value_print_): Merge into... (yy_symbol_print_): this. Rename as... (yy_print_): this. (yydestruct_): Rename as... (yy_destroy_): this. (b4_symbols_actions, YY_SYMBOL_PRINT): Adjust. (parser::parse): yyla is now of symbol_type. Use its type member instead of yytoken.	2008-11-09 19:48:20 +01:00
Akim Demaille	bc0b0477e2	Rename data_type and stack_symbol_type. * data/lalr1.cc (data_type): Rename as... (stack_symbol_type): this.	2008-11-09 19:45:14 +01:00
Akim Demaille	57295d14f9	Handle semantic value and location together. * data/lalr1.cc (b4_symbol_actions): Bounce $$ and @$ to yydata.value and yydata.location. (yy_symbol_value_print_, yy_symbol_print_, yydestruct_) (YY_SYMBOL_PRINT): Now take semantic value and location as a single arg. Adjust all callers. (yydestruct_): New overload for a stack symbol.	2008-11-09 19:42:08 +01:00
Akim Demaille	e9b0834e18	Push a complete symbol, not connected parts. * data/lalr1.cc (yypush_): Take a data_type&, not disconnected state, value and location. Adjust callers.	2008-11-09 19:39:09 +01:00
Akim Demaille	6082531abb	Aggregate yylval and yylloc. * data/lalr1.cc (parser::yylval, parser::yylloc): Replace by... (parser::yyla): this.	2008-11-09 19:36:04 +01:00
Akim Demaille	33c195cc37	Rely on the state stack to display reduction traces. To display rhs symbols before a reduction, we used information about the rule reduced, which required the tables yyrhs and yyprhs. Now rely only on the state stack to get the same information. * data/lalr1.cc (b4_rhs_data, b4_rhs_state): New. Use them. (parser::yyrhs_, parser::yyprhs_): Remove. (parser::yy_reduce_print_): Use the state stack.	2008-11-09 19:33:04 +01:00
Akim Demaille	e1f93869da	Fuse yyval and yyloc into yylhs. * data/lalr1.cc (b4_lhs_value, b4_lhs_location): Adjust to using yylhs. (parse): Replace yyval and yyloc with yylhs.value and yylhs.location. After a user action, compute yylhs.state earlier. (yyerrlab1): Do not play tricks with yylhs.location, rather, use a fresh error_token.	2008-11-09 19:29:38 +01:00
Akim Demaille	9380cfd008	Moving push traces into yypush_. * data/lalr1.cc (yypush_): Now takes an optional trace message. Adjust all uses.	2008-11-07 21:38:54 +01:00
Akim Demaille	8901f32e4a	The single-stack C++ parser is now the standard one. * data/lalr1.cc: Rename as... * data/lalr1-split.cc: this. * data/lalr1-fusion.cc: Rename as... * data/lalr1.cc: this. * etc/bench.pl.in: Adjust.	2008-11-07 21:38:45 +01:00
Akim Demaille	a0d4650a09	Remove spurious initial empty lines. * data/glr.c, data/glr.cc, data/lalr1.cc, data/lalr1.java, * data/yacc.c: End the @output lines with an @.	2008-11-04 21:43:36 +01:00
Akim Demaille	a9ce3f5413	Fix output of copyright years. * data/bison.m4 (b4_copyright): Fix the indentation of the copyright year paragraph. Use b4_copyright_years when no years are given. * data/lalr1.cc, data/lalr1-fusion.cc, data/location.cc (b4_copyright_years): New. Use it.	2008-11-04 21:21:38 +01:00
Akim Demaille	7dedf26e55	Push the state, value, and location at the same time. This is needed to prepare a forthcoming patch that fuses the three stacks into one. * data/lalr1.cc (parser::yypush_): New. (parser::yynewstate): Change the semantics: instead of arriving to this label when value and location have been pushed, but yystate is to be pushed on the state stack, now the three of them must have been pushed before. yystate still must be the new state. This allows to use yypush_ everywhere instead of individual handling of the stacks.	2008-11-03 21:51:02 +01:00
Akim Demaille	c4585f1e2d	Prefer references to pointers. * data/lalr1.cc (b4_symbol_actions): New, overrides the default C definition to use references instead of pointers. (yy_symbol_value_print_, yy_symbol_print_, yydestruct_): Take the value and location as references. Adjust callers.	2008-11-03 21:50:57 +01:00
Akim Demaille	56017c172b	stack::size instead of stack::height. * data/lalr1.cc (stack::height): Rename as... (stack::size): this. Fix the output type. Comment changes.	2008-11-03 21:50:53 +01:00
Akim Demaille	5ab8c47bcf	Use variants to support objects as semantic values. This patch was inspired by work by Michiel De Wilde. But he used Boost variants which (i) requires Boost on the user side, (ii) is slow, and (iii) has useless overhead (the parser knows the type of the semantic value there is no reason to duplicate this information as Boost.Variants do). This implementation reserves a buffer large enough to store the largest objects. yy::variant implements this buffer. It was implemented with Quentin Hocquet. * src/output.c (type_names_output): New. (output_skeleton): Invoke it. * data/c++.m4 (b4_variant_if): New. (b4_symbol_value): If needed, provide a definition for variants. * data/lalr1.cc (b4_symbol_value, b4_symbol_action_) (b4_symbol_variant, _b4_char_sizeof_counter, _b4_char_sizeof_dummy) (b4_char_sizeof, yy::variant): New. (parser::parse): If variants are requested, define parser::union_type, parser::variant, change the definition of semantic_type, construct $$ before running the user action instead of performing a default $$ = $1. * examples/variant.yy: New. Based on an example by Michiel De Wilde.	2008-11-03 21:50:48 +01:00


214 Commits