| # The MB (Meta-Build wrapper) design spec |
| |
| [TOC] |
| |
| ## Intro |
| |
| MB is intended to address two major aspects of the GYP -> GN transition |
| for Chromium: |
| |
| 1. "bot toggling" - make it so that we can easily flip a given bot |
| back and forth between GN and GYP. |
| |
| 2. "bot configuration" - provide a single source of truth for all of |
| the different configurations (os/arch/`gyp_define` combinations) of |
| Chromium that are supported. |
| |
| MB must handle at least the `gen` and `analyze` steps on the bots, i.e., |
| we need to wrap both the `gyp_chromium` invocation to generate the |
| Ninja files, and the `analyze` step that takes a list of modified files |
| and a list of targets to build and returns which targets are affected by |
| the files. |
| |
| For more information on how to actually use MB, see |
| [the user guide](user_guide.md). |
| |
| ## Design |
| |
| MB is intended to be as simple as possible, and to defer as much work as |
| possible to GN or GYP. It should live as a very simple Python wrapper |
| that offers little in the way of surprises. |
| |
| ### Command line |
| |
| It is structured as a single binary that supports a list of subcommands: |
| |
| * `mb gen -c linux_rel_bot //out/Release` |
| * `mb analyze -m tryserver.chromium.linux -b linux_rel /tmp/input.json /tmp/output.json` |
| |
| ### Configurations |
| |
| `mb` will first look for a bot config file in a set of different locations |
| (initially just in //ios/build/bots). Bot config files are JSON files that |
| contain keys for 'GYP_DEFINES' (a list of strings that will be joined together |
| with spaces and passed to GYP, or a dict that will be similarly converted), |
| 'gn_args' (a list of strings that will be joined together), and an |
| 'mb_type' field that says whether to use GN or GYP. Bot config files |
| require the full list of settings to be given explicitly. |
| |
| If no matching bot config file is found, `mb` looks in the |
| `//tools/mb/mb_config.pyl` config file to determine whether to use GYP or GN |
| for a particular build directory, and what set of flags (`GYP_DEFINES` or `gn |
| args`) to use. |
| |
| A config can either be specified directly (useful for testing) or by specifying |
| the builder group name and builder name (useful on the bots so that they do not |
| need to specify a config directly and can be hidden from the details). |
| |
| See the [user guide](user_guide.md#mb_config.pyl) for details. |
| |
| ### Handling the analyze step |
| |
| The interface to `mb analyze` is described in the |
| [user\_guide](user_guide.md#mb_analyze). |
| |
| The way analyze works can be subtle and complicated (see below). |
| |
| Since the interface basically mirrors the way the "analyze" step on the bots |
| invokes `gyp_chromium` today, when the config is found to be a gyp config, |
| the arguments are passed straight through. |
| |
| It implements the equivalent functionality in GN by calling `gn refs |
| [list of files] --type=executable --all --as=output` and filtering the |
| output to match the list of targets. |
| |
| ## Analyze |
| |
| The goal of the `analyze` step is to speed up the cycle time of the try servers |
| by only building and running the tests affected by the files in a patch, rather |
| than everything that might be out of date. Doing this ends up being tricky. |
| |
| We start with the following requirements and observations: |
| |
| * In an ideal (un-resource-constrained) world, we would build and test |
| everything that a patch affected on every patch. This does not |
| necessarily mean that we would build 'all' on every patch (see below). |
| |
| * In the real world, however, we do not have an infinite number of machines, |
| and try jobs are not infinitely fast, so we need to balance the desire |
| to get maximum test coverage against the desire to have reasonable cycle |
| times, given the number of machines we have. |
| |
| * Also, since we run most try jobs against tip-of-tree Chromium, by |
| the time one job completes on the bot, new patches have probably landed, |
| rendering the build out of date. |
| |
| * This means that the next try job may have to do a build that is out of |
| date due to a combination of files affected by a given patch, and files |
| affected for unrelated reasons. We want to rebuild and test only the |
| targets affected by the patch, so that we don't blame or punish the |
| patch author for unrelated changes. |
| |
| So: |
| |
| 1. We need a way to indicate which changed files we care about and which |
| we don't (the affected files of a patch). |
| |
| 2. We need to know which tests we might potentially want to run, and how |
| those are mapped onto build targets. For some kinds of tests (like |
| GTest-based tests), the mapping is 1:1 - if you want to run base_unittests, |
| you need to build base_unittests. For others (like the telemetry and |
| layout tests), you might need to build several executables in order to |
| run the tests, and that mapping might best be captured by a *meta* |
| target (a GN group or a GYP 'none' target like `webkit_tests`) that |
| depends on the right list of files. Because the GN and GYP files know |
| nothing about test steps, we have to have some way of mapping back |
| and forth between test steps and build targets. That mapping |
| is *not* currently available to MB (or GN or GYP), and so we have to |
| enough information to make it possible for the caller to do the mapping. |
| |
| 3. We might also want to know when test targets are affected by data files |
| that aren't compiled (python scripts, or the layout tests themselves). |
| There's no good way to do this in GYP, but GN supports this. |
| |
| 4. We also want to ensure that particular targets still compile even if they |
| are not actually tested; consider testing the installers themselves, or |
| targets that don't yet have good test coverage. We might want to use meta |
| targets for this purpose as well. |
| |
| 5. However, for some meta targets, we don't necessarily want to rebuild the |
| meta target itself, perhaps just the dependencies of the meta target that |
| are affected by the patch. For example, if you have a meta target like |
| `blink_tests` that might depend on ten different test binaries. If a patch |
| only affects one of them (say `wtf_unittests`), you don't want to |
| build `blink_tests`, because that might actually also build the other nine |
| targets. In other words, some meta targets are *prunable*. |
| |
| 6. As noted above, in the ideal case we actually have enough resources and |
| things are fast enough that we can afford to build everything affected by a |
| patch, but listing every possible target explicitly would be painful. The |
| GYP and GN Ninja generators provide an 'all' target that captures (nearly, |
| see [crbug.com/503241](crbug.com/503241)) everything, but unfortunately |
| neither GN nor GYP actually represents 'all' as a meta target in the build |
| graph, so we will need to write code to handle that specially. |
| |
| 7. In some cases, we will not be able to correctly analyze the build graph to |
| determine the impact of a patch, and need to bail out (e.g,. if you change a |
| build file itself, it may not be easy to tell how that affects the graph). |
| In that case we should simply build and run everything. |
| |
| The interaction between 2) and 5) means that we need to treat meta targets |
| two different ways, and so we need to know which targets should be |
| pruned in the sense of 5) and which targets should be returned unchanged |
| so that we can map them back to the appropriate tests. |
| |
| So, we need three things as input: |
| |
| * `files`: the list of files in the patch |
| * `test_targets`: the list of ninja targets which, if affected by a patch, |
| should be reported back so that we can map them back to the appropriate |
| tests to run. Any meta targets in this list should *not* be pruned. |
| * `additional_compile_targets`: the list of ninja targets we wish to compile |
| *in addition to* the list in `test_targets`. Any meta targets |
| present in this list should be pruned (we don't need to return the |
| meta targets because they aren't mapped back to tests, and we don't want |
| to build them because we might build too much). |
| |
| We can then return two lists as output: |
| |
| * `compile_targets`, which is a list of pruned targets to be |
| passed to Ninja to build. It is acceptable to replace a list of |
| pruned targets by a meta target if it turns out that all of the |
| dependendencies of the target are affected by the patch (i.e., |
| all ten binaries that blink_tests depends on), but doing so is |
| not required. |
| * `test_targets`, which is a list of unpruned targets to be mapped |
| back to determine which tests to run. |
| |
| There may be substantial overlap between the two lists, but there is |
| no guarantee that one is a subset of the other and the two cannot be |
| used interchangeably or merged together without losing information and |
| causing the wrong thing to happen. |
| |
| The implementation is responsible for recognizing 'all' as a magic string |
| and mapping it onto the list of all root nodes in the build graph. |
| |
| There may be files listed in the input that don't actually exist in the build |
| graph: this could be either the result of an error (the file should be in the |
| build graph, but isn't), or perfectly fine (the file doesn't affect the build |
| graph at all). We can't tell these two apart, so we should ignore missing |
| files. |
| |
| There may be targets listed in the input that don't exist in the build |
| graph; unlike missing files, this can only indicate a configuration error, |
| and so we should return which targets are missing so the caller can |
| treat this as an error, if so desired. |
| |
| Any of the three inputs may be an empty list: |
| |
| * It normally doesn't make sense to call analyze at all if no files |
| were modified, but in rare cases we can hit a race where we try to |
| test a patch after it has already been committed, in which case |
| the list of modified files is empty. We should return 'no dependency' |
| in that case. |
| |
| * Passing an empty list for one or the other of test_targets and |
| additional_compile_targets is perfectly sensible: in the former case, |
| it can indicate that you don't want to run any tests, and in the latter, |
| it can indicate that you don't want to do build anything else in |
| addition to the test targets. |
| |
| * It doesn't make sense to call analyze if you don't want to compile |
| anything at all, so passing [] for both test_targets and |
| additional_compile_targets should probably return an error. |
| |
| In the output case, an empty list indicates that there was nothing to |
| build, or that there were no affected test targets as appropriate. |
| |
| Note that passing no arguments to Ninja is equivalent to passing |
| `all` to Ninja (at least given how GN and GYP work); however, we |
| don't want to take advantage of this in most cases because we don't |
| actually want to build every out of date target, only the targets |
| potentially affected by the files. One could try to indicate |
| to analyze that we wanted to use no arguments instead of an empty |
| list, but using the existing fields for this seems fragile and/or |
| confusing, and adding a new field for this seems unwarranted at this time. |
| |
| There is an "error" field in case something goes wrong (like the |
| empty file list case, above, or an internal error in MB/GYP/GN). The |
| analyze code should also return an error code to the shell if appropriate |
| to indicate that the command failed. |
| |
| In the case where build files themselves are modified and analyze may |
| not be able to determine a correct answer (point 7 above, where we return |
| "Found dependency (all)"), we should also return the `test_targets` unmodified |
| and return the union of `test_targets` and `additional_compile_targets` for |
| `compile_targets`, to avoid confusion. |
| |
| ### Examples |
| |
| Continuing the example given above, suppose we have the following build |
| graph: |
| |
| * `blink_tests` is a meta target that depends on `webkit_unit_tests`, |
| `wtf_unittests`, and `webkit_tests` and represents all of the targets |
| needed to fully test Blink. Each of those is a separate test step. |
| * `webkit_tests` is also a meta target; it depends on `content_shell` |
| and `image_diff`. |
| * `base_unittests` is a separate test binary. |
| * `wtf_unittests` depends on `Assertions.cpp` and `AssertionsTest.cpp`. |
| * `webkit_unit_tests` depends on `WebNode.cpp` and `WebNodeTest.cpp`. |
| * `content_shell` depends on `WebNode.cpp` and `Assertions.cpp`. |
| * `base_unittests` depends on `logging.cc` and `logging_unittest.cc`. |
| |
| #### Example 1 |
| |
| We wish to run 'wtf_unittests' and 'webkit_tests' on a bot, but not |
| compile any additional targets. |
| |
| If a patch touches WebNode.cpp, then analyze gets as input: |
| |
| { |
| "files": ["WebNode.cpp"], |
| "test_targets": ["wtf_unittests", "webkit_tests"], |
| "additional_compile_targets": [] |
| } |
| |
| and should return as output: |
| |
| { |
| "status": "Found dependency", |
| "compile_targets": ["webkit_unit_tests"], |
| "test_targets": ["webkit_tests"] |
| } |
| |
| Note how `webkit_tests` was pruned in compile_targets but not in test_targets. |
| |
| #### Example 2 |
| |
| Using the same patch as Example 1, assume we wish to run only `wtf_unittests`, |
| but additionally build everything needed to test Blink (`blink_tests`): |
| |
| We pass as input: |
| |
| { |
| "files": ["WebNode.cpp"], |
| "test_targets": ["wtf_unittests"], |
| "additional_compile_targets": ["blink_tests"] |
| } |
| |
| And should get as output: |
| |
| { |
| "status": "Found dependency", |
| "compile_targets": ["webkit_unit_tests"], |
| "test_targets": [] |
| } |
| |
| Here `blink_tests` was pruned in the output compile_targets, and |
| test_targets was empty, since blink_tests was not listed in the input |
| test_targets. |
| |
| #### Example 3 |
| |
| Build everything, but do not run any tests. |
| |
| Input: |
| |
| { |
| "files": ["WebNode.cpp"], |
| "test_targets": [], |
| "additional_compile_targets": ["all"] |
| } |
| |
| Output: |
| |
| { |
| "status": "Found dependency", |
| "compile_targets": ["webkit_unit_tests", "content_shell"], |
| "test_targets": [] |
| } |
| |
| #### Example 4 |
| |
| Same as Example 2, but a build file was modified instead of a source file. |
| |
| Input: |
| |
| { |
| "files": ["BUILD.gn"], |
| "test_targets": ["wtf_unittests"], |
| "additional_compile_targets": ["blink_tests"] |
| } |
| |
| Output: |
| |
| { |
| "status": "Found dependency (all)", |
| "compile_targets": ["webkit_unit_tests", "wtf_unittests"], |
| "test_targets": ["wtf_unittests"] |
| } |
| |
| test_targets was returned unchanged, compile_targets was pruned. |
| |
| ## Random Requirements and Rationale |
| |
| This section is collection of semi-organized notes on why MB is the way |
| it is ... |
| |
| ### in-tree or out-of-tree |
| |
| The first issue is whether or not this should exist as a script in |
| Chromium at all; an alternative would be to simply change the bot |
| configurations to know whether to use GYP or GN, and which flags to |
| pass. |
| |
| That would certainly work, but experience over the past two years |
| suggests a few things: |
| |
| * we should push as much logic as we can into the source repositories |
| so that they can be versioned and changed atomically with changes to |
| the product code; having to coordinate changes between src/ and |
| build/ is at best annoying and can lead to weird errors. |
| * the infra team would really like to move to providing |
| product-independent services (i.e., not have to do one thing for |
| Chromium, another for NaCl, a third for V8, etc.). |
| * we found that during the SVN->GIT migration the ability to flip bot |
| configurations between the two via changes to a file in chromium |
| was very useful. |
| |
| All of this suggests that the interface between bots and Chromium should |
| be a simple one, hiding as much of the chromium logic as possible. |
| |
| ### Why not have MB be smarter about de-duping flags? |
| |
| This just adds complexity to the MB implementation, and duplicates logic |
| that GYP and GN already have to support anyway; in particular, it might |
| require MB to know how to parse GYP and GN values. The belief is that |
| if MB does *not* do this, it will lead to fewer surprises. |
| |
| It will not be hard to change this if need be. |
| |
| ### Integration w/ gclient runhooks |
| |
| On the bots, we will disable `gyp_chromium` as part of runhooks (using |
| `GYP_CHROMIUM_NO_ACTION=1`), so that mb shows up as a separate step. |
| |
| At the moment, we expect most developers to either continue to use |
| `gyp_chromium` in runhooks or to disable at as above if they have no |
| use for GYP at all. We may revisit how this works once we encourage more |
| people to use GN full-time (i.e., we might take `gyp_chromium` out of |
| runhooks altogether). |
| |
| ### Config per flag set or config per (os/arch/flag set)? |
| |
| Currently, mb_config.pyl does not specify the host_os, target_os, host_cpu, or |
| target_cpu values for every config that Chromium runs on, it only specifies |
| them for when the values need to be explicitly set on the command line. |
| |
| Instead, we have one config per unique combination of flags only. |
| |
| In other words, rather than having `linux_rel_bot`, `win_rel_bot`, and |
| `mac_rel_bot`, we just have `rel_bot`. |
| |
| This design allows us to determine easily all of the different sets |
| of flags that we need to support, but *not* which flags are used on which |
| host/target combinations. |
| |
| It may be that we should really track the latter. Doing so is just a |
| config file change, however. |
| |
| ### Non-goals |
| |
| * MB is not intended to replace direct invocation of GN or GYP for |
| complicated build scenarios (aka ChromeOS), where multiple flags need |
| to be set to user-defined paths for specific toolchains (e.g., where |
| ChromeOS needs to specify specific board types and compilers). |
| |
| * MB is not intended at this time to be something developers use frequently, |
| or to add a lot of features to. We hope to be able to get rid of it once |
| the GYP->GN migration is done, and so we should not add things for |
| developers that can't easily be added to GN itself. |
| |
| * MB is not intended to replace the |
| [CR tool](https://code.google.com/p/chromium/wiki/CRUserManual). Not |
| only is it only intended to replace the gyp\_chromium part of `'gclient |
| runhooks'`, it is not really meant as a developer-facing tool. |