android13/external/autotest/docs/best-practices.md

25 KiB
Raw Blame History

Autotest Best Practices

When the Chrome OS team started using autotest, we tried our best to figure out how to fit our code and our tests into the upstream style with little guidance and poor documentation. This went poorly. With the benefit of hindsight, were going to lay out some best-practices that wed like to enforce going forward. In many cases, there is legacy code that contradicts this style; we should go through and refactor that code to fit these guidelines as time allows.

Upstream Documentation

There is a sizeable volume of general Autotest documentation available on github: https://github.com/autotest/autotest/wiki

Coding style

Basically PEP-8. See docs/coding-style.md

Where should my code live?

Type of Code Relative Path
client-side tests client/site_tests/
server-side tests server/site_tests
common library code client/common_lib/cros/
server-only library code server/cros

Writing tests

An autotest is really defined by its control file. A control file contains important metadata about the test (name, author, description, duration, what suite its in, etc) and then pulls in and executes the actual test code. This test code can be shared among multiple distinct test cases by parameterizing it and passing those parameters in from separate control files.

Autotests must:

  • Be self-contained: assume nothing about the condition of the device
  • Be hermetic: requiring the Internet to be reachable in order for your test to succeed is unacceptable.
  • Be automatic: avoid user interaction and run-time specification of input values.
  • Be integration tests: if you can test the feature in a unit test (or a chrome browser test), do so.
  • Prefer object composition to inheritance: avoid subclassing test.test to implement common functionality for multiple tests. Instead, create a class that your tests can instantiate to perform common operations. This enables us to write tests that use both PyAuto and Servo without dealing with multiple inheritance, for example.
  • Be deterministic: a test should not validate the timing of some operation. Instead, write a test that records the timing in performance keyvals so that we can track the numbers over time.

Autotests must not:

  • Put significant logic in the control file: control files are really just python, so one can put arbitrary logic in there. Dont. Run your test code, perhaps with some parameters.

Autotests may:

  • Share parameterized fixtures: a test is defined by a control file. Control files import and run test code, and can pass simple parameters to the code they run through a well-specified interface.

Autotest has a notion of both client-side tests and server-side tests. Code in a client-side test runs only on the device under test (DUT), and as such isnt capable of maintaining state across reboots or handling a failed suspend/resume and the like. If possible, an autotest should be written as a client-side test. A server test runs on the autotest server, but gets assigned a DUT just like a client-side test. It can use various autotest primitives (and library code written by the CrOS team) to manipulate that device. Most, if not all, tests that use Servo or remote power management should be server-side tests, as an example.

Adding a test involves putting a control file and a properly-written test wrapper in the right place in the source tree. There are conventions that must be followed, and a variety of primitives available for use. When writing any code, whether client-side test, server-side test, or library, have a strong bias towards using autotest utility code. This keeps the codebase consistent.

Writing a test

This section explains considerations and requirements for any autotest, whether client or server.

Control files

Upstream documentation Our local conventions for autotest control files deviate from the above a bit, but the indication about which fields are mandatory still holds.

Variable Required Value
AUTHOR Yes A comma-delimited string of at least one responsible engineer and a backup engineer -- or at worst a backup mailing list. i.e. AUTHOR = msb, snanda
DEPENDENCIES No list of tags known to the HW test lab.
DOC Yes Long description of the test, pass/fail criteria
NAME Yes Display name of the test. Generally this is the directory where your test lives e.g. hardware_TPMCheck. If you are using multiple run_test calls in the same control file or multiple control files with one test wrapper in the same suite, problems arise with the displaying of your test name. crosbug.com/35795. When in doubt ask.
SYNC_COUNT No Integer >= 1. Number of simultaneous devices needed for a test run.
TIME Yes Test duration: 'FAST' (<1m), 'MEDIUM' (<10m), 'LONG' (<20m), 'LENGTHY' (>30m)
TEST_TYPE Yes Client or Server
ATTRIBUTES No Comma separated list of attribute tags to apply to this control file, used in composing suites. For instance, 'suite:foo, suite:bar'.

Running tests in suites

Make sure that the suite name is listed in site_utils/attribute_allowlist.txt, then add the appropriate attribute to the ATTRIBUTES field in tests that make up the test suite. For instance:

...
ATTRIBUTES = 'suite:suite-a, suite:suite-b'
...

would indicate that the control file above should be run as part of both suite-a and suite-b.

Pure python

Lie, cheat and steal to keep your tests in pure python. It will be easier to debug failures, it will be easier to generate meaningful error output, it will be simpler to get your tests installed and run, and it will be simpler for the lab team to build tools that allow you to quickly iterate.

Shelling out to existing command-line tools is done fairly often, and isnt a terrible thing. The test author can wind up having to do a lot of output parsing, which is often brittle, but this can be a decent tradeoff in lieu of having to reimplement large pieces of functionality in python.

Note that you will need to be sure that any commands you use are installed on the host. For a client-side test, “the host” means “the DUT”. For a server-side test, “the host” typically means “the system running autoserv”; however, if you use SiteHost.run(), the command will run on the DUT. On the server, your tests will have access to all tools common to both a typical CrOS chroot environment and standard Goobuntu.

If you want to use a tool on the DUT, it may be appropriate to include it as a dependency of the chromeos-base/chromeos-test package. This ensures that the tool is pre-installed on every test image for every device, and will always be available for use. Otherwise, the tool must be installed as an autotest “dep”.

Never install your own shell scripts and call them. Anything you can do in shell, you can do in python.

Reporting failures

Autotest supports several kinds of failure statuses:

Status Exception Reason
WARN error.TestWarn error.TestWarn should be used when side effects to the test running are encountered but are not directly related to the test running. For example, if you are testing Wifi and powerd crashes. Currently there are not any clear usecases for this and error.TestWarn should be generally avoided until further notice.
TEST_NA error.TestNAError This test does not apply in the current environment.
ERROR error.TestError The test was unable to validate the desired behavior.
FAIL error.TestFail The test determined the desired behavior failed to occur.

Considerations when writing client-side tests

All client-side tests authored at Google must live in the client/site_tests sub-directory of the autotest source tree.

###Compiling and executing binaries

It is possible to compile source thats included with your test and use the products at test runtime. The build infrastructure will compile this code for the appropriate target architecture and package it up along with the rest of your tests resources, but this increases your development iteration time as you need to actually re-build and re-package your test to deploy it to the device. While we hope to improve tooling support for this use case in the future, avoiding this issue is the ideal.

If you cant avoid this, heres how to get your code compiled and installed as a part of your test:

  1. Create a src/ directory next to your control file.
  2. Put your source, including its Makefile, in src/
  3. define a method in your test class called “setup(self)” that takes no arguments.
  4. setup(self) should perform all tasks necessary to build your tool. There are some helpful utility functions in client/common_lib/utils.py. Trivial example:
    def setup(self):
        os.chdir(self.srcdir)
        utils.make('OUT_DIR=.')

Reusing code (“fixtures”)

Any autotest is, essentially, a single usage of a re-usable test fixture. This is because run_once() in your test wrapper can take any arguments you want. As such, multiple control files can re-use the same wrapper -- and should, where it makes sense.

Considerations when writing server-side tests

All server-side tests authored at Google must live in the server/site_tests sub-directory of the autotest source tree.

It should be even easier to keep the server-side of a test in pure python, as you should simply be driving the DUT and verifying state.

When/why to write a server-side test

Server-side tests are appropriate when some operation in the test can't be executed on the DUT. The prototypical example is rebooting the DUT. Other examples include tests that manipulate the network around the DUT (e.g. WiFi tests), tests that power off the DUT, and tests that rely on a Servo attached to the DUT.

One simple criterion for whether to write a server-side test is this: Is the DUT an object that the test must manipulate? If the answer is “yes”, then a server-side test makes sense.

Control files for server-side tests

Server-side tests commonly operate on the DUT as an object. Autotest represents the DUT with an instance of class Host; the instance is constructed and passed to the test from the control file. Creating the host object in the control file can be done using certain definitions present in the global environment of every control file:

  • Function hosts.create_host() will create a host object from a string with the name of the host (an IP address as a string is also acceptable).
  • Variable machines is a list of the host names available to the test.

Below is a sample fragment for a control file that runs a simple server side test in parallel on all the hosts specified for the test. The fragment is a complete control file, except for the missing boilerplate comments and documentation definitions required in all control files.

def run(machine):
    host = hosts.create_host(machine)
    job.run_test("platform_ServerTest", host=host)

parallel_simple(run, machines)

Note: The sample above relies on a common convention that the run_once() method of a server-side test defines an argument named host with a default value, e.g.

def run_once(self, host=None):
    # … test code goes here.

Operations on Host objects

A Host object supports various methods to operate on a DUT. Below is a short list of important methods supported by instances of Host:

  • run(command) - run a shell command on the host
  • reboot() - reboot the host, and wait for it to be back on the network
  • wait_up() - wait for the host to be active on the network
  • wait_down() - wait until the host is no longer on the network, or until it is known to have rebooted.

More details, including a longer list of available methods, and more about how they work can be found in the Autotest documentation for autoserv and Autotest documentation for Host.

Servo-based tests

For server-side tests that use a servo-attached DUT, the host object has a servo attribute. If Autotest determines that the DUT has a Servo attached, the servo attribute will be a valid instance of a Servo client object; otherwise the attribute will be None.

For a DUT in the lab, Autotest will automatically determine whether there is a servo available; however, if a test requires Servo, its control file must have additional code to guarantee a properly initialized servo object on the host.

Below is a code snippet outlining the requirements; portions of the control file have been omitted for brevity:

# ... Standard boilerplate variable assignments...
DEPENDENCIES = "servo_state:WORKING"
# ... more standard boilerplate...
# servo_state:WORKING - servo is present and can provide required functionality
# servo_state:BROKEN - servo is present but cannot provide required functionality

args_dict = utils.args_to_dict(args)
servo_args = hosts.SiteHost.get_servo_arguments(args_dict)

def run(machine):
    host = hosts.create_host(machine, servo_args=servo_args)
    job.run_test("platform_SampleServoTest", host=host)

parallel_simple(run, machines)

The DEPENDENCIES setting guarantees that if the test is scheduled in the lab, it will be assigned to a DUT that has a servo.

The setting of servo_args guarantees two distinct things: First, it forces checks that will make sure that the Servo is functioning properly; this guarantees that the host's servo attribute will not be None. Second, the code allows you to pass necessary servo specific command-line arguments to test_that.

If the test control file follows the formula above, the test can be reliably called in a variety of ways:

  • When used for hosts in the lab, the hosts servo object will use the servo attached to the host, and the test can assume that the servo object is not None.
  • If you start servod manually on your desktop using the default port, you can use test_that without any special options.
  • If you need to specify a non-default host or port number (e.g. because servod is remote, or because you have more than one servo board), you can specify them with commands like these:
test_that --args=”servo_host=...” …
test_that --args=”servo_port=...” …
test_that --args=”servo_host=... servo_port=...” ...

Calling client-side tests from a server-side test

Commonly, server-side tests need to do more on the DUT than simply run short shell commands. In those cases, a client-side test should be written and invoked from the server-side test. In particular, a client side test allows the client side code to be written in Python that uses standard Autotest infrastructure, such as various utility modules or the logging infrastructure.

Below is a short snippet showing the standard form for calling a client-side test from server-side code:

from autotest_lib.server import autotest

    # ... inside some function, e.g. in run_once()
    client_at = autotest.Autotest(host)
    client_at.run_test("platform_ClientTest")

Writing library code

There is a large quantity of Chromium OS specific code in the autotest codebase. Much of this exists to provide re-usable modules that enable tests to talk to system services. The guidelines from above apply here as well. This code should be as pure python as possible, though it is reasonable to shell out to command line tools from time to time. In some cases weve done this where we could (now) use the services DBus APIs directly. If youre adding code to allow tests to communicate with your service, it is strongly recommended that you use DBus where possible, instead of munging config files directly or using command-line tools.

Currently, our library code lives in a concerning variety of places in the autotest tree. This is due to a poor initial understanding of how to do things, and new code should follow the following conventions instead:

  • Used only in server-side tests: server/cros
  • Used in both server- and client-side tests, or only client: client/common_lib/cros

Adding test deps

This does not refer to the optional DEPENDENCIES field in test control files. Rather, this section discusses how and when to use code/data/tools that are not pre-installed on test images, and should (or can) not be included right in with the test source.

Unfortunately, there is no hard-and-fast rule here. Generally, if this is some small tool or blob of data you need for a single test, you should include it as discussed above in Writing client-side tests. If youre writing the tool, and it has use for developers as well as in one or more tests that youre writing, then make it a first-class CrOS project. Write an ebuild, write unit tests, and then add it to the test image by default. This can be done by RDEPENDing on your new test package from the chromeos-test ebuild.

If your code/data falls in the middle (useful to several tests, not to devs), and/or is large (hundreds of megabytes as opposed to tens) then using an autotest dep may be the right choice. Conceptually, an autotest test dep is simply another kind of archive that the autotest infrastructure knows how to fetch and unpack. There are two components to including a dependency from an autotest test -- setup during build time and installing it on your DUT when running a test. The setup phase must be run from your tests setup() method like so:

def setup(self):
  self.job.setup_dep([mydep])
  logging.debug(mydep is at %s % (os.path.join(self.autodir,
                                                 deps/mydep))

The above gets run when you “build” the test.

The other half of this equation is actually installing the dependency so you can use it while running a test. To do this, add the following to either your run_once or initialize methods:

        dep = dep_name
        dep_dir = os.path.join(self.autodir, 'deps', dep=dep)
        self.job.install_pkg(dep, 'dep', dep_dir)

You can now reference the content of your dep using dep_dir.

Now that you know how to include a dep, the next question is how to write one. Before you read further, you should check out client/deps/* for many examples of deps in our autotest tree.

Create a dep from a third-party package

There are many examples of how to do this in the client/deps directory already. The key component is to check in a tarball of the version of the dependency youd like to include under client/deps/your_dep.

All deps require a control file and an actual python module by the same name. They will also need a copy of common.py to import utils.update_version. Both the control and common are straightforward, the python module does all the magic.

The deps python module follows a standard convention: a setup function and a call to utils.update_version. update_version is used instead of directly calling setup as it maintains additional versioning logic ensuring setup is only done 1x per dep. The following is its method signature:

def update_version(srcdir, preserve_srcdir, new_version, install,
                   *args, **dargs)

Notably, install should be a pointer to your setup function and *args should be filled in with params to said setup function.

If you are using a tarball, your setup function should look something like:

def setup(tarball, my_dir)
    utils.extract_tarball_to_dir(tarball, my_dir)
    os.chdir(my_dir)
    utils.make() # this assumes your tarball has a Makefile.

And you would invoke this with:

utils.update_version(os.getcwd(), True, version, setup, tarball_path,
                     os.getcwd())

Note: The developer needs to call this because def setup is a function they are defining that can take any number of arguments or install the dep in any way they see fit. The above example uses tarballs but some are distributed as straight source under the src dir so their setup function only takes a top level path. We could avoid this by forcing a convention but that would be artificially constraining the deps mechanism.

Once youve created the dep, you will also have to add the dep to the autotest-deps package in chromiumos-overlay/chromeos-base/autotest-deps, cros_workon start it, and re-emerge it.

Create a dep from other chrome-os packages

One can also create autotest deps from code that lives in other CrOS packages, or from build products generated by other packages. This is similar as above but you can reference code using the CHROMEOS_ROOT env var that points to the root of the CrOS source checkout, or the SYSROOT env var (which points to /build/) to refer to build products. Again, read the above. Heres an example of the former with the files I want in chromeos_tree/chromite/my_dep/* where this will be the python code in autotest/files/client/deps/my_dep/my_dep.py module.

import common, os, shutil
from autotest_lib.client.bin import utils

version = 1

def setup(setup_dir):
    my_dep_dir = os.path.join(os.environ['CHROMEOS_ROOT'], 'chromite',
                              'buildbot')
    shutil.copytree(my_dep_dir, setup_dir)


work_dir = os.path.join(os.getcwd(), 'src')
utils.update_version(os.getcwd(), True, version, setup, work_dir)