Appium automation testing
Learn how to run Appium tests for applications on Linux
Last updated
Learn how to run Appium tests for applications on Linux
Last updated
KDE uses manually triggered unit tests as well as autotests to prevent code changes to our applications from introducing usability issues. Oftentimes, this is not enough for graphical applications, so testing the user interface directly is required. To automate this, KDE uses Appium tests.
is an open source tool based on for automating applications on various mobile and desktop platforms.
Selenium itself is used to automate testing web applications. Appium derives from it to allow for application testing in multiple platforms and in multiple languages.
Appium does this leverage of multiple platforms and languages by using a driver. A driver is a component or interface that acts as a bridge between your test scripts and the application you want to automate. With it, developers can write Selenium-style user interface tests for manipulating the user interface, test accessibility, and measure power consumption.
Appium has the ability to automate Linux apps by using the Linux accessibility API . KDE has a driver for this called , a server for Linux apps that runs the application in a localhost webserver so it can work in non-GUI environments (such as CI jobs), while also being able to run locally in a Wayland compositor (KWin) so the tester can run the tests manually and see the results.
The Linux Driver supports testing applications that implement the AT-SPI2 interface. Applications written in or have good support for the AT-SPI2 interface.
In Qt's case, all you need to do is to follow standard practices.
This tutorial will provide a step-by-step guide to use Selenium AT-SPI to automate system testing of QtQuick applications using Python scripts.
This tutorial requires Python 3.8+ which is included by default in most distributions.
Installation of (an interactive accessibility explorer) and (screen reader) are recommended if you want to test accessibility with Appium tests.
After the development environment is set up, run the command below to build both selenium-webdriver-at-spi and kcalc from source. The command will automatically install all the required dependencies:
Run the example test script to check that your installation is working.
With kdesrc-build:
With kde-builder:
You will see a black window titled "KDE Wayland Compositor" running a KCalc window and having a few buttons getting highlighted (very fast!) as they're activated.
After the test is complete, the window will close itself automatically, and you can see the test result in the console output.
To build Selenium manually, you will need to install its dependencies first:
Then clone the repository and build it:
This will install it in your host root.
Since this tutorial will focus on test scripts, at first we will be using QML code snippets instead of a full project to exemplify their use. First, we should see how an Appium test looks like.
Writing an Appium test consists of three things:
a class that is initialized with Appium options
class members that start with the name "test_"
an entrypoint
At minimum, an Appium test should contain the following:
The "app" capability points to the application that will be tested, and it can be:
The absolute path to an executable and optionally its command line parameters.
The relative path to an executable and optionally its command line parameters (used for autotests when the application is being built).
The desktop file that runs the application, without the file path and including the .desktop
extension.
The second class method functions like a destructor that makes sure the driver will quit once the tests are done. It also lets you terminate any other external processes you might have started for your tests, like a small web server.
Next we use some boilerplate code that allows Appium to run at a localhost web server at a certain port, passing the options and capabilities we have defined. Then, we set a timeout of 10 seconds to allow the test to prepare and run the application when it is executed.
As mentioned before, we need a class method that starts with the name "test_" and that will perform the actual test.
Lastly we create the main entrypoint for our test.
For QML applications, interaction with Appium tests is simple and consists of two steps:
Defining a property to a QML element that makes it locatable.
Using a Selenium Locator to find that QML element.
Essential Locators
For example, if we have a QtQuick.Controls.Button:
A test that clicks said button can be defined like the following:
A string can be directly passed to find_element()
as well. This is useful to access the Accessible.description
of an element:
In a test, we'd use:
You would simply do:
There are many other convenience functions:
Other ways to use locators
The locator would be used like so:
While the accessibility ID is a unique identifier, the Role on the other hand is a generic identifier for an element type, especially used for keyboard navigation. It is used to determine how to traverse focus, for example, in a menu bar, a tool bar, a check box, and other elements. In code terms, this means a QtQuick.Controls.Button and a Kirigami.Action attached to a button would share the same role of "push button", as they are both functionally buttons.
To inspect roles in KDE applications, you first need to open Accerciser, and then run the desired KDE application by passing the environment variable QT_LINUX_ACCESSIBILITY_ALWAYS_ON
, like so:
Discover for example has a test that first waits for the loading screen to vanish before searching for certain user interface elements. To do so, it first searches for the "Loading..." label's Role:
So far, we have seen how to interact with GUI elements, but we haven't tested anything yet.
A common pattern used in KDE software is to write a getresults()
function that retrieves a property from a certain element. This is useful for when multiple UI elements are handled and the result culminates into a single element. The standard calculatortest.py
provided with Selenium AT-SPI is an example of this, with the input of multiple calculator keys being probed before checking the result in the text field:
Another assert pattern is used to test whether the values you get from interacting with the application's UI matches an expected result. This can be done with an assertResult(self, actual, expected)
method. Generally you want for the driver to have finished loading and having run the getresults()
method, so a timeout is needed.
Now you should know everything you need to write your own Appium tests! 🎉
,
,
is the recommended tool to build selenium-webdriver-at-spi from source. Learn .
,
The class can have any name you'd like, conventionally using uppercase, and it must derive from .
The first class method is like a constructor where we perform some setup and can have any name, but it is conventionally called "setUpClass()". AppiumOptions()
provides the default settings for our tests. set_capability()
takes a , analogous to or .
In this case, the first string is the key, the second string is the value. The list of keys and values available can be seen in the . As can be seen in the documentation, the string "app" used above corresponds to the capability "appium:app".
The Qt API that is required to make a QML element locatable is . Typically what is used is or .
To find that QML element, we'd use the aptly named Appium function , to which we pass a .
The locator that is most commonly used for tests in KDE software is . It matches the Accessible.name
set in the QML code.
Another particularly useful Appium function we can use is , which allows to send keyboard input to the application. With a simple QtQuick.Controls.TextField for example:
If you need to enter certain non-typable keys, such as Enter/Return, you can use for that:
Another commonly used function is . It can be used to determine whether an element currently has an Accessible
attribute such as or enabled.
touchscreen
mouse
These can be seen at the .
Additional commands specific to using Python scripts can be seen at the .
In the introductory section , you became aware that Linux has a standard driver for handling accessibility. This driver is used, for example, in the Qt and GTK toolkits, and it specifies things such as accessibility IDs or Roles.
For the accessibility ID, which serves as a unique identifier for a user interface element, we can use the Appium-provided locator , which should match the QML .
The accessibility ID can be used in a generic manner like NAME or "description" as shown above, but its primary use is with QtWidgets applications. You can see a live example of this in .
You can view the available roles in an application by using , an application that allows you to explore accessibility properties in other programs. KDE also has its own accessibility explorer, , which has not yet been packaged by distros.
Then it uses the Selenium function together with the Selenium locator to be sure the "Loading..." element is no longer there.
Generally, tests assert whether the performed action was successful or not. This is done by using the . The most common methods are:
Whenever a test fails, it is customary to take a screenshot of the result so that even tests done in the can be viewed by developers as downloadable artifacts. This can be achieved with in the tearDownClass()
destructor: