Skip to content

A simple, cross-platform GUI automation toolkit for Python.

License

Notifications You must be signed in to change notification settings

mgregson/autopy

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AutoPy Introduction and Tutorial

by Michael Sanders

Outline

  • Introduction
    • What is AutoPy?
    • Why use AutoPy?
    • What else is AutoPy?
  • Getting Started
    • Requirements
    • Installation
    • Hello World
  • Tutorials
    • Controlling the Mouse
    • Working with Bitmaps
  • Closing & API Reference
  • License
    • Summary
    • MIT License
    • The FreeBSD Documentation License

Introduction

What is AutoPy?

AutoPy is a simple toolkit for automating and scripting repetitive tasks, especially those involving a GUI, with Python. It includes functions for controlling the mouse and keyboard, finding colors and bitmaps on-screen, as well as displaying cross-platform alerts.

Why use AutoPy?

  • Testing a GUI application for bugs and edge cases
    • You might think your app is stable, but what happens if you press that button 5000 times?
  • Automating games
    • Writing a script to beat that crappy flash game can be so much more gratifying than spending hours playing it yourself.
  • Freaking out friends and family
    • Well maybe this isn't really a practical use, but...

What else is AutoPy?

  • Portable
    • Works on Mac OS X, Windows, and X11.
  • Fast
    • Written in pure ANSI C.
  • Simple
    • AutoPy is designed as a toolkit, not a framework -- it doesn't get in your way. At the same time, convenience functions are provided where useful.
  • Easy
    • AutoPy is designed to be simple and easy-to-understand, both for the end user and the implementor; that is, both the public API and the internals are straightforward and well-documented. It should be easy to pick up, and easy to modify if you need.

Getting Started

Requirements

  • Python 2.5+
  • For Mac OS X:
    • zlib
    • libpng (see here for a simple installer)
    • Mac OS 10.5 or later (earlier versions may work, but they are as-of-now untested)
  • For Windows:
    • libpng & zlib (included in both the installer and the source archive)
  • For everything else:
    • libpng & zlib
    • X11 with the XTest extension (also known as the Xtst library)

Installation

Binary Installers

AutoPy installers are currently available for Leopard, Snow Leopard, and Windows. For other platforms, you will have to compile it yourself — fortunately, this is a relatively easy task.

Installing from Source

Check out the latest code:

$ git clone git://github.com/msanders/autopy.git

Compile it:

$ cd autopy
$ python setup.py build

Install it:

# python setup.py install

Hello World

The following is the full source for a "hello world" script in autopy. Running this code will cause an alert dialog to appear on every major platform (make sure you cd out of the build directory before trying this or it may fail on an ImportError due to Python's relative imports):

import autopy
def hello_there_world():
    autopy.alert.alert("Hello, world")
hello_there_world()

Cross platform alerts

Tutorials

Controlling the Mouse

AutoPy includes a number of functions for controlling the mouse. For a full list, consult the API Reference. This short tutorial, however, only gives you a taste of two: autopy.mouse.move() and autopy.mouse.smooth_move(). These functions do exactly what they seem; for instance, to immediately "teleport" the mouse to the top left corner of the screen:

>>> import autopy
>>> autopy.mouse.move(1, 1)

Note that you are able to use the module autopy.mouse despite only importing autopy. This is because the grand autopy module imports all of the modules in the autopy package, so you don't have to.

To move the mouse a bit more realistically, we could use:

>>> import autopy
>>> autopy.mouse.smooth_move(1, 1)

Even better, we could write our own function to move the mouse across the screen as a sine wave:

import autopy
import math
import time
import random

TWO_PI = math.pi * 2.0
def sine_mouse_wave():
	"""
	Moves the mouse in a sine wave from the left edge of
	the screen to the right.
	"""
	width, height = autopy.screen.get_size()
	height /= 2
	height -= 10 # Stay in the screen bounds.

	for x in xrange(width):
		y = int(height * math.sin((TWO_PI * x) / width) + height)
		autopy.mouse.move(x, y)
		time.sleep(random.uniform(0.001, 0.003))

sine_mouse_wave()

Demonstration video

Pretty cool, huh?

Working with Bitmaps

All of autopy's bitmap routines can be found in the function autopy.bitmap (more specifically, most are found in the class autopy.bitmap.Bitmap). A useful way to explore autopy is to use Python's built-in help() function, for example in help(autopy.bitmap.Bitmap). All of autopy's functions are documented with descriptive docstrings, so this should show a nice overview.

There are currently three ways to load a bitmap in autopy: 1.) by taking a screenshot, 2.) by loading a file, or 3.) by parsing a string. The first is probably the most obvious, so I'll start by showing that:

>>> import autopy
>>> autopy.bitmap.capture_screen()
<Bitmap object at 0x12278>

This takes a screenshot of the main screen, copies it to a bitmap, displays its memory address, and then immediately destroys it. Let's do something more useful, like look at its pixel data:

>>> import autopy
>>> autopy.bitmap.capture_screen().get_color(1, 1)
15921906

AutoPy uses a coordinate system with its origin starting at the top-left, so this statement should return the color of pixel at the top-left corner of the screen. The number shown looks a bit unrecognizable, though, but we can fix that:

>>> import autopy
>>> hex(autopy.bitmap.capture_screen().get_color(1, 1))
'0xF2F2F2'

This is obviously an RGB hexadecimal value, identical those used in HTML and CSS. We could also have done:


>>> import autopy
>>> autopy.color.hex_to_rgb(autopy.screen.get_color(1, 1))
(242, 242, 242)

which converts that hex value to a tuple of (r, g, b) values. (Note that autopy.screen.get_color(), used here, is merely a more convenient and efficient version of autopy.bitmap.capture_screen().get_color().)

To save the screen capture to a file, we can use:

>>> import autopy
>>> autopy.bitmap.capture_screen().save('screengrab.png')

The filetype is either parsed automatically from the filename, or given as an optional parameter. AutoPy currently only supports the BMP and PNG filetypes, though, as those are really all that are practical for its purpose.

Loading a bitmap is done essentially the same way, only from a class method:

>>> import autopy
>>> autopy.bitmap.Bitmap.open('i-am-a-monkey-and-i-like-it.png')
<Bitmap object at 0x1001d5378>

Sometimes it is desirable to keep a short script free of any outside dependencies. In the case of bitmaps, this can be accomplished with the to_string() and from_string() methods:

>>> autopy.bitmap.Bitmap.open('foo.png').to_string()
'b2,3,eNpjYGD4f/MwBDGA2QBcMwpt'
>>> autopy.bitmap.Bitmap.from_string('b2,3,eNpjYGD4f/'
								      'MwBDGA2QBcMwpt')
<Bitmap object at 0x12278>

This is not recommended for large bitmaps (a screenshot, for instance, is obviously way too big), but can be useful for short images used in a script you want to be very easily distributable.

Aside from analyzing a bitmap's pixel data, the main use for loading a bitmap is finding it on the screen or inside another bitmap. For example, the following prints the coordinates of the first monkey found in a barrel of monkeys (scanned from left to right, top to bottom):

import autopy
def where_is_the_monkey_i_say():
	"""Look for the monkey. Tell me if you found it."""
	monkey = autopy.bitmap.Bitmap.open('monkey.png')
	barrel = autopy.bitmap.Bitmap.open('barrel.png')

	pos = barrel.find_bitmap(monkey)
	if pos:
		print "We found him! He's at %s!" % str(pos)
	else:
		print "There is no monkey... what kind of barrel is this?!"

where_is_the_monkey_i_say()

As I hope you can see, these functions are enormously useful and have a number of practical values.

Closing & API Reference

Hope you enjoy using autopy! For a more in depth overview, I've attempted to make the API Reference as complete and approachable as possible.

License

Summary

AutoPy (the software) is licensed under the terms of the MIT license.

The documentation for AutoPy is licensed under the terms of the FreeBSD Documentation License.

These licenses are both very liberal and permit nearly anything, including using the code in other projects (as long as credit is given).

MIT License

Copyright © 2010 Michael Sanders.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

The FreeBSD Documentation License

Copyright © 2010 Michael Sanders. All rights reserved.

Redistribution and use in source (Markdown, plaintext, et. al.) and "compiled" forms (HTML, PDF and so forth) with or without modification, are permitted provided that the following conditions are met:

Redistributions of source code (Markdown, plaintext, et. al.) must retain the above copyright notice, this list of conditions and the following disclaimer as the first lines of this file unmodified. Redistributions in compiled form (HTML, PDF and so on) must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

THIS DOCUMENTATION IS PROVIDED "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS DOCUMENTATION, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

About

A simple, cross-platform GUI automation toolkit for Python.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C 92.0%
  • Python 8.0%