Revision 8e8b5e1b7f285bf347faa6ade2d5acd6e72e63ae (click the page title to view the current version)

Introductory Session to Machine Learing

Reading

  • Ma 2004 Chapter 1.
  • (Szeliski 2022 Chapter 1)

Session

  1. Briefing Introduction
  2. Exercise Python Setup (below)
  3. Debrief
    • Recap on linear algebra (introductory slides continued)
    • Lecture Notes on Change of Basis

Practical: Python Setup

Today’s task is to install and test a number of software packages on your computers. The goal is not to produce the output that I ask for, but to make sure the software works for you on your system.

Install Python

We will use Python 3 in this module. There is a myriad of different ways to install and use python, and you have probably used one or another in other modules already.

During this module, we will occasionally work with external USB cameras, and we will also use libraries which open windows to view video and graphics. This can be difficult to get to work in Jupyter, but apart from this, you should use whatever platform you are most comfortable with.

Alternative plaforms

Working with the command line

You need to install the following.

  1. Python
  2. pip. (the package manager for python)
  3. iPython (a more convenient interactive interpreter)

How you install these three packages depends on your OS. In most distroes you can use the package system to install all of this, for instance in Debian/Ubuntu:

(Python is installed automatically as a dependency of iPython3. Note, you have to specify version 3 in the package names, lest you get Python 2.)

Anaconda/Miniconda

Anaconda and Miniconda is a platform for managing python installations. This is recommended for Windows users.

For installation, download miniconda or anaconda

Miniconda¶. Miniconda is a free minimal installer for conda. It is a small, bootstrap version of Anaconda that includes only conda, Python, the packages they depend on, and a small number of other useful packages, including pip, zlib and a few others.

  • docs.conda.io

Create virtualenvironment with

e.g.

Activate with

e.g.

Install with pip install, or conda install

Creating environment

Managing environments — conda 4.10.3.post24+d808108b6 documentation

An explicit spec file is not usually cross platform, and therefore has a comment at the top such as # platform: osx-64 showing the platform where it was created. This platform is the one where this spec file is known to work.

Install Python Packages

Python packages are installed most easily using python’s own packaging tool, pip, which is independent of the OS. It is run from the command line.

Depending on how you installed pip, it may be a good idea to upgrade

Then we install the libraries we need. You can choose to install either in user space or as root.

User space:

As root:

  • numpy is a standard library for numeric computations. In particular it provides a data model for matrices with the appropriate arithmetic functions.
  • matplotlib is a comprehensive library for plotting, both in 2D and 3D.
  • OpenCV is a Computer Vision library, written in C++ with bindings for several different languages.

A third installation alternative is to use Virtual Environments, which allows you to manage python versions and dependencies separately for each project. This may be a good idea if you have many python projects, but if this is your first one, it is not worth the hassle.

Run iPython

Exactly how you run iPython may depend on you OS. In Unix-like systems we can run it straight from the command line:

This should look something like this:

georg$ ipython3 
Python 3.7.3 (default, Jul 25 2020, 13:03:44) 
Type 'copyright', 'credits' or 'license' for more information
IPython 7.3.0 -- An enhanced Interactive Python. Type '?' for help.

In [1]: print("Hello World")                                                    
Hello World

In [2]: import numpy as np                                                      

In [3]: np.sin(np.pi)                                                           
Out[3]: 1.2246467991473532e-16

In [4]: np.sqrt(2)                                                              
Out[4]: 1.4142135623730951

In [5]:                                                                         

Some 3D Operations

In this chapter, we will define a simple 3D Object and display it in python. The 3D object is an irregular tetrahedron, which has four corners and four faces.

Firtsly, we define the three corners of the tetrahedron.

Each face is adjacent to three out of the four corners, and can also be defined by these corners.

To represent the 3D structure for use in 3D libraries, we juxtapose all the faces and cast it as a matrix.

Observe that the vertices (corners) are rows of the matrix. The mathematical textbook model has the corners as columns, and this is something we will have to deal with later.

We define the 3D object ob as follows.

The alpha parameter makes the object opaque. You may also want to play with colours:

To display the object, we need to create a figure with axes.

Note the plt.ion() line. You do not use this in scripts, but in ipython it means that control returns to the prompt once the figure is shown. It is necessary to continue modifying the plot after it has been created.

Now, we can add our object to the plot.

Note It seems that some versions of matplotlib do not work with the above definition of vertices, and rather require a 3D array, as follows:

You can try that, but you may run into trouble when we rotate the figure later. I am looking further into the compatibility between versions.

Quite likely, the object shows outside the range of the axes. We can fix this as follows:

These commands make sure that the axes are scalled so that the two points (-2,-2,-2) and (2,2,2) (defined in the list s) are shown within the domain.

Rotation and Translation of 3D objects

Continuing on the previous section, our 3D object ob is defined by the vertices matrix, where all the rows are points in space. Motion is described by matrix operations on vertices.

Translation

Let us define another vector in \(\mathbb{R}^3\) and add it to each point.

Not that this operation does not make sense in conventional mathematics. We have just added a \(1\times3\) matrix to an \(N\times3\) matrix. How does python interpret this in terms of matrices?

To see what this means visually in 3D space, we can generate a new 3D object from v2. We use a different face colour for clarity.

How does the new object relate to the first one in 3D space?

TODO Check

Rotation

In the previous test, we added a vector to the nodes in the 3D polyhedron. Let’s try instead to multiply by a matrix, like this:

This gives us a new polyhedron, like this:

TODO Check

Removing a 3D Object

Unfortunately, matplotlib.pyplot is not designed for interactive construction or animation of 3D graphics, so some things are little bit tricky. However, it is possible to remove an existing object from the plot. Assume we still have the objects from the last few sections.

First of all, let’s look at the objects we have plotted:

In [12]: ax.collections
Out[12]:
[<mpl_toolkits.mplot3d.art3d.Poly3DCollection at 0x7fdde8a074f0>,
 <mpl_toolkits.mplot3d.art3d.Poly3DCollection at 0x7fdde89fe400>,
 <mpl_toolkits.mplot3d.art3d.Poly3DCollection at 0x7fddf39b28b0>]

In [13]:

Here, ax is the axes system of the figure, and ax.collections is a collection of all of the objects that we have plotted. We can quite simply delete one and refresh the figure.

What happens?

Some Camera Operations

Now, ret should be True, indicating that a frame has successfully been read. If it is False, the following will not work.

You should see a greyscale image from your camera. To close the camera and the window, we run the following.

This example is digested from the tutorial on Getting Started with Videos. You may want to do the rest of the tutorial.

Testing external cameras (optional)

We have a box of external cameras for testing. It is useful to repeat the above exercise with an external camera. That means you have to change the camera selection (0) in this line:

cap = cv.VideoCapture(0)

In unix-like systems, you can use the device name, e.g. /dev/video1 in lieu of the number 0.
Thus you can double-check the camera connection with other capturing software.

Showing a frame (optional)

The following example may visualise rotations somewhat better than the tetrahedron given above. Try it, and observe to learn how it workse.

Step 1 Load libraries and set up a figure with axis system.

We can visualise just the unit vectors, using a quiver plot. We make the qplot function so that we can plot a rotated version afterwards.

If you want to, you can show the plot at this stage.

Now we rotate the unit vectors, and make a new quiver plot.

Finally, we need to scale and show the plot. The set_xlim3d and similar functions make arrow heads.

STL Files (even more optional)

If you have time, you can install and test the numpy-stl library as well. We shall use it next week.