👁️
AI🔬 Ages 11-13Intermediate 9 min read

Computer Vision and Face Recognition

How computer vision and face recognition work: from pixels to features to matching faces, plus real limits, errors, bias and privacy concerns.

Key takeaways

  • Computer vision turns images into numbers and finds patterns in them
  • Face recognition measures features and turns each face into a list of numbers called a faceprint
  • Matching means comparing faceprints to see how similar they are
  • These systems make mistakes and can be biased, which raises real privacy and fairness concerns

What computer vision really is

Humans see with their eyes and brains in an instant. You glance at a busy street and know what is a car, a dog, a friend. Computer vision is the field that tries to give computers a version of this ability. But computers do it in a completely different way, and understanding that difference is the key to this whole topic.

A computer has no eyes and no understanding. What it has is numbers. A photo is stored as a grid of tiny dots called pixels, and each pixel is just numbers describing its colour, usually a value for red, green and blue. If you want the basics of that, see How Computers See Pictures. So before any "vision" happens, an image is already a giant table of numbers.

Computer vision is the set of techniques that find meaningful patterns in those numbers, so the computer can label what is in the image, find objects, or recognise faces.

From pixels to features

Working with raw pixels is hard. A single face photo might have millions of pixel numbers, and they change completely if the lighting shifts or the person turns slightly. So computer vision systems first pull out features: smaller, more useful pieces of information.

Early features are simple, like edges (places where colour changes sharply, which often mark the outline of an object) and corners. From edges you can build up bigger features, like shapes and textures. Modern systems use neural networks to learn which features matter, layer by layer, instead of having people choose them by hand. You can read how those work in Neural Networks Explained.

The important idea is a ladder:

  1. Pixels (raw colour numbers).
  2. Simple features (edges, corners, blobs).
  3. Bigger features (eyes, wheels, leaves).
  4. A label or decision ("this is a face", "this is a cat").

Each step turns a lot of low-level numbers into a smaller, more meaningful summary.

How face recognition works, step by step

Face recognition is one well-known use of computer vision. It actually involves several separate jobs, and people often mix them up.

Step 1: Detection. First the system must find a face in the image at all. This is face detection: it scans the picture looking for the pattern of a face (two eyes, a nose, a mouth in the usual arrangement). This is the part that draws a box around faces in your camera app.

Step 2: Alignment. Faces appear at different angles and sizes. The system rotates and scales the face so the eyes and mouth are in standard positions. This makes the next step fairer and more reliable.

Step 3: Measuring features. Now the system measures the face. It does not store a photo. Instead it produces a long list of numbers that describe the face's features, such as the spacing of the eyes, the shape of the cheekbones, and many subtler things a neural network has learned. This list is called a faceprint or embedding.

Step 4: Matching. To recognise someone, the system compares the new faceprint to faceprints it already has stored. If two faceprints are similar enough, within a chosen limit called a threshold, it declares a match.

Notice the phrase "similar enough". Face recognition never finds a perfect, exact match, because no two photos are identical. It works on how close the numbers are. Whoever builds the system has to choose how close counts as a match, and that choice changes everything.

The threshold trade-off

Choosing the threshold is a balancing act between two kinds of mistakes:

  • A false match (false positive): the system says two different people are the same. Imagine being wrongly identified as someone who committed a crime.
  • A false reject (false negative): the system fails to recognise the right person. Imagine your phone refusing to unlock for you.

If you make matching very strict, you get fewer false matches but more false rejects. If you make it loose, you get the opposite. There is no setting that removes both errors. Every face recognition system lives somewhere on this trade-off, and the right place depends on how serious each mistake is.

Honest limits and real harms

Computer vision is genuinely useful, for sorting photos, helping blind people read signs, and spotting tumours in medical scans. But it has real limits, and pretending otherwise is dangerous.

It does not understand. The system matches patterns of numbers. It has no idea what a face means, who a person is, or why any of it matters. That is why it can be fooled by an unusual photo, a mask, or even a printed picture.

It can be biased. A system learns from its training data. If most of the example faces are of one group of people, the system becomes more accurate for that group and less accurate for others. Real tests have shown higher error rates for women and for people with darker skin in several systems. This is not the computer being unfair on purpose; it is learning from unbalanced examples. The deeper problem is explained in Training Data and Bias.

It raises privacy questions. A faceprint can identify you in a crowd, often without you knowing or agreeing. Unlike a password, you cannot change your face if your faceprint is stolen. This is why many people argue that face recognition needs strong rules, especially when used by shops, schools or the police. The wider question of using these tools well is covered in Using AI Safely and Responsibly.

Using it wisely

Because these systems make mistakes and can be unfair, a sensible rule is simple: never let face recognition make an important decision on its own. A human should check anything that affects someone's safety, money or freedom. Used as a helpful tool with human oversight, computer vision can do a lot of good. Used as an unquestioned judge, it can cause real harm.

Going further

If this topic interests you, the best next step is to understand the engine underneath: Neural Networks Explained shows how machines learn visual patterns, and Coding lets you start building simple image programs of your own.

Quick quiz

Test yourself and earn XP

What is the first thing computer vision does with a photo?

What is a 'faceprint'?

How does the system decide two faces match?

Why can face recognition be unfair?

What is a 'false match'?

FAQ

Usually it stores a faceprint, a list of numbers describing your features, rather than the photo itself. But that faceprint is still very personal. It can identify you, it is hard to change if it leaks, and some systems do keep the original images too. That is why how it is stored and protected matters a lot.

Yes. No system is perfect. It can make a false match (saying you are someone else) or a false reject (failing to recognise you). Studies have shown error rates can be higher for some groups, especially women and people with darker skin, when training data is unbalanced. This is why important decisions should never rely on face recognition alone.