Please note! The experiments in this repository should be considered to be in beta. Significant portions of these experiments are subject to change without warning (although they remain accessible through the repo's history). No part of this code should be considered stable.
We want to show what an automated driving computer sees at any time and how it reacts. This understanding will bridge the mental gap of delegating the driving task from human to machine, accelerating the adoption of self-driving technologies.
- Improve the driving experience by organizing information overload so the driver can focus easily on the task
- Facilitate and accelerate self-driving adoption by building trust
- Use the car fleet already deployed worldwide to accelerate market reach
Self-driving is a strategic entry point to spark interest in EVs (Electric Vehicles). It's possible to use the millions of cars already deployed to develop trust in the underlying tech, easing a quicker transition to a healthier way of transporting ourselves while bringing the associated costs down quickly.
The driver will be augmented by real-time information coming from different sources, perceived by the system and then fed to her. Thus enhanced, and under less stress, she can dedicate more energy to the task at hand.
Feeding will be accomplished by mixing visual and audio cues, hints, and suggestions translated from events inferred in the real world. The person will be able to focus on the main task without scattering their attention, while still getting all the information from their surroundings in an organized and perceivable fashion. Trust in the automated system is built through repeated validation, comparing the fed results against the real world.
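As a rough illustration of how such a feed could be organized, the sketch below maps hypothetical inferred events to visual and audio cues, dropping low-confidence ones to avoid overloading the driver. All names, event kinds, and mappings here are assumptions for illustration, not part of the current code.

```python
from dataclasses import dataclass

@dataclass
class InferredEvent:
    """A real-world event detected by the perception layer (hypothetical)."""
    kind: str         # e.g. "speed_limit", "pedestrian_crossing"
    value: str        # e.g. "60", "ahead-left"
    confidence: float

# Hypothetical mapping from event kinds to the cues shown/played to the driver.
CUE_TABLE = {
    "speed_limit": ("icon:speed_limit", "chime:soft"),
    "pedestrian_crossing": ("highlight:crosswalk", "voice:'pedestrian ahead'"),
    "light_about_to_turn_red": ("icon:traffic_light", "voice:'prepare to stop'"),
}

def events_to_cues(events, min_confidence=0.6):
    """Translate inferred events into (visual, audio, value) cues, skipping uncertain ones."""
    cues = []
    for event in events:
        if event.confidence < min_confidence:
            continue  # avoid feeding the driver uncertain information
        visual, audio = CUE_TABLE.get(event.kind, ("icon:generic", None))
        cues.append((visual, audio, event.value))
    return cues

if __name__ == "__main__":
    demo = [InferredEvent("speed_limit", "60", 0.92),
            InferredEvent("pedestrian_crossing", "ahead-left", 0.4)]
    print(events_to_cues(demo))  # only the high-confidence event produces a cue
```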
We'll get there eventually, but it'll take baby steps (sources) to do so. The first source will be Traffic Sign Recognition. For it, we'll try out a hypothesis about mimicking our species' learning and inference processes. If such an approach works, it would be a leap forward for the AGI (Artificial General Intelligence) field, significantly reducing the time and energy needed for learning and inference in some scenarios or tasks.
We humans need only a sample or two of an image (maybe a lousy sketch will do) to learn a new object. Then we're able to recognize it in the wild with little to no effort, after a brief consolidation period that usually consists of light repetition to reinforce the association, plus some trial and error. A rough code sketch of this one-shot idea follows the list below.
- Learn from one (or just a few) sketches or samples
- Learn progressively through trial and error
- Minimal energy consumption when recognizing learned objects
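One simple way to frame this "learn from a sketch or two" idea in code is prototype matching: keep one averaged feature vector per learned object and recognize new views by nearest prototype. The sketch below is an assumption-heavy illustration (the `embed()` function and `PrototypeRecognizer` class are stand-ins, and the crude histogram embedding is only a placeholder); it is not the repository's implementation.

```python
import numpy as np

def embed(image: np.ndarray) -> np.ndarray:
    """Hypothetical feature extractor; in practice this could be a pretrained CNN
    or a handcrafted descriptor (color histogram + shape moments)."""
    # Placeholder: a normalized intensity histogram as a crude, cheap embedding.
    hist, _ = np.histogram(image, bins=32, range=(0, 255))
    return hist / (hist.sum() + 1e-9)

class PrototypeRecognizer:
    """Learn each object from one (or a few) samples; recognize by nearest prototype."""

    def __init__(self):
        self.prototypes = {}  # label -> averaged embedding

    def learn(self, label: str, samples):
        """One- or few-shot learning: average the embeddings of the given samples."""
        vectors = [embed(s) for s in samples]
        self.prototypes[label] = np.mean(vectors, axis=0)

    def recognize(self, image, threshold=0.2):
        """Return the closest learned label, or None if nothing is close enough."""
        query = embed(image)
        best_label, best_dist = None, float("inf")
        for label, proto in self.prototypes.items():
            dist = np.linalg.norm(query - proto)
            if dist < best_dist:
                best_label, best_dist = label, dist
        return best_label if best_dist < threshold else None
```

Recognition here is just a nearest-neighbor lookup over a handful of stored vectors, which keeps the "minimal energy at inference time" property in the list above.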
We map objects with three-dimensionality in mind. People have the ability to learn from a 2D sketch and recognize a real 3D instance of the object (e.g. a cow) when it's seen (e.g. at the zoo). Laws of physics, like gravity, are already internalized in our brains.
This looks like an interesting addition: the ability to abstract objects from their context so they can be recognized in isolation. If context can be removed (and focusing could be a killer feature here), recognizing a learned object in whatever context it appears can become a no-brainer.
On second thought, our ability to recognize objects from photos or drawings, which are flattened (2D) versions of their original form, indicates that learned context may be enough to compensate for stereoscopic vision where it's not available.
We can recognize a plant or a flower without knowing the species. Maybe Light Learning maps to Generalized or Simple, while Deep Learning maps to Specialized, and they're complementary.
Describe the target from hard attributes, and complement that with an image or two. This technique aims to replace learning from many examples in favor of deriving from what was already learned (in a human-like fashion).
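As a rough sketch, such a target description could be captured in a small structure pairing hard attributes with one or two reference images. All field names and the example file path below are hypothetical, chosen only to illustrate the idea.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ObjectDescription:
    """A target described by hard attributes plus one or two reference images."""
    name: str
    shape: str                  # e.g. "octagon", "circle", "triangle"
    dominant_colors: List[str]  # e.g. ["red", "white"]
    orientation: str = "upright"
    reference_images: List[str] = field(default_factory=list)  # 1-2 sample/sketch files

# Example: a STOP sign described the way a driving manual would present it.
stop_sign = ObjectDescription(
    name="stop",
    shape="octagon",
    dominant_colors=["red", "white"],
    reference_images=["samples/stop_pictogram.png"],  # hypothetical path
)
```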
This is a dynamic source list containing the high-level topics that might be explored.
- Traffic Sign Recognition
- Abide by regulations while driving
- Anticipate actions (e.g. lights about to turn red)
- Surroundings (e.g. crossing pedestrians)
- Short, mid, and long term (e.g. heading towards a visible twister, speed bump ahead warning)
To test our hypothesis, we'll pick this challenge. At a high level, we plan to learn from a sample or two plus attributes [elaborate-here] (color, shape, disposition). At inference time, coupling these with general cues such as context, candidate abstraction (segmentation + stereoscopy), and graphing may provide enough evidence to take an informed guess. It's a pyramidal guessing approach whose phases are shared; a rough sketch of the pipeline follows the table below.
| Phase | Cues | |
| --- | --- | --- |
| Learning | Sample(s) | Attributes (dominant colors, shape, orientation) |
| Inferencing (layer #1) | Graphing | Abstraction (segmentation or stereoscopy) |
| Inferencing (layer #2) | Context (3-dimensionalizing, physics laws) | |
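To make the phases above concrete, here is a high-level, hedged sketch of how such a pyramidal pipeline could be wired together. Every function is a placeholder for work that hasn't been built yet, and the scoring is intentionally naive.

```python
def learn(samples, attributes):
    """Phase: Learning. Combine one or two samples with hard attributes into a model."""
    return {"samples": samples, "attributes": attributes}

def abstract_candidates(frame):
    """Inference layer #1: isolate candidate regions (e.g. via segmentation or stereoscopy).
    Placeholder: treat the whole frame as a single candidate."""
    return [frame]

def graph_candidates(candidates):
    """Inference layer #1: relate candidates to each other (relative position, size)."""
    return [{"region": c, "relations": []} for c in candidates]

def apply_context(scored, scene_context):
    """Inference layer #2: use context (3D cues, physics priors) to re-weight guesses."""
    for guess in scored:
        guess["score"] *= scene_context.get("plausibility", 1.0)
    return scored

def pyramidal_guess(frame, model, scene_context):
    """End-to-end informed guess: abstraction -> graphing -> context -> best candidate."""
    candidates = abstract_candidates(frame)
    graphed = graph_candidates(candidates)
    scored = [{"candidate": g, "score": 0.5} for g in graphed]  # crude base score
    scored = apply_context(scored, scene_context)
    return max(scored, key=lambda g: g["score"]) if scored else None
```

The design point is that each layer only refines the guesses produced by the previous one, so cheaper evidence is used first and context is applied last.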
From the sample pictogram (one that a human could use as a source of truth for learning, e.g. in the driving manual provided by your local institution), we may extract attributes that are useful for learning and for later recognizing the sign in the complex real-world imagery that will be fed in. These attributes are: shape, primary color, orientation, compound shapes (e.g. Stop All-way, Ramp Max Speed), and relative size when signs are related (e.g. min and max speed limits on a highway).
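For a clean pictogram, attributes like shape and primary color could plausibly be extracted with a few lines of OpenCV. The sketch below (assuming OpenCV >= 4 and a pictogram on a light background) is illustrative only, not the project's actual pipeline.

```python
import cv2
import numpy as np

def pictogram_attributes(path: str) -> dict:
    """Extract rough attributes (shape, primary color) from a clean sign pictogram."""
    image = cv2.imread(path)  # BGR image
    if image is None:
        raise FileNotFoundError(path)

    # Primary color: most frequent hue over reasonably saturated pixels.
    hsv = cv2.cvtColor(image, cv2.COLOR_BGR2HSV)
    saturated = hsv[hsv[:, :, 1] > 60]  # ignore white/gray background pixels
    hue_hist = np.bincount(saturated[:, 0], minlength=180) if saturated.size else np.zeros(180)
    primary_hue = int(hue_hist.argmax())

    # Shape: approximate the largest external contour and count its vertices.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
    contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    shape = "unknown"
    if contours:
        largest = max(contours, key=cv2.contourArea)
        approx = cv2.approxPolyDP(largest, 0.02 * cv2.arcLength(largest, True), True)
        shape = {3: "triangle", 4: "rectangle", 8: "octagon"}.get(len(approx), "circle-ish")

    return {"primary_hue": primary_hue, "shape": shape}
```

Compound shapes and relative size would need an extra pass over multiple detected contours, which is left out of this sketch.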
https://www.theverge.com/2025/1/7/24335460/bmw-ces-2025-idrive-heads-up-display-ar