Home Projects Semantic Segmentation in Visual Scene Matching

Semantic Segmentation in Visual Scene Matching

Artificial Intelligence & Data Science

Project Details

Engineering Expertise
- AI & Data Science
- Machine Learning

The Challenge

Visual Scene Matching (VSM) is the process of matching a newly captured image with a database of geotagged reference images to find the best match. This technique enables the construction of a track over time using the information from the best matches, which can be utilised as an Alternative Navigation System. Traditional algorithms that rely on features may face challenges when the scene has changed over time, such as seasonal variations like changes in vegetation or snow, as well as the presence of temporary elements like cars and pedestrians. These temporary features can significantly impact the performance of traditional algorithms.

The Approach

To address these challenges, a Semantic Segmentation algorithm was employed to predict the class of every pixel in the image, allowing the identification of temporary or changing features in the scene. This information was then used to guide feature extraction in specific regions of the scene. The features were extracted using Scale-Invariant Feature Transform (SIFT) within these regions and fed into a Vocabulary Tree algorithm for Visual Scene Matching. Various experiments were conducted to optimise the handling of temporary and changing features, such as removing features from these regions or applying blurring before feature extraction to reduce the extracted features. Additionally, experiments involved assigning different weights to features based on the class they were extracted from, such as assigning higher weights to features from signs and lower weights to features from trees.

The Outcome

By incorporating Semantic Segmentation to enhance the performance of the Vocabulary Tree algorithm, there was a notable 5.4%-point increase in image retrieval accuracy (from 78.4% to 83.3%), while still maintaining the interpretability of the traditional algorithm. This success has paved the way for a subsequent phase involving data fusion utilising a Particle Filter to generate trajectories using a larger dataset.

“Integrating Semantic Segmentation technology into the existing Visual Scene Matching process significantly improved the accuracy of image retrieval results. This advancement is crucial for industries where precise navigation and tracking are essential, such as autonomous vehicles, surveillance systems, and geographic information systems (GIS).”
Josip Rozman, Senior Consultant

The ability to adapt to changing environments and handle temporary scene variations is a key advantage of this approach, offering solutions for real-world applications requiring reliable and robust image matching capabilities.

Related Technical Papers

View All

Non-Invasive Auditory Sensing with Affordable Headphones

This paper presents a sensor for measuring auditory brainstem responses to help diagnose hearing problems away from specialist clinical settings using non-invasive electrodes and commercially available headphones. The challenge of reliably measuring low level electronic signals in the presence of significant noise is addressed via a precision analog processing circuit which includes a novel impedance measurement approach to verify good electrode contact. Results are presented showing that the new sensor was able to reliably sense auditory brainstem responses using noninvasive electrodes, even at lower stimuli levels.

GPU Computing

Power limits restrict CPU speeds, but GPUs offer a solution for faster computing. Initially designed for graphics, GPUs now handle general computing, thanks to advancements by NVIDIA, AMD, and Intel. With hundreds of cores, GPUs significantly outperform CPUs in parallel processing tasks. Modern supercomputers, like Titan, utilize thousands of GPU cores for immense speed. NVIDIA’s CUDA platform simplifies GPU programming, making it accessible for parallel tasks. While GPUs excel in parallelizable problems, they can be limited by data transfer rates and algorithm design. NVIDIA’s Tesla GPUs provide high performance in both single and double precision calculations. Additionally, embedded GPUs like the NVIDIA Jetson TX2 deliver powerful, low-power computing for specialized applications. Overall, GPUs offer superior speed and efficiency for parallel tasks compared to CPUs.

Contact Plextek | Plextek employees check their contact emails on a tablet

Contact Plextek | Employees check their contact emails on a tablet

Got a project in mind?

Let’s talk

If you have got a project to discuss, or even just an idea, let's talk

What we offer

Building Innovation and Transformation Capability

Developing Innovation Strategy

Digital Transformation

Ideation & Concept Creation

Transformative Sustainability

Configurable mmWave Radar Module

Technology Platforms

Ubiquitous Radar

Configurable IoT Framework

Defence

Markets

Satellites & Space

Security

Industrial

Consumer

Healthtech

How we work

Impact

Projects

Articles

Insights

Resources

About Us

Life at Plextek

About Us

Our Facilities

Sustainability

Quality Assurance & Accreditations

Current Vacancies

Early Careers & Summer Opportunities

Semantic Segmentation in Visual Scene Matching

Project Details

The Challenge

The Approach​

The Outcome

Read next

Enhancing network efficiency with ML & RL

Optimising Radar Training with Machine-Assisted Machine Learning

Machine Learning for Rapid Propagation Assessment

Future Sensing: Improving Mobile Ad-hoc Networks

Armour Integrity Monitoring System (AIMS)

mmWave Radar for Foreign Object Debris Detection

Surveillance Radar for Comprehensive Threat Detection

Immersive Technology for Complex Systems Training

Distributed Real Time Spectrum Monitoring

Intelligent Mobility

Telehealth Innovation – Connecting Patients to their Carers

Drone Inspection of Solar Farms