RapidEarth

A Geospatial Searchengine

Data exploration and analysis in various domains often necessitate the search for specific objects in massive databases. A common search strategy, often known as search-by-classification, resorts to training machine learning models on small sets of positive and negative samples and to performing inference on the entire database to discover additional objects of interest. While such an approach often yields very good results in terms of classification performance, the entire database usually needs to be scanned, a process that can easily take several hours even for medium-sized data catalogs. In this work, we present RapidEarth, a geospatial search-by-classification engine that allows analysts to rapidly search for interesting objects in very large data collections of satellite imagery in a matter of seconds, without the need to scan the entire data catalog. RapidEarth embodies a co-design of multidimensional indexing structures and decision branches, a recently proposed variant of classical decision trees. These decision branches allow RapidEarth to transform the inference phase into a set of range queries, which can be efficiently processed by leveraging the forementioned multidimensional indexing structures. The main contribution of this work is a geospatial search engine that implements these technical findings.

Visual search interface of RapidEarth. Left: User query is defined by selecting positive (red) and negative (blue) patches that resemble the query set. Right: Visualization of query results, where yellow patches correspond to the found patches that fit the query and green patches are found objects that were also part of the query set.

The results of the entire dataset can be inspected by navigating over the map:

A demo how to use RapidEarth can be found in the following video:

Come on and try it out yourself via the following link.

References

2023

  1. SIGSPATIAL.png
    Rapid-Earth: A Search-by-Classification Engine for Large-Scale Geospatial Imagery
    In Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, SIGSPATIAL (CORE Rank A) , 2023