GEOSPATIAL DATA STREAM PROCESSING IN PYTHON USING FOSS4G COMPONENTS
Abstract. One viewpoint of current and future IT systems holds that there is an increase in the scale and velocity at which data are acquired and analysed from heterogeneous, dynamic sources. In the earth observation and geoinformatics domains, this process is driven by the increase in the number and types of devices that report location and by the proliferation of assorted sensors, from satellite constellations to oceanic buoy arrays. Many of these data will be encountered as self-contained messages on data streams: continuous, infinite flows of data. Spatial analytics over data streams concerns the search for spatial and spatio-temporal relationships within and amongst data “on the move”. In spatial databases, queries can assess a store of data to unpack spatial relationships; this is not the case on streams, where spatial relationships need to be established with the incomplete data available. Methods for spatially based indexing, filtering, joining and transforming of streaming data need to be established and implemented in software components. This article describes the usage patterns and performance metrics of a number of well-known FOSS4G Python software libraries within the data stream processing paradigm. In particular, we consider the RTree library for spatial indexing, the Shapely library for geometric processing and transformation, and the PyProj library for projection and geodesic calculations over streams of geospatial data. We introduce a message-oriented, Python-based geospatial data streaming framework called Swordfish, which provides data stream processing primitives, functions, transports and a common data model for describing messages, based on the Open Geospatial Consortium Observations and Measurements (O&M) and Unidata Common Data Model (CDM) standards. We illustrate how the geospatial software components are integrated with the Swordfish framework.
Furthermore, we describe the tight temporal constraints under which geospatial functionality can be invoked when processing high-velocity, potentially infinite geospatial data streams. The article discusses the performance of these libraries under simulated streaming loads (size, complexity and volume of messages) and how they can be deployed and utilised with Swordfish under real load scenarios, illustrated by a set of Vessel Automatic Identification System (AIS) use cases. We conclude that the described software libraries perform adequately under geospatial data stream processing scenarios: many real application use cases will be handled sufficiently by the software.
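The stream operations named in the abstract (spatial filtering, and joining over the incomplete data available at any moment) can be illustrated with a minimal sketch. It uses only the Python standard library and is not the Swordfish API, which the abstract does not specify. The message shape, the function names, the bounding-box test (standing in for a Shapely geometry predicate), the spherical haversine distance (standing in for a PyProj geodesic calculation) and the fixed-size window (standing in for an RTree-backed index) are all assumptions made for illustration.

```python
import math
from collections import deque
from typing import Any, Dict, Iterable, Iterator, Tuple

# Hypothetical message shape (not from the article):
# {"id": str, "lon": float, "lat": float}
Message = Dict[str, Any]
BBox = Tuple[float, float, float, float]  # (min_lon, min_lat, max_lon, max_lat)


def spatial_filter(stream: Iterable[Message], bbox: BBox) -> Iterator[Message]:
    """Yield only messages whose position falls inside the bounding box."""
    min_lon, min_lat, max_lon, max_lat = bbox
    for msg in stream:
        if min_lon <= msg["lon"] <= max_lon and min_lat <= msg["lat"] <= max_lat:
            yield msg


def haversine_m(lon1: float, lat1: float, lon2: float, lat2: float,
                radius_m: float = 6371008.8) -> float:
    """Great-circle distance in metres on a sphere (approximation only)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_m * math.asin(math.sqrt(a))


def proximity_pairs(stream: Iterable[Message], max_dist_m: float,
                    window: int = 100) -> Iterator[Tuple[str, str]]:
    """Windowed self-join: pair each message with recent nearby messages.

    Only the last `window` messages are retained, so relationships are
    established from the partial data seen so far, never the whole stream.
    """
    recent: deque = deque(maxlen=window)
    for msg in stream:
        for other in recent:
            if other["id"] == msg["id"]:
                continue  # skip earlier reports from the same source
            d = haversine_m(other["lon"], other["lat"], msg["lon"], msg["lat"])
            if d <= max_dist_m:
                yield (other["id"], msg["id"])
        recent.append(msg)


# Usage with made-up positions: filter to an area, then find close pairs.
sample = [
    {"id": "a", "lon": 18.400, "lat": -33.90},
    {"id": "b", "lon": 18.401, "lat": -33.90},
    {"id": "c", "lon": 25.000, "lat": -30.00},
]
in_area = spatial_filter(iter(sample), (18.0, -34.0, 19.0, -33.0))
close_pairs = list(proximity_pairs(in_area, max_dist_m=500.0))
```

Because both stages are generators, each message is processed as it arrives and memory use is bounded by the window size. The linear scan over the window is the part a spatial index would replace when the window is large.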

