Integration of geospatial data based on the application of the JOIN operation of relative algebra

Authors:
1
Kyiv National University of Construction and Architecture

The purpose of this work is to study the integration of sets of core reference and thematic geospatial data based on the JOIN operation of relational algebra and its interaction with geocoding of geospatial features, which is implemented in modern geographic information systems (GIS) and database management systems (hereinafter – DBMS) for the development of the national spatial data infrastructure (hereinafter – NSDI). Method. The research is based on the analysis of the possibilities of applying the theory of geospatial databases and knowledge bases, international and national harmonized standards in the field of Geographic Information/ Geomatics to solve the problem of integration of geospatial data using the operation JOIN relational algebra in object-relational database management systems (OR DBMS). Results. The paper examines the models of the Join operation of relational algebra, which underlie the geocoding of features and the creation of electronic gazetteers, and proves its effectiveness: the Join operation integrates of core reference and thematic geospatial datasets. There is a need to define the required geographic identifiers, which must be present among the attributes of the core reference and thematic geospatial datasets to perform the join. The variety of uses of the Join operation covers all possible cases that arise in their practical application. Thus, the use of the Join operation involves identifying these required geographic identifiers at the geospatial database design stage. In particular, it is expedient to determine mandatory geographical identifiers (codes) of features according to the official national systems of features classification (codification) in the relevant sectoral thematic registers, which are responsible for certain holders of thematic data in accordance with Annex 2 of the Decree of Cabinet of Ministers “The order for the functioning of the national spatial data infrastructure” of May 26, 2021, № 532. Scientific novelty and practical significance. The integration of core reference data and thematic geospatial datasets based on JOIN operation models of relational algebra and their interaction with geocoding of geospatial features is researched, which is implemented in modern GIS and DBMS for the development of national spatial data infrastructure. The research was performed on a set of core reference spatial data, namely: information on administrative-territorial units of the Cherkasy region, including their borders; the data from the statistical bulletin of the socio-economic situation of the Cherkasy region for January 2021 of the Main Department of Statistics in Cherkasy region of the State Statistics Service of Ukraine were selected as thematic data. It has been shown that relational algebra join (JOIN) operations can be used to integrate other thematic geospatial data with core reference data using geographic identifiers that contain these datasets.

  1. Bhattacharya, D., & Painho, M. (2017). Smart cities intelligence system (smacisys) integrating sensor web with spatial data infrastructures (sensdi). ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences, 4, 21-28. https://run.unl.pt/handle/10362/28046
  2. Bui, D. & Glushko, I. (2015). Expansion of the signature of Codd’s relational (table) algebras: current state NaUKMA Research Papers. Computer Science (177), 95-107. (in Ukrainian).
  3. Classifier of objects of administrative-territorial organization of Ukraine. (in Ukrainian). http://www.ukrstat.gov.ua/klasf/st_kls/op_koatuu_2016.htm.
  4. Codd, E. F. (1990). The relational model for database management: version 2. Addison-Wesley Longman Publishing Co., Inc.
  5. Connolly, Thomas, & Caroline Begg (2003). Database. Design, implementation and maintenance. Theory and Practice. Moscow: Williams, 2003. 1436 p., 145-149. https://doi.org/10.1007/978-1-4302-5192-7_2
  6. Decree to the Cabinet of Ministers of Ukraine “On Approval of the Procedure for the Functioning of the National Geospatial Data Infrastructure” No. 532 dated May 26, 2021. https://zakon.rada.gov.ua/laws/show/532-2021-%D0%BF#Text
  7. ESRI’s Geodatabase Website. URL: http://support.esri.com/datamodels (дата звернення: 20.04.2022).
  8. ESRI’s Download Website. URL: http://www.esri.com/data/download/census2000_tigerline/index.html (дата звернення: 20.04.2022).
  9. Franci, F., Lambertini, A., & Bitelli, G. (2014, August). Integration of different geospatial data in urban areas: a case of study. In Second International Conference on Remote Sensing and Geoinformation of the Environment (RSCy2014) (Vol. 9229, p. 92290P). International Society for Optics and Photonics. https://doi.org/10.1117/12.2066614
  10. Gao, D., Jensen, C. S., Snodgrass, R. T., & Soo, M. D. (2005). Join operations in temporal databases. The VLDB Journal, 14(1), 2-29. https://link.springer.com/article/10.1007/s00778-003-0111-3
  11. Geocoding: Longitude and Latitude by Address. URL: https://gisgeography.com/geocoding/ (дата звернення: 20.04.2022).
  12. Geographic information. Spatial referencing by geographic identifiers] (2017). DSTU ISO 19112-2017(ISO 19112:2003, IDT) from 1d October 2019. Kyiv. DP «UkrNDNTs» (in Ukrainian).
  13. Geoportal “Administrative and territorial organization of Ukraine”. (in Ukrainian). http://atu.gki.com.ua/
  14. Glushko, I. (2013). Calculation and extension of signatures of tabular algebras. (PhD dissertation). Available from Taras Shevchenko National University of Kyiv. URL: (in Ukrainian). http://csc.knu.ua/uk/library/dissertations/hlushko.pdf.
  15. Hansen, H. S. (1999, April). Integrating digital maps and administrative registers-Danish experiences. In 21 st Urban Data Management Symposium (pp. 21-23).
  16. How to Geocode in ArcMap. URL:
    https://libraries.mit.edu/files/gis/geocoding.pdf (дата звернення: 20.04.2022).
  17. Karpinsky, Y., & Lazorenko-Hevel, N. (2018). The methods of geospatial data collection for topographic mapping. Modern achievements of geodesic science and industry. (in Ukrainian). http://gki.com.ua/ua/metodi-zbirannja-geoprostorovih-danih-dlja-topografichnogo-kartografuvannja
  18. Karpinskyi, Y., Lazorenko-Hevel, N. (2020). The system model of topographic mapping in the national spatial data infrastructure in Ukraine. Geodesy, Cartography and Aerial Photography, 92, 24–36. https://doi.org/10.23939/istcgcap2020.92.024
  19. Karpinskyi, Y., & Lazorenko-Hevel, N. (2020). Topographic mapping in the National Spatial Data Infrastructure in Ukraine. In E3S Web of Conferences (Vol. 171, p. 02004). EDP Sciences. https://doi.org/10.1051/e3sconf/202017102004
  20. Karpinskyi Y., Lazorenko-Hevel N., Kin D. (2020). INSPIREID implementation in the topographic database of the main state topographic map of Ukraine. Веб ISTCGCAP,  91, 20–27. https://doi.org/10.23939/istcgcap2020.91.020 
  21. Karpinskyi, Y. & Lyashchenko A. (2006). Strategia formuvannia natsionalnoi infrastruktury geoprostorovych danych v Ukraini, (108 p.). Kyiv: NDIGK. (Ser. “Geodesy, cartography, cadastre”) (in Ukrainian).
  22. Law of Ukraine About National Geospatial Data Infrastructure from April 13 2020, № 554-IX (2020). Vidomosti Verkhovnoi Rady Ukrainy. Bulletin of Verkhovna Rada of Ukraine. (in Ukrainian).
  23. Lazorenko-Hevel N. (2021). Geographic identifiers as a basis for integration of geospatial data. Mistobuduvannya ta terytorialʹne planuvannya, (78), 312-326. (in Ukrainian).  https://doi.org/10.32347/2076-815x.2021.78.312-326
  24. Lemenkova, P. (2020). Integration of geospatial data for mapping variation of sediment thickness in the North Sea. Scientific Annals of the Danube Delta Institute25, 129-138. https://doi.org/10.7427/DDI.25.14
  25. Lyashchenko, A. & Cherin, A. (2019). Basic models and methods of geospatial data integration in GIS of urban-planning cadastre. Mistobuduvannya ta terytorialʹne planuvannya, (70), 351-365. (in Ukrainian). http://repositary.knuba.edu.ua//handle/987654321/6199
  26. Lyashchenko, A., Havryliuk, Y., & Smilka, V. (2020). Analysis of methods of unique identification of objects in geospatial data sets. Mistobuduvannya ta terytorialʹne planuvannya, (75), 217-232. http://repositary.knuba.edu.ua//handle/987654321/9512 https://doi.org/10.32347/2076-815x.2020.75.217-232
  27. Lyashchenko, A., Karpinskyi Y., Havryliuk, Y. & Cherin, A. (2021). Methods and means of ensuring the interoperability of the components of the national geospatial data infrastructure. Mistobuduvannya ta terytorialʹne planuvannya, (77), 309-319. (in Ukrainian). https://doi.org/10.32347/2076-815x.2021.77.309-319
  28. Mardani, M., Mardani, H., De Simone, L., Varas, S., Kita, N., & Saito, T. (2019). Integration of machine learning and open access geospatial data for land cover mapping. Remote Sensing11(16), 1907. https://doi.org/10.3390/rs11161907
  29. Maksymova Y. (2016). Creating a database of electronic catalog of object classes for sets of profile geospatial data of urban planning documentation. Mistobuduvannya ta terytorialʹne planuvannya, (62 (1)), 367-376. (in Ukrainian). https://repositary.knuba.edu.ua/bitstream/handle/987654321/6932/62a-368-...
  30. Order of the Ministry of Agrarian Policy and Food of Ukraine «On approval of technical requirements for geospatial data, metadata and geoinformation services of the national geospatial data infrastructure» from10.11.2021 № 345. (in Ukrainian).
  31. Pilicheva, M., Kin, D., & Pomortseva, O. (2018). Integration of topographical and cadastral data of the basic dataset of a land parcel. Mistobuduvannya ta terytorialʹne planuvannya, (66), 523-531. (in Ukrainian).
  32. Resolution of the Cabinet of Ministers of Ukraine “On approval of the Order for national topographic and thematic mapping” from 04.09.2013 № 661. (in Ukrainian).
  33. Rayordan, R.  (2001) Relational database fundamentals. М.: Publishing house “Russian edition”.
  34. Shypulin, V. (2021). Integrated real estate information system. Concept for Ukraine: monograph. O. M. Beketov National University of Urban Economy in Kharkiv. (in Ukrainian). http://eprints.kname.edu.ua/57436/
  35. Silberschatz, A., Korth, H. F., & Sudarshan, S. (2002). Database system concepts (Vol. 5). New York: McGraw-Hill. 1376 p. https://snscourseware.org/snsctnew/files/1581236100.pdf
  36. Stankevich, S., Titarenko, O., & Golubov, S. (2021). Mathematical model of integration of heterogeneous data in assessing the oil and gas prospects of territories. Kherson –2021, 86. (in Ukrainian). https://doi.org/10.32782/KNTU2618-0340/2021.4.2.1.23
  37. Statistical bulletin “Socio-economic situation of Cherkasy region”. Main Department of Statistics in Cherkasy Oblast. (2021). (in Ukrainian). http://www.ck.ukrstat.gov.ua/?p=bul_soc_ek.
  38. Sun, K., Zhu, Y., Pan, P., Hou, Z., Wang, D., Li, W., & Song, J. (2019). Geospatial data ontology: the semantic foundation of geospatial data integration and sharing. Big Earth Data3(3), 269-296. https://doi.org/10.1080/20964471.2019.1661662
  39. The National Standard of Ukraine DSTU 8774:2018 “Geographic information. Geospatial data modeling rules”. (in Ukrainian). http://gki.com.ua/ua/prinjatonacionalni-standart-ukraiini-dstu-87742018-geografichna-informacija-pravila-modeljuvannjageoprostorovih-danih