GISBox

HDF (Hierarchical Data Format)

GISBox is a one-stop 3D GIS data editing, conversion and publishing platform that supports editing in multiple GIS formats such as OSGB/GEOTIFF/RVT, converting to 3DTiles/Terrain and publishing.

Introduction

HDF (Hierarchical Data Format) is a file format used to store and manage large scientific data sets. It is a container for multidimensional arrays and complex data structures, and is widely used in science, engineering, and research. It can store many types of data, including numerical values, images, audio, text, etc. It supports hierarchical structures, allowing data sets to be stored and accessed in a multi-level organizational manner, facilitating data organization, retrieval, and analysis.

Data Format

HDF files use a hierarchical data management structure, which consists of a directory and a data object set. Various information can be directly obtained from nested files through the overall directory structure.

Pros

1. Self-describing: HDF files are self-describing. Each data object in the file contains comprehensive information about the data (metadata), which allows applications to interpret the structure and content of HDF files without external information.

2. Diversity: HDF files can contain multiple types of data, such as raster image data, scientific data sets, information description data, etc. This data structure facilitates information extraction.

3. Flexibility: HDF allows users to group related data objects together into a hierarchical structure and add descriptions and tags to data objects. At the same time, users can also store scientific data in multiple HDF files.

Cons

1. Access latency: Although HDF files perform well in storing and distributing scientific data, in some cases, it may not be suitable for application scenarios that require low-latency data access.

2. Small file storage: For a large number of small files, HDF may not be the best choice because the NameNode stores the metadata of the file system in memory, and the storage capacity of the file system is limited by the memory capacity of the NameNode.

3. Write restrictions: HDFS (Hadoop Distributed File System, a distributed file system) is similar to HDF in some aspects, but HDFS does not support concurrent writes and random file modifications, which may limit some application scenarios. However, it should be noted that this shortcoming is more for HDFS rather than the HDF file itself, because HDF files do not directly involve the underlying distributed file system implementation. But in actual applications, if you need to modify data frequently or write concurrently, you may need to consider other more suitable file formats or storage systems.

Application Scenario

HDF is suitable for storing and managing large-scale, complex scientific data, especially in scientific research, data analysis and visualization, and is widely used in fields such as astronomy, earth science, biomedicine, meteorology, etc. It is used to store and share large-scale scientific data sets.

Example

  1. HDF file format example.

  1. Meteorological data in HDF format.

File Opening Mode

  1. Open the HDF file in HDFView.

Related GIS files

HDF

GRIB

MDB

TAB

References

  1. https://en.wikipedia.org/wiki/Hierarchical_Data_Format
  2. https://fileinfo.com/extension/hdf
  3. https://www.neonscience.org/resources/learning-hub/tutorials/about-hdf5