Help:Best Practices: Difference between revisions

From Ruisdael Observatory Data Catalog
 
(9 intermediate revisions by the same user not shown)
Line 14: Line 14:


== Geographic coordinates ==
== Geographic coordinates ==
For datasets produced by stationary instruments (in-situ, remote sensing) and for single-column model outputs, the latitude and longitude coordinates should be entered with a precision of at least 4 digits.
For datasets produced by stationary instruments (in-situ, remote sensing) and for single-column model outputs, the latitude and longitude coordinates should be entered with a precision of at least 4 digits (approx 11 meters). However, the use 5 or 6 significant digits is recommended.  


For non-stationary in-situ and remote sensing observations, use the description field to indicate the measurement domain. For example, the coordinates from the starting and ending points could be mentioned.
For non-stationary in-situ and remote sensing observations, use the description field to indicate the measurement domain. For example, the coordinates from the starting and ending points could be mentioned.


= Description of different submission types =
== Special Characters & Formatting ==
Use the description field to provide all import information from the new dataset. While writing the Description, try to indicate the data source, the content, and the goal of the data. Try to provide enough information to help users understand the dataset content without downloading the data. Below, you can find a non-exhaustive checklist for the different data types to be used as a starting point.
MediaWiki uses Unicode (UTF-8) for character encoding. This allows for a wide range of characters. For more information about this issue, see https://www.mediawiki.org/wiki/Help:Special_characters


== Checklist for in-situ datasets ==
= Dataset description field =
*What is the instrument's name?
Use the description field to provide key details about the dataset, including its source, content, and purpose. A well-written description allows users to understand the dataset without downloading it and improves discoverability. For clarity, longer descriptions can be structured into sections such as 'Purpose,' 'Measurement Specifications,' and 'Data Processing'. Use the checklists below to ensure all essential information is included.
*What is the instrument's model?
*What is the instrument's Brand?
*What is the data resolution?
*What are the variables listed in the dataset?
*Where is it installed?
*What is it intended for?
*If non-stationary instrument: latitude, longitude, height from starting and ending point.


== Checklist for campaigns and products datasets ==
== Checklist for in-situ and remote sensing datasets ==
*What was the campaign goal?
* sensor name, model and brand
*What are the instruments used?
* location, site name and description of surrounding environment
*What is the duration?
* spatial and temporal resolution of measurements
*What are the variables?
* scanning mode(s) (if applicable)
*What is the data resolution?
* sensor calibration details (if applicable)
*If it is a data product, how was it derived?
* physical variables contained in the dataset
*If any external link to the data processing is available, it can also be listed here.
* purpose of the dataset
* context of the experiment/measurement
* for non-stationary instruments: latitude, longitude, height from starting and ending point.


== Checklist model output submission ==
== Checklist for datasets related to campaigns ==
*What model was used?
*What model version was used, and where can it be found?
*What was the model configuration?
*What is the output resolution?
*Is the output from a single column? So, what are the latitude and longitude coordinates?
*Is the output from a spatial domain? So, what is the domain?


==================================================================
* name and goal of the campaign
* duration of the campaign
* other relevant sensors or models that were used during the campaign
* special operating modes or variable resolutions used
* information on how certain data/products were derived
* link(s) to papers, methods or websites that contain more details


= Original Conttent =
== Checklist for model outputs ==
 
* model type, version and configuration
/This page is under construction. Please come back later..
* model grid size(s), domain size and extent
 
* spatial and temporal resolution(s) of output(s)
== General Guidelines ==
* initial conditions, forcing(s) and data assimilation schemes
* Latitude and longitude coordinates for in-situ and remote sensing datasets should be entered with a precision of at least 4 digits.
* Refrain from using Notes. Put important information directly in the description
* Naming convection for the title: Institute - Sensor - Location
 
== Checklist for in-situ data ==
* dataset description should contain sensor type/brand
* dataset description should contain .. (if applicable)
 
== Checklist for remote-sensing data ==
 
== Checklist for model data ==
* The description should explicitly mention the variables that are provided (e.g., temperature, wind speed, etc..)
 
== Campaign data ==
How to enter campaign data?

Latest revision as of 16:20, 3 April 2025

This page describes best practices for adding new datasets and entering metadata into the catalog. In case you think some information is missing, please indicate it on GitHub Ruisdael Data Catalog project.

General guidelines for all submissions

Naming

The first required field for submitting a new dataset is the name. This name is used to identify the new dataset within the Catalog environment. To name the dataset, please use the following convention: Institute Sensor/Campaign/Product/Model at Location.

 Institute Sensor/Campaign/Product/Model at Location

Examples:

Geographic coordinates

For datasets produced by stationary instruments (in-situ, remote sensing) and for single-column model outputs, the latitude and longitude coordinates should be entered with a precision of at least 4 digits (approx 11 meters). However, the use 5 or 6 significant digits is recommended.

For non-stationary in-situ and remote sensing observations, use the description field to indicate the measurement domain. For example, the coordinates from the starting and ending points could be mentioned.

Special Characters & Formatting

MediaWiki uses Unicode (UTF-8) for character encoding. This allows for a wide range of characters. For more information about this issue, see https://www.mediawiki.org/wiki/Help:Special_characters

Dataset description field

Use the description field to provide key details about the dataset, including its source, content, and purpose. A well-written description allows users to understand the dataset without downloading it and improves discoverability. For clarity, longer descriptions can be structured into sections such as 'Purpose,' 'Measurement Specifications,' and 'Data Processing'. Use the checklists below to ensure all essential information is included.

Checklist for in-situ and remote sensing datasets

  • sensor name, model and brand
  • location, site name and description of surrounding environment
  • spatial and temporal resolution of measurements
  • scanning mode(s) (if applicable)
  • sensor calibration details (if applicable)
  • physical variables contained in the dataset
  • purpose of the dataset
  • context of the experiment/measurement
  • for non-stationary instruments: latitude, longitude, height from starting and ending point.

Checklist for datasets related to campaigns

  • name and goal of the campaign
  • duration of the campaign
  • other relevant sensors or models that were used during the campaign
  • special operating modes or variable resolutions used
  • information on how certain data/products were derived
  • link(s) to papers, methods or websites that contain more details

Checklist for model outputs

  • model type, version and configuration
  • model grid size(s), domain size and extent
  • spatial and temporal resolution(s) of output(s)
  • initial conditions, forcing(s) and data assimilation schemes