Help about search

Every search action implies the execution of a query on the whole set of Dat@OSU's records.

A query to Dat@OSU is interpreted as a logical expression. This query may be formulated in the search bar thanks to a query language, presented below.

All records satisfying the query condition are displayed. It is then possible to refine the query thanks to facet search (accessible on the left) by checking further criteria which will be added to the initial query.

Starting a search

Full text search: the condition is simply composed of one or several words. These words are searched in all fields of Dat@OSU's records.

Selector search: a selector may be used to search a word in a definite field. These fields correspond to elements of Dat@OSU's metadata profile.

Multi-criteria search: full text or selector search criteria may be combined using logical operators.

Using logical operators

The “and” operator is indicated by the “&” symbol. This operator is used by default when no operator is specified. As in mathematics, the “and” operator has priority upon the “or” operator.

The “or” operator is indicated by the “;” and “|” symbols (the latter is the “pipe” symbol, not the uppercase “i” character).

The “not” operator is indicated by the “!” symbol. It allows negating a condition. For example, specifying “astronomy” returns all records with the word “astronomy”in a field; specifying “!astronomy” returns all records without the word “astronomy” in any of their fields.

Parenthesis may be used to constrain the order of operations with the same conventions than in mathematics.

Full text search

It is possible to search for one or several words in the whole set of fields of the data base. The search is performed independently of the word number (singular or plural) for words which have a plural with an -s suffix. When results of the query are displayed, exact occurrences are highlighted using a different colour for each searched word.

Example of full text search. The words you search for are highlighted.

Selectors

A selector can be used to search for a word in a given field. These fields correspond to elements in the Dat@OSU's metadata profile.

The format of the query is: selector_name:value

It is a common use to surround the value with quotes to avoid the use of escape characters. Some among the selectors have shorter-form synonyms.

access

Selection on data access type.

Possible values are “open” or “restricted” when one needs to get in touch with the contact person to possibly gain access to the data.

audience

Returns all records with the specified audience level.

Possible values are: “General”, “Elementary Education”, “Middle school Education”, “Community College”, “University: licence”, “University: master”, “Research”, “Stakeholder”, “Policy maker”, “Informal Education”, “Amateur”.

collection

Returns all records contained in the specified collection; the exact name of the collection must be specified. When the name of the collection contains spaces or special characters, it must be surrounded by quotes or escape characters must be used (see below).

contributor

(synonym: contrib)

Selects records one of the contributors of which corresponds to the specified name. It can be a person or a structure, indicated as creator, contributor or contact. As for persons, names can be supplied under the form “first_name surname”, “surname first_name”, “surname” or “first_name”. As for structures, either the complete name or the acronym can be used.

createdsince

(synonym: created)

Selects records which have been created since the specified date. The date format is “year-month-day”, for instance “2017-04-01” for April 1st 2017. To search for records with a creation date older than a given date, it is possible to use the “!” operator to reverse the selection.

creator

(synonyms: crea, créa)

Selects records one of the creators of which corresponds to the specified name. It can be a person or a structure, indicated as creator. As for persons, names can be supplied under the form “first_name surname”, “surname first_name”, “surname” or “first_name”. As for structures, either the complete name or the acronym can be used.

discipline

(synonym: dis)

Selects records whose one of the associated disciplines corresponds to the specified name (list stemming from OST's list of disciplines).

  • discipline:ecology
    The records for which one of the disciplines is ecology.

domain

(synonym: dom)

Selects records for which the grand discipline domain corresponds to the specified name from OST's list of disciplines: “fundamental biology", "medical research", "applied biology - ecology", "chemistry", "physics", "sciences of the universe", "engineering science", "mathematics", "humanities", "social science", "others, unassigned".

issuedafter

(synonym: issued)

Selects records relevant to data which have been published since the specified date. Format is year-month-day (e.g. 2017-04-01 for April 1st 2017). To search records updated before a specified date, use the “negation” (“!”) operator to reverse the selection.

  • issuedafter:2017-01-15
    The records about data which have published since January 15th 2017
  • !issuedafter:2017-01-15
    The records about data which have published before January 15th 2017

keyword

(synonym: kw)

Selects records which have the specified keyword.

In case of homonymous keywords in several thesaurus, the search of a keyword in a given thesaurus is performed is specifying the name of the keyword followed by ":", followed by the name of the thesaurus (the value can be 'AGROVOC”, “Catalogue of Life”, “CCDC”, “EnvThes”, “GEMET”, “InChI”, “MeSH”, “NASA Thesaurus”, “Open Data Handbook”, “PACTOLS”, “PhySH”, “TAXREF”, “TGN”, “UAT”, “Wikimonde”).

  • kw:comets
    The records whose one of the keywords is "comets"
  • kw:dust:GEMET
    The records whose one of the keywords is "dust" in the GEMET thesaurus
  • kw:dust:UAT
    The records whose one of the keywords is "dust" in the UAT thesaurus

label

Selects records corresponding to the specified label.

  • label:ZAAJ
    The records which have label "Zone Atelier Arc Jurassien" (ZAAJ)

lang

Selects the records according to the language of the data. It is possible to either use the full name of the language or the ISO 639-2 code (fre, eng …).

  • lang:french
    The records for which the data are in French.

method

Selects the records relevant to data obtained with the specified method. Values can be: "Observational data", "Derived or compiled data", "Experimental data", "Simulation or computational data", "Reference data".

  • method:"Reference data"
    Records about reference data (notice the use of quotes which surround the search value to account for the space character)
  • method:Reference\ data
    Records about reference data (same as above, but without the use of quotes, escaping the space character the backslash).

obsolete

Selects all records with the specified level of obsolescence. To select records about obsolete data, it is possible to specify "yes", "oui", "y", "o", "obsolete", "obsolète" or "1" as value. Similarly, it is possible to use "no", "non", "n", "current", "actuel" or "0" to select records about non obsolete data.

project

Selects records corresponding to the specified project. A project is identified by its name.

  • project:mustardijon
    returns all records of the "MUSTARDijon" project

publisher

(synonym: pub)

Selects records describing data published by the specified structure. It is possible to use either the name or the acronym of the structure to perform the search.

  • publisher:Chrono-environnement
    selects all records describing data published by the Chrono-environment laboratory
  • publisher:IUCr
    selects all records describing data published by the International Union of Crystallography
  • publisher:CDS
    selects all records describing data hosted by the Strasbourg astronomical data Centre (CDS)

structure

(synonym: struct)

Selects records related to a given structure (laboratory, etc.). This structure may be creator, contributor or publisher. A record will be also selected if a person affiliated to this structure is a creator, a contributor or a contact.

  • structure:UTINAM
    selects all records describing data for which the UTINAM Institute is either publisher, creator or contributor or one of its members is either creator, contributor or contact of these data

type

Selects records corresponding to the data type specified. Values can be either "dataset” or "database”.

updatedsince

(synonym: since)

Selects records updated since the specified date. It the record has never been updated, then the creation date is used. The date format is "year-month-day", e.g. "2017-04-01" for the April 1st 2017. To search records updated before a specified date, just use the “negation” (“!”) operator to reverse the selection.

  • updatedsince:2017-06-01
    Records updated or created since June 1st 2017.
  • since:2017-06-01
    Records updated or created since June 1st 2017 (same as above, but using “since” which is a synonym of "updatedsince").

updateperiod

Selects records having a data update period corresponding to the specified value. It is possible to use either the periodicity full name (e.g. “as needed”) or its ISO 19115 MD_MaintenanceFrequencyCode code (e.g. “asNeeded”).

Possible values are: "annually", "biannually", "quarterly", “monthly", "fortnightly", “weekly”, “daily”, “continual”, “irregular”, “notPlanned”, “noUpdate”, “asNeeded”, “unknown”

uri

(synonym: url)

Selects records about open data having an access URI comprising the specified value. Other URIs in descriptions and other fields are not taken into account.

  • uri:obs-besancon.fr
    Records about data with the string "obs-besancon.fr" contained in the internet address (uri).

Multicriterial search

Using selectors and logical operators, it is possible to perform complex multicriterial searches. Here are a few examples:

  • structure:utinam & access:open
    Records about open access data related to the UTINAM Institute.
  • (structure:Biogéosciences | structure:Chrono-environnement | structure:LICB | structure:THETA | structure:UTINAM) lang:English
    Records about English language data from the Biogéosciences, Chrono-environnement, LICB, THETA and UTINAM structure.
  • structure:Chrono-environnement !collection:PollenChrono
    Records about data related to the "Chrono-environnement" laboratory and not belonging to the "PollenChrono" collection.
  • discipline:ecology audience:Research createdsince:2017-01-01
    Records about data in the "ecology" scientific discipline with audience "Research" and created after January 1st 2017.
  • publisher:Chrono-environnement kw:ecotoxicology
    Records about data published by the "Chrono-environnement" laboratory with "ecotoxicology" as one of their keywords.
  • discipline:"environmental sciences" access:restricted
    Records about data in the "environmental sciences" scientific discipline with restricted access.
  • discipline:"astronomy & astrophysics" | discipline:"spectroscopy"
    Records about data in the "astronomy & astrophysics" or "spectroscopy" scientific disciplines.
  • label:zaaj !kw:ecotoxicology
    Records about data with the "ZAAJ" (Long-Term Ecosystem Research site Jurassian Arc) label without "ecotoxicology" being one of their keywords.
  • publisher:UTINAM kuiper
    Records about data published by the UTINAM Institute and having the word "kuiper" in one of their fields (full-text search).
  • method:"Derived or compiled data" discipline:"astronomy & astrophysics"
    Records about data issued from processing or from the compilation of raw data in the "astronomy & astrophysics" scientific discipline.

Escape characters

When values used in search criteria contain spaces or special characters, then these need to be escaped.

A string of alphanumeric characters can be supplied “as is”. It is possible to surround a text containing characters such as spaces using simple (') or double (") quotes. To prevent the interpretation of a special character, it is possible to insert the escaping character "\" in front of it.

In the following cases, using an escaping character is mandatory:

Caractère Caractère
échappé
Pas
d’entourage
Entouré par
apostrophe
simple (')
Entouré par
apostrophe
double (")
Éspace\(éspace)X
Tabulation *\tXXX
Nouvelle ligne *\nXXX
\\\XXX
'\'XX
"\"XX
:\:X
(\(X
)\)X
&\&X
|\|X
;\;X

*: not usable with most search criteria

Plain text searches are usable only for words.

Some examples of escaping:

  • Escaping of the space and colon characters:
    audience:"University: master"
    audience:University\:\ master
  • Escaping the space, parenthesis and single quote characters:
    label:"l'OSU (test)"
    label:'l\'OSU (test)'
    label:l\'OSU\ \(test\)
  • Escaping the space and double quote characters:
    collection:"book \"Palaeontology\""
    collection:'book "Palaeontology"'
    collection:book\ \"Palaeontology\"
About
Terms of use
Contact
© OSU-THETA, CNRS