---
title: "Product Information"
output: rmarkdown::html_vignette
vignette: >
  %\VignetteIndexEntry{Product Information}
  %\VignetteEngine{knitr::rmarkdown}
  %\VignetteEncoding{UTF-8}
---

```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)
eval_chunks <- curl::has_internet()
```

Before you start downloading data, you first need to know which
data you need and for which purpose. And not unimportant: you need to know
where to find it. Although data discovery is not the primary objective of
this package, it provides several instruments to obtain product information.

## The Web Browser

Perhaps the easiest way to find products and layers is via the
[online catalogue](https://data.marine.copernicus.eu/products). To
streamline your workflow using the web browser, you should check
`vignette("translate")`. That vignette explains how you can copy a
request from the web browser catalogue and use it in R.

## Listing Products

To get a complete overview of all available products, you can list them with
either `cms_products_list()` or `cms_products_list2()`:

```{r listing-products, message=FALSE, eval=eval_chunks}
library(CopernicusMarine)

cms_products_list() |> head(3)

cms_products_list2() |> head(3)
```

The first returns a `data.frame` complete with al sorts of meta information
about the type of variables in the product, the spatio-temporal coverage,
number of vertical layers, etc. You can use this `data.frame` to narrow your
search by applying a `dplyr::filter()` on it.

You can even pass arguments that are used to search the online catalogue.
These are not well documented. The example below shows how to search for
free text and filter on area and variables:

```{r searching-products, message=FALSE, eval=eval_chunks}
cms_products_list(freeText = "wave",
                  facetValues = list(areas             = list("Europe"),
                                     specificVariables = list("Velocity")))
```

The reason this is poorly documented is because this function does not
use the formal API. Instead it uses the web-form used by the online catalogue.
Users should therefore not rely on it too much as it may get discontinued or
altered at any time.

Instead, users can use `cms_products_list2()` which produces a list of products,
by using the official API. Unfortunately, this list does not contain any additional
information. For this purpose users can refer to `cms_product_metadata()`,
`cms_product_details()`, or `cms_product_services()` as described below.

## Product Details

With `cms_product_details()` you will get some descriptive information about your
product. Nothing too fancy, but it will help you understand what the product
is all about.

```{r product-details, eval=eval_chunks}
cms_product_details("GLOBAL_ANALYSISFORECAST_PHY_001_024") |> summary()
```

## Product Metadata

When subsetting  a dataset with `cms_download_subset()`, the most
tricky thing is discovering what the available ranges are for its dimensions.
You can use `cms_product_metadata()` for this purpose. It returns a named list.

```{r product-metadata, eval=eval_chunks}
meta_info <- cms_product_metadata("GLOBAL_ANALYSISFORECAST_PHY_001_024")

## Get the dimension properties for the first layer in this product
meta_info$properties[[1]]$`cube:dimensions` |> summary()
## Get the variable properties for the first layer in this product
meta_info$properties[[1]]$`cube:variables` |> summary()
```

Another way to get the dimension ranges is by setting up a stars proxy
object (see `vignette("proxy")`) and call `st_dimensions()` on it:

```{r product-proxy, warning=FALSE, eval=eval_chunks&&CopernicusMarine:::has_blosc}
library(stars) |> suppressMessages()

myproxy <- cms_zarr_proxy(
    product       = "GLOBAL_ANALYSISFORECAST_PHY_001_024",
    layer         = "cmems_mod_glo_phy-cur_anfc_0.083deg_P1D-m",
    asset         = "timeChunked")
st_dimensions(myproxy)
```

## Product Services

If you want a more raw access point to your data, you can use `cms_product_services()`.
It will present a `data.frame` for your product with all services provided by
Copernicus. You can use any of the columns with the `"_href"` suffix to get an
URL of a specific service. If you want to access those directly, you are on your
own. It is easier to use any of the wrappers provided by this package to access
the data.

```{r services, eval=eval_chunks}
cms_product_services("GLOBAL_ANALYSISFORECAST_PHY_001_024")
```
