Comments (24)

raytula commented on June 20, 2024

We would have to sort this out at the database level, creating views with one depth column instead of a column for each depth, and then connect that to ERDDAP. Should be easy enough to do

Cool. I'm a database view novice and have not done that type of thing before. Can you take a crack at defining such a database view? I'm thinking it would be helpful to have a workflow for defining custom views that are automatically (re)created when the underlying tables are (re)created and that are used by ERDDAP.


raytula commented on June 20, 2024

The real-time data from this node is available through a database view.

Here you can view the daily data.

                          View "sn.PruthMooring:5minuteSamples"
             Column              |           Type           | Collation | Nullable | Default 
---------------------------------+--------------------------+-----------+----------+---------
 measurementTime                 | timestamp with time zone |           |          | 
 PruthMooring:WaterTemp_0m_QL    | smallint                 |           |          | 
 PruthMooring:WaterTemp_0m_QC    | character varying        |           |          | 
 PruthMooring:WaterTemp_0m_UQL   | integer                  |           |          | 
 PruthMooring:WaterTemp_0m_Med   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Avg   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Min   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Max   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Std   | double precision         |           |          | 
...


raytula commented on June 20, 2024

@JessyBarrette now that the automated QC configuration has been improved a bit, I'm looking at how to create an ERDDAP dataset for this.

I'm currently thinking that we should share the five minute data, as we did for many of the other datasets mapped from the sensor network database.

However, I'm not sure how to handle the measurements at different depths properly. That is, can/should we continue to define a set of dataset fields for each measurement (i.e. five minute average or median value, UQL level, secondary QC flag) and assign a depth value to each measurement, or will we need to handle the measurement at each depth as a separate ERDDAP record? Not sure if I'm explaining this right. Maybe we should discuss.


JessyBarrette commented on June 20, 2024

dataset or datasetS
That is a good question. My first intuition would be to follow what we've done with other datasets and create one unique ERDDAP dataset, perhaps even a single dataset for both sites (but I guess that may still be a debate with the CIOOS Metadata group), though that may not be the simplest approach if we're interfacing ERDDAP with a view of the database.

Sample Rate
Yes, I think it would be best to just provide the finest sampling rate available. However, I think the tidbits themselves are sampled every 10 minutes, and there's some interpolation (or maybe something else) to a higher rate of 5 min. Those stats likely aren't really meaningful in this specific case, since they are based on just a single value every 10 minutes.

Sort variables
I think it would be best to stack each temperature and its associated stats and flags into single variables and create a depth variable that describes each record's specific depth. It will definitely make the data easier to handle through ERDDAP, like any other dataset. Ideally, I think it would also be good to capture the serial number of the tidbits; I'm not sure if that is captured in the database. All this may be a bigger issue to deal with when working from the database directly. It would be easy to handle by generating NetCDFs, but that may just be because I'm more familiar with the NetCDF ERDDAP datasets.
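
To illustrate the stacking idea, here is a rough sketch of what that kind of view could look like in PostgreSQL (the wide table name and depth columns below are made up for the example, not the real schema):

-- Hypothetical sketch: each depth-specific column becomes its own set of rows
-- with an explicit depth value.
CREATE VIEW stacked_watertemp_example AS
SELECT "measurementTime", 0 AS depth, "WaterTemp_0m_Avg" AS watertemp_avg FROM wide_samples
UNION ALL
SELECT "measurementTime", 5 AS depth, "WaterTemp_5m_Avg" AS watertemp_avg FROM wide_samples;

The real view would repeat that pattern for every instrumented depth and carry the QL/QC/UQL flags and the other stats columns along with the average.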


raytula commented on June 20, 2024

Thanks @JessyBarrette I was thinking of one ERDDAP dataset.

Re: I think it would be best to stack each temperature and its associated stats and flags into single variables and create a depth variable that describes each record's specific depth. It will definitely make the data easier to handle through ERDDAP, like any other dataset....

That is the key thing I was wondering about. @n-a-t-e do you know if/how we can transform the results returned from the database to do this? Specifically, the database will return separate columns with measurements at different depths, and we would like the ERDDAP dataset to break it out to provide (time, depth, measurement, QC) columns instead -- assuming I understand @JessyBarrette correctly. A representative database view is PruthMooring:5minuteSamples.


n-a-t-e commented on June 20, 2024

We would have to sort this out at the database level, creating views with one depth column instead of a column for each depth, and then connect that to ERDDAP. Should be easy enough to do


JessyBarrette commented on June 20, 2024


raytula commented on June 20, 2024

Ideally, you would also need to generate a site, lat, long and source file name variable. QU5 has been at the same site the whole time while Pruth moved to another location after one year. I can get the official locations for you.

-- Jessy Barrette, M.Sc., Marine Instrumentation Specialist, Hakai Institute

Having the database view populate and return the latitude and longitude may be a good approach, assuming it could populate those fields with different values based on date/time. In a few cases (e.g. KC Buoy) the lat/long are stored in the SN database and can be passed through to ERDDAP. In many other cases, the lat/long are fixed and could be added into the view as such (i.e. most terrestrial sensor nodes that don't move).


JessyBarrette commented on June 20, 2024


n-a-t-e commented on June 20, 2024

As @JessyBarrette is saying, we can set the lat/long to fixed values right in datasets.xml when appropriate. So it could be in the view but wouldn't have to be


raytula commented on June 20, 2024

As @JessyBarrette is saying, we can set the lat/long to fixed values right in datasets.xml when appropriate. So it could be in the view but wouldn't have to be

Can that be done based on time (i.e. as with the Pruth Mooring, where the mooring was relocated after the first year)? If so, cool.


JessyBarrette commented on June 20, 2024


n-a-t-e commented on June 20, 2024

@JessyBarrette oh right, forgot that you are combining sites.

I created a view sn_unpivot."PruthMooring:5minuteSamples" with columns: measurementTime, depth, watertemp_ql, watertemp_qc, watertemp_uql, watertemp_med, watertemp_avg, watertemp_min, watertemp_max, watertemp_std. We can create a test ERDDAP dataset based on this and then set it up to auto-recreate as needed.
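
For reference, a hedged sketch of how such an unpivot view can be written in PostgreSQL (only the 0 m block is spelled out; the source view name is taken from the listing above, and the remaining depths would follow the same pattern):

CREATE SCHEMA IF NOT EXISTS sn_unpivot;

CREATE OR REPLACE VIEW sn_unpivot."PruthMooring:5minuteSamples" AS
SELECT w."measurementTime",
       d.depth,
       d.watertemp_ql, d.watertemp_qc, d.watertemp_uql,
       d.watertemp_med, d.watertemp_avg, d.watertemp_min, d.watertemp_max, d.watertemp_std
FROM sn."PruthMooring:5minuteSamples" AS w
CROSS JOIN LATERAL (VALUES
    -- one VALUES row per instrumented depth; only 0 m shown here
    (0, w."PruthMooring:WaterTemp_0m_QL",  w."PruthMooring:WaterTemp_0m_QC",
        w."PruthMooring:WaterTemp_0m_UQL", w."PruthMooring:WaterTemp_0m_Med",
        w."PruthMooring:WaterTemp_0m_Avg", w."PruthMooring:WaterTemp_0m_Min",
        w."PruthMooring:WaterTemp_0m_Max", w."PruthMooring:WaterTemp_0m_Std")
) AS d(depth, watertemp_ql, watertemp_qc, watertemp_uql,
       watertemp_med, watertemp_avg, watertemp_min, watertemp_max, watertemp_std);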


raytula commented on June 20, 2024

Nice. I tried out the following queries and they worked well.

hakai=> select * from sn_unpivot."PruthMooring:5minuteSamples" where "measurementTime">'2020-06-01';
hakai=> select * from sn_unpivot."PruthMooring:5minuteSamples" where "measurementTime">'2020-06-01' and depth=10;

As a general approach, perhaps we should put the database view definitions in this repo and automatically run them when the sensor network database changes, like you are doing now. I can imagine a bunch of situations where explicitly defined database views will work better than auto-generated ones. I'm not sure about performance/tuning, but I imagine you can do things to address any performance issues.
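
If the plain views ever turn out to be slow for the kinds of time/depth queries ERDDAP issues, one option (just a sketch, not something set up yet, and the materialized view name here is made up) is to materialize the unpivoted view, index it, and refresh it after each rebuild:

-- Sketch only: a materialized copy of the unpivoted view with an index on the
-- columns ERDDAP filters on most, refreshed whenever the source tables change.
CREATE MATERIALIZED VIEW IF NOT EXISTS sn_unpivot."PruthMooring:5minuteSamples_mat" AS
    SELECT * FROM sn_unpivot."PruthMooring:5minuteSamples";

CREATE INDEX IF NOT EXISTS pruth_5min_time_depth_idx
    ON sn_unpivot."PruthMooring:5minuteSamples_mat" ("measurementTime", depth);

REFRESH MATERIALIZED VIEW sn_unpivot."PruthMooring:5minuteSamples_mat";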


JessyBarrette commented on June 20, 2024

Here are the mooring locations over time:

Site  | Start time              | End time                | Latitude        | Longitude        | Latitude (degrees north) | Longitude (degrees east)
Pruth | 2017-12-03 00:00:00 PST | 2018-11-15 15:00:00 PST | 51° 39.3810' N  | 128° 05.4890' W  | 51.65635                 | -128.0914833
Pruth | 2018-11-15 15:00:00 PST | ongoing                 | 51° 39.126' N   | 128° 05.122' W   | 51.6521                  | -128.0853667
QU5M  | 2018-10-05 00:00:00 PST | ongoing                 | 50° 07.20082' N | 125° 12.7317' W  | 50.12001367              | -125.212195
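
For what it's worth, if the positions were to come from the database view rather than from datasets.xml, a time-based lookup could be a simple CASE on measurementTime using the Pruth coordinates above (a sketch only; the latitude/longitude output column names are assumptions):

SELECT v.*,
       CASE WHEN v."measurementTime" < TIMESTAMPTZ '2018-11-15 15:00:00 PST'
            THEN 51.65635 ELSE 51.6521 END          AS latitude,
       CASE WHEN v."measurementTime" < TIMESTAMPTZ '2018-11-15 15:00:00 PST'
            THEN -128.0914833 ELSE -128.0853667 END AS longitude
FROM sn_unpivot."PruthMooring:5minuteSamples" AS v;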


JessyBarrette commented on June 20, 2024

@raytula Just to make things clear regarding those two Tidbits mooring datasets: should we group them together as one dataset, or keep the two sites separate?


raytula commented on June 20, 2024

@raytula Just to make things clear regarding those two Tidbits mooring datasets: should we group them together as one dataset, or keep the two sites separate?

Hmm. Not sure. I would be inclined to keep them separate, so let's start with that. Nate will be putting the related database views in this repo and working with you to define the datasets.xml fragments to get those views integrated into ERDDAP.

If we were to combine them, would it make sense to bundle them with the provisional Rivers Inlet mooring data?


JessyBarrette commented on June 20, 2024


JessyBarrette commented on June 20, 2024

I created a metadata record for this dataset:
https://cioos-siooc.github.io/metadata-entry-form/#/en/pacific/tV5qE0aUgaOjSVmgPgiZ6MyHuSy1/-MW079KN2Z_OrMh7Zk6T


raytula commented on June 20, 2024

Looks good to me.


JessyBarrette commented on June 20, 2024

The metadata form was updated to reflect the provisional aspect


JessyBarrette commented on June 20, 2024

ERDDAP dataset is looking good:
https://goose.hakai.org/erddap/tabledap/HakaiPruthMooringProvisional.graph?time%2Cdepth%2Cwatertemp_avg&time%3E=2017-10-11T00%3A00%3A00Z&time%3C=2021-03-18T00%3A00%3A00Z&.draw=markers&.marker=2%7C5&.color=0x000000&.colorBar=Rainbow%7C%7C%7C6%7C14%7C&.yRange=-1%7C65%7Cfalse%7C&.bgColor=0xffccccff

Mostly what's needed now is to add more documentation to the datasets.xml.


raytula commented on June 20, 2024

Yeah, nice to see that graph. I kind of feel like ERDDAP is partly a much more functional version of the Hakai sensor network tool, with neither being super easy to use without spending some time exploring, and perhaps having someone prepare some great examples.

[screenshot of the ERDDAP graph]


JessyBarrette commented on June 20, 2024

This dataset is now available online. We can close that issue!
