NATRE vs CTD-χpod

NATRE vs CTD-χpod#

I’m pretty sure that the CTd-χpod data needs more cleanup, but there are some interesting sampling questions explored in this notebook.

What happens if I just use single latitude or single longitude lines from the original NATRE dataset?
How do number of obs compare?
Is there some finestructure metric that tells me if we are seeing eddy stirring?

The autoreload extension is already loaded. To reload it, use:
  %reload_ext autoreload

/Users/dcherian/mambaforge/envs/eddydiff/lib/python3.9/site-packages/statsmodels/compat/pandas.py:65: FutureWarning: pandas.Int64Index is deprecated and will be removed from pandas in a future version. Use pandas.Index with the appropriate dtype instead.
  from pandas import Int64Index as NumericIndex

geoviews   : 1.9.5
xarray     : 2022.6.0
dcpy       : 0.2.1.dev11+gebbbf87.d20220428
holoviews  : 1.14.8
xgcm       : 0.6.1
scipy      : 1.8.0
cf_xarray  : 0.7.1.dev16+g7305fd8
hvplot     : 0.7.3
pint_xarray: 0.2.1
distributed: 2022.4.0
matplotlib : 3.5.1
tqdm       : 4.64.0
gsw        : 3.4.0
eddydiff   : 0.1
dask       : 2022.4.0
numpy      : 1.22.3

<xarray.DataArray (dim_0: 1)>
array([1.])
Dimensions without coordinates: dim_0

/Users/dcherian/mambaforge/envs/eddydiff/lib/python3.9/site-packages/distributed/node.py:177: UserWarning: Port 8787 is already in use.
Perhaps you already have a cluster running?
Hosting the HTTP server on port 61590 instead
  warnings.warn(

Client

Client-70461802-5095-11ed-a6bd-3af9d394f1c6

Connection method: Cluster object	Cluster type: distributed.LocalCluster
Dashboard: http://127.0.0.1:61590/status

Cluster Info

LocalCluster

d3c44a59

Dashboard: http://127.0.0.1:61590/status	Workers: 6
Total threads: 6	Total memory: 16.00 GiB
Status: running	Using processes: True

Scheduler Info

Scheduler

Scheduler-e94c491f-83df-47b6-8785-f3f33d668e3f

Comm: tcp://127.0.0.1:61591	Workers: 6
Dashboard: http://127.0.0.1:61590/status	Total threads: 6
Started: Just now	Total memory: 16.00 GiB

Workers

Worker: 0

Comm: tcp://127.0.0.1:61621	Total threads: 1
Dashboard: http://127.0.0.1:61626/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61597
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-5o5_x_fw

Worker: 1

Comm: tcp://127.0.0.1:61612	Total threads: 1
Dashboard: http://127.0.0.1:61617/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61595
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-stmte294

Worker: 2

Comm: tcp://127.0.0.1:61614	Total threads: 1
Dashboard: http://127.0.0.1:61620/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61599
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-qv754dvy

Worker: 3

Comm: tcp://127.0.0.1:61613	Total threads: 1
Dashboard: http://127.0.0.1:61615/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61594
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-48ix56e0

Worker: 4

Comm: tcp://127.0.0.1:61625	Total threads: 1
Dashboard: http://127.0.0.1:61627/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61596
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-erkh1o_k

Worker: 5

Comm: tcp://127.0.0.1:61616	Total threads: 1
Dashboard: http://127.0.0.1:61623/status	Memory: 2.67 GiB
Nanny: tcp://127.0.0.1:61598
Local directory: /Users/dcherian/work/eddydiff/notebooks/dask-worker-space/worker-nqgo9r1j

NATRE $ε$ vs $ε_χ$#

fRρ = (1 - 2 / natre_.Rρ + 1 / natre_.Rρ**2).where(np.abs(natre_.Rρ) > 1)
α = gsw_xarray.alpha(natre_.SA, natre_.CT, natre_.pres)
fN = (-9.81 * α * natre_.Tz / natre_.N2) ** 2

with plt.rc_context({"lines.linewidth": 0.75}):
    natre_.eps.mean("cast").cf.plot(xscale="log", color="k")
    natre_.eps_chi.mean("cast").cf.plot(xscale="log")
    (fN * natre_.eps_chi * fRρ).mean("cast").cf.plot(xscale="log")

_images/f3883dc433ee0a3180960c917daa3946e1d8f2ca4310af7ea90e4de6f00b122b.png

Turner angle / Rρ diagnostics#

TODO#

[x] Issues with dTdz for Γ? How do I reproduce Aurelie’s dThdz?
- 100m fitler
- Ashin uses 8m
[x] Do histogram of Rρ, Γ with depth
[ ] What is the outlier ε for A05, Rρ ~ 4

Calculation of Tu / Rρ#

Ashin et al (2022):

To estimate〈∂T/∂z, a 10s(∼8 m) low-pass filter was applied to the temperature profiles to remove high-frequency variations, and then the temperature gradients less than the accuracy of the temperature sensor (0.018°C/m) were removed.

St Laurent & Schmitt (1999):

Finestructure gradient quantities, particularly Rr and Ri, will be used extensively in the analysis that follows. To estimate the vertical gradients of scalars, we have used the slope of a linear fit over a 5-m segment, centered at each 0.5-m interval. The 5-m scale was chosen as a suitable trade-off between the need for high vertical resolution and statistically reasonable regression estimation. The magnitudes of all 5-m scalar gradients were compared to their associated standard error. Gradient quantities with standard errors larger than twice their magnitude were excluded from the analysis. … “The gradients were calculated using a 5-m scale, smoothed using a 50-m running average.”

GO-SHIP CTD-χpod

bin to 5m, difference, then run 100m smoother

Read#

13 40

/Users/dcherian/mambaforge/envs/eddydiff/lib/python3.9/site-packages/numba/np/ufunc/gufunc.py:151: RuntimeWarning: invalid value encountered in get_gradient_sign
  return self.ufunc(*args, **kwargs)

0  values == 0
28090

/var/folders/yp/rzgd0f7n5zbcbf5dg73sfb8ntv3xxm/T/ipykernel_75453/1555697229.py:5: DeprecationWarning: Deleting a single level of a MultiIndex is deprecated. Previously, this deleted all levels of a MultiIndex. Please also drop the following variables: {'latitude', 'longitude'} to avoid an error in the future.
  natre.stack({"cast": ["latitude", "longitude"]})
/Users/dcherian/work/python/cf-xarray/cf_xarray/accessor.py:1638: UserWarning: Variables {'ctd_salinity_qc'} not found in object but are referred to in the CF attributes.
  warnings.warn(
/Users/dcherian/work/python/cf-xarray/cf_xarray/accessor.py:1638: UserWarning: Variables {'ctd_temperature_qc'} not found in object but are referred to in the CF attributes.
  warnings.warn(
/Users/dcherian/work/python/cf-xarray/cf_xarray/accessor.py:1638: UserWarning: Variables {'ctd_salinity_qc'} not found in object but are referred to in the CF attributes.
  warnings.warn(
/Users/dcherian/work/python/cf-xarray/cf_xarray/accessor.py:1638: UserWarning: Variables {'ctd_temperature_qc'} not found in object but are referred to in the CF attributes.
  warnings.warn(

Diagnostics#

My takeaways are:

A05 sampled ocean conditions similar to NATRE in terms of Rρ.
We could throw out the small number of diffusive convection points that have really high ε (-90 < Tu < -45). This would not reduce the average ε (see next)
In the doubly stable shear turbulence regime at 1000dbar (-45 < Tu < 45) we still have high ε ~ 1e-7-ish; NATRE has 1e-10

St-Laurent & Schmitt:

Data in the density ratio range between -1 and 0 were held from the analysis, as these data are characterized by weak thermal stratification where $θ_z$ → 0, making $Γ$ singular.

%autoreload

ed.plot.plot_Tu_relationships(a05_, title="A05")

_images/38bb3e2cb0480692805d5c18ea7c81011bfc7d22ef57358f395b1465d464e765.png

ed.plot.plot_Tu_relationships(natre_, title="NATRE")

2022-10-21 12:24:50,897 - distributed.utils_perf - WARNING - full garbage collections took 13% CPU time recently (threshold: 10%)

_images/46afd146ea15dd43d12ada2093a84d5238674993cf760a38b3407868c1f576ab.png

plt.rcParams["figure.dpi"] = 140

natre_.Tu.cf.plot(y="pres", x="cast", cmap=mpl.cm.twilight, vmin=-80, vmax=80)
plt.figure()
a05ctd.Tu.cf.plot(x="profile_id", cmap=mpl.cm.twilight, levels=[45, 90])

<matplotlib.collections.QuadMesh at 0x17f2c3e50>

_images/054cc0cfdc7aa3d4c9f1e455d39e4998900785f94563e4ea23f5bb21f99fd273.png

_images/7cb0297a4f94c0163fb9dbc4cffe90b26f18fb644814a16d3ffa6dc058acc9a1.png

ε-Rρ#

Mean eps is high for Rρ ~ 0, compared to Rρ < 0
There are equal number of points at Rρ~0 and $R_ρ \approx -1$

def plot_Rrho_dependencies(eps, Rρ, title=None):
    gb = eps.groupby_bins(Rρ, bins=np.arange(-5, 5.1, 0.25))
    gb.mean().plot.line(yscale="log")
    ax = plt.gca()

    counts = gb.count().rename("counts")
    counts.attrs.clear()
    ax2 = plt.gca().twinx()
    counts.plot.line(yscale="log", ax=ax2, color="k")
    ax.set_xticks(gb._bins[::4])
    ax.set_title(title)
    ax2.set_title("")

    plt.gcf().set_size_inches(mpl.figure.figaspect(1 / 3))


eps = (
    a05["chipod"]
    .ds["eps"]
    .sel(
        cleaner="none",
        direction="dn",
        station=slice(108, 130),
        pressure=slice(500, 2000),
    )
)
Rρ = (
    a05["ctd"]
    .ds["Rρ"]
    .sel(station=slice(108, 130), pressure=slice(500, 2000))
    .clip(-10, 10)
)
plot_Rrho_dependencies(eps, Rρ, title="A05")

plt.figure()
sub = natre.sel(pres=slice(500, 2000))
plot_Rrho_dependencies(natre.eps, natre.Rρ, title="NATRE")

_images/0c5c9e1a32cbd53fa51f48b9bf415a58218e22190b437608df780c48739294c3.png

_images/dccb625a418868c0065cd544245d3d08d0c7e137f375198752441d759e1db18d.png

pod = a05["chipod"].ds.sel(
    cleaner="none",
    direction="dn",
    station=slice(108, 130),
    pressure=slice(200, 2000),
)
pod.eps.coarsen(pressure=25).mean().mean("station").cf.plot(xscale="log")
pod.eps.where(pod.Rρ < -0.5).coarsen(pressure=25).mean().mean("station").cf.plot(
    xscale="log"
)

[<matplotlib.lines.Line2D at 0x18c7608e0>]

_images/6d3d888e1ff4e248a837b21c980dc7359900aeba6fb52056568892eb6cd79588.png

pod["Tu"] = pod.Tu.copy(data=np.rad2deg(pod.Tu.data))

pod.plot.scatter("Tu", "eps", alpha=0.2, s=2)
plt.gca().set_yscale("log")

_images/9f3d28b325a5f694a8003a75f5589e156af7f5729dbcf31dcd70e3931510704c.png

def plot_Gamma_diagnostics(datasets):
    f, ax = plt.subplots(2, 2, constrained_layout=True)

    for name, data in datasets.items():
        data.Gamma.plot.hist(
            density=True,
            bins=np.arange(-0.05, 2.1, 0.01),
            histtype="step",
            ax=ax[0, 0],
            label=name,
            lw=1.5,
        )

        for axx, var, yscale in zip(
            ax.ravel()[1:], ["Gamma", "eps", "chi"], ["linear", "log", "log"]
        ):
            mean = (
                data[var].groupby_bins(data.Rρ, bins=np.arange(-5, 5.1, 0.25)).median()
            )
            assert mean.ndim == 1
            # display(mean)
            mean.plot(ax=axx, yscale=yscale)

    dcpy.plots.linex(0.2, ax=ax[0, 0])
    dcpy.plots.liney(0.2, ax=ax[0, 1])
    ax[0, 0].legend()
    [axx.set_title("") for axx in ax.ravel()]
    plt.gcf().set_size_inches(mpl.figure.figaspect(1 / 2))


data = a05["chipod"].ds
# a05["chipod"].ds["Gamma"] = (data.chi * data.N2)/(2 * data.eps * data.dThdz**2).where((np.abs(data.dThdz) > 3e-4) & (data.N2 > 1e-6))

plot_Gamma_diagnostics(
    {
        "NATRE": natre.sel(pres=slice(600, 2000)),
        "A05": a05["chipod"].ds.sel(
            cleaner="none",
            direction="dn",
            station=slice(108, 130),
            pressure=slice(600, 2000),
        ),
    }
)

_images/a391f498301375ab059ce59da92e330678d0be9ce961106b8448abafa1d77083.png

Tz vs dThdz#

subset = a05.sel(station=slice(108, 130), pressure=slice(200, 2000))

subset["ctd"].ds.Tz.mean("station").cf.plot()
subset["chipod"].ds.dThdz.mean("station").sel(direction="dn").cf.plot()

[<matplotlib.lines.Line2D at 0x1823729d0>]

_images/994d399132871408e269652500d40bdcbef9f5d3b68cb8377ef145b76d4f14f5.png

Compare to each other#

ed.plot.compare_section_estimates([natre_binned, a05_binned])

_images/e813ce56373328f641c6131e8b59c86467c9bcdd0ad374a0c7b3e5ea3acbbb39.png

Compare individual binned residuals#

ed.plot.debug_section_estimate(natre_binned, KρTz2=True)

_images/ac3679d8b606d5625852397deea43707a61d4edcdf0a673f0aba21bf868d266b.png

ed.plot.debug_section_estimate(a05_binned)

_images/4e07012c3e20606eeb311c1bf6600634d35a602820c94bc2746de056f68a9b4e.png

dcpy.plots.fill_between_bounds(a05_binned, "residual_chi", y="pres")
dcpy.plots.fill_between_bounds(natre_binned, "residual_chi", y="pres", color="k")
dcpy.plots.fill_between_bounds(natre_binned, "residual", y="pres", color="C2")
plt.xscale("log")

_images/c2c9607600f753ed64f644748b3e3229d63dd1cdc0111176a68145563f3c1e09.png

Map : Did A05 sample the “right” place?#

Locate entire cruise on map with NATRE & $|∇^ρS|$ (averaged over 750m-1250m)

$|∇^ρS|$ along 25°N shows a mid-depth maximum as expected (A05 latitude is approximately 24.7°N)

(
    cole.salinity_gradient.sel(lon=slice(360 - 100, 345))
    .sel(lat=24.7, method="nearest")
    .cf.plot(robust=True, size=5, aspect=3)
)

<matplotlib.collections.QuadMesh at 0x109d17c70>

chipod = a05["chipod"].ds
with_chi = chipod.cf[["latitude", "longitude"]].where(
    chipod.chi.count("pressure") > 0  # , drop=True
)
lats, lons = np.meshgrid(natre.latitude, natre.longitude)

dS = (
    cole.salinity_gradient.sel(
        lon=slice(360 - 80, 360), lat=slice(10, 40), pres=slice(750, 1250)
    )
    .mean("pres")
    .hvplot(geo=True)
)
natre_casts = gv.Points((lons.ravel(), lats.ravel()))
allcasts = gv.Points((chipod.cf["longitude"], chipod.cf["latitude"]))
chicasts = gv.Points((with_chi.cf["longitude"], with_chi.cf["latitude"]))
a05_casts = allcasts.opts(size=6, color="cyan") * chicasts.opts(color="k")

(
    # dS.opts(cmap="bmy")
    natre_casts.opts(color="k")
    * a05_casts
    * gf.coastline.opts(line_width=2)
)

Are there any obvious sampling biases in NATRE?#

No. Just a three “bad” casts

counts = natre.chi.cf.sel(Z=slice(2000)).cf.count("Z")
counts

<xarray.DataArray 'chi' (latitude: 10, longitude: 10)>
array([[   1, 3397, 3515, 3079, 3540, 3607, 3510, 3564, 3703, 3715],
       [3468, 3690, 3514, 3528, 3578, 3468, 3495, 3668, 3634, 3661],
       [3564, 3610, 3597, 3512, 3454, 3518, 3517, 3571, 3481, 3684],
       [3472, 3482, 3498, 1545, 3317, 3419, 3436, 3414, 3485, 3684],
       [3481, 3556, 3491, 3569, 3491, 3407, 3369, 3424, 3429, 3689],
       [3490, 3375, 3551, 3492, 3470, 3468, 3462,  697, 3475, 3682],
       [3514, 3493, 3511, 3525, 3648, 3549, 3518, 3419, 3641, 3757],
       [3624, 3509, 3522, 3575, 3627, 3551, 3510, 3458, 3581, 3826],
       [3536, 3543, 3463, 3640, 3616, 3495, 3566, 3562, 3613, 3805],
       [3503, 3139, 3587, 3625, 3621, 3478, 3571, 3639, 3653, 3797]])
Coordinates:
  * latitude   (latitude) float64 27.5 27.1 26.7 26.3 ... 25.1 24.7 24.3 23.9
  * longitude  (longitude) float64 -30.7 -30.3 -29.8 -29.4 ... -27.6 -27.2 -26.8
    time       (latitude, longitude) datetime64[ns] 1992-03-28T15:28:59.99999...
Attributes:
    long_name:      $χ$
    standard_name:  ocean_dissipation_rate_of_thermal_variance_from_microtemp...
    units:          C^2 s^-1

xarray.DataArray

'chi'

latitude: 10
longitude: 10

1 3397 3515 3079 3540 3607 3510 ... 3625 3621 3478 3571 3639 3653 3797

array([[   1, 3397, 3515, 3079, 3540, 3607, 3510, 3564, 3703, 3715],
       [3468, 3690, 3514, 3528, 3578, 3468, 3495, 3668, 3634, 3661],
       [3564, 3610, 3597, 3512, 3454, 3518, 3517, 3571, 3481, 3684],
       [3472, 3482, 3498, 1545, 3317, 3419, 3436, 3414, 3485, 3684],
       [3481, 3556, 3491, 3569, 3491, 3407, 3369, 3424, 3429, 3689],
       [3490, 3375, 3551, 3492, 3470, 3468, 3462,  697, 3475, 3682],
       [3514, 3493, 3511, 3525, 3648, 3549, 3518, 3419, 3641, 3757],
       [3624, 3509, 3522, 3575, 3627, 3551, 3510, 3458, 3581, 3826],
       [3536, 3543, 3463, 3640, 3616, 3495, 3566, 3562, 3613, 3805],
       [3503, 3139, 3587, 3625, 3621, 3478, 3571, 3639, 3653, 3797]])

Coordinates: (3)

latitude
(latitude)
float64
27.5 27.1 26.7 ... 24.7 24.3 23.9
units :
degrees_north
standard_name :
latitude
```
array([27.5, 27.1, 26.7, 26.3, 25.9, 25.5, 25.1, 24.7, 24.3, 23.9])
```
longitude
(longitude)
float64
-30.7 -30.3 -29.8 ... -27.2 -26.8
units :
degrees_east
standard_name :
longitude
```
array([-30.7, -30.3, -29.8, -29.4, -29. , -28.5, -28.1, -27.6, -27.2, -26.8])
```

time

(latitude, longitude)

datetime64[ns]

1992-03-28T15:28:59.999996928 .....

long_name :: Time
standard_name :: time
axis :: T

array([['1992-03-28T15:28:59.999996928', '1992-03-31T22:40:00.000004352',
        '1992-04-01T02:51:59.999998464', '1992-04-04T14:25:00.000004352',
        '1992-04-04T18:14:59.999996672', '1992-04-07T20:43:00.000000512',
        '1992-04-08T00:30:00.000003328', '1992-04-11T00:33:00.000002560',
        '1992-04-11T04:21:59.999998464', '1992-04-14T15:16:59.999999488'],
       ['1992-03-28T19:30:00.000000000', '1992-03-31T18:17:59.999995904',
        '1992-04-01T06:51:59.999995136', '1992-04-04T10:36:59.999995136',
        '1992-04-04T22:02:59.999995904', '1992-04-07T16:58:59.999996928',
        '1992-04-08T04:15:00.000003328', '1992-04-10T20:48:00.000002560',
        '1992-04-11T08:57:59.999997184', '1992-04-14T10:44:00.000000256'],
       ['1992-03-28T23:45:00.000003328', '1992-03-31T14:15:59.999996416',
        '1992-04-01T11:09:59.999997696', '1992-04-04T06:50:00.000002304',
        '1992-04-05T01:54:00.000004608', '1992-04-07T13:16:59.999996160',
        '1992-04-08T08:00:59.999999744', '1992-04-10T17:00:00.000003328',
        '1992-04-11T12:53:59.999997952', '1992-04-14T06:05:00.000002304'],
       ['1992-03-29T03:51:59.999995136', '1992-03-31T10:16:59.999996160',
        '1992-04-01T15:05:00.000002304', '1992-04-04T02:57:00.000000768',
        '1992-04-05T06:02:00.000002816', '1992-04-07T09:30:00.000003328',
        '1992-04-08T11:45:00.000003328', '1992-04-10T13:12:00.000004096',
        '1992-04-11T16:44:00.000000256', '1992-04-14T01:16:59.999996160'],
...
        '1992-04-02T03:14:00.000000256', '1992-04-03T06:45:00.000000000',
        '1992-04-05T17:51:00.000002048', '1992-04-06T21:47:00.000002816',
        '1992-04-08T22:58:00.000000512', '1992-04-10T01:38:00.000001536',
        '1992-04-12T04:38:00.000001536', '1992-04-13T11:11:00.000004352'],
       ['1992-03-29T20:17:59.999999232', '1992-03-30T16:57:00.000004096',
        '1992-04-02T07:07:59.999998208', '1992-04-03T02:45:59.999999744',
        '1992-04-05T21:38:00.000004864', '1992-04-06T17:32:59.999995904',
        '1992-04-09T02:53:00.000004864', '1992-04-09T21:53:00.000001536',
        '1992-04-12T08:29:59.999996672', '1992-04-13T06:31:59.999996160'],
       ['1992-03-30T00:25:59.999997440', '1992-03-30T12:47:00.000002816',
        '1992-04-02T11:11:00.000004352', '1992-04-02T22:50:59.999995392',
        '1992-04-06T01:44:00.000000256', '1992-04-06T13:44:00.000000256',
        '1992-04-09T06:36:00.000002048', '1992-04-09T18:05:00.000002304',
        '1992-04-12T12:17:59.999995904', '1992-04-13T02:23:00.000001536'],
       ['1992-03-30T04:38:00.000001536', '1992-03-30T08:48:00.000002560',
        '1992-04-02T15:02:00.000002816', '1992-04-02T18:59:59.999996672',
        '1992-04-06T05:40:59.999997440', '1992-04-06T09:53:00.000001536',
        '1992-04-09T10:24:00.000001280', '1992-04-09T14:18:59.999995648',
        '1992-04-12T16:21:00.000002048', '1992-04-12T20:28:00.000003840']],
      dtype='datetime64[ns]')

Attributes: (3)
long_name :
$χ$
standard_name :
ocean_dissipation_rate_of_thermal_variance_from_microtemperature
units :
C^2 s^-1

natre.chi.cf.sel(Z=slice(2000)).cf.count("Z").plot()

<matplotlib.collections.QuadMesh at 0x177bd0280>

_images/e7594bc2c5ff25bf68f18e8d5fafdc8564767567caa91f3df3897138251a2574.png

NATRE latitudinal and longitudinal lines#

Interestingly, most “lines” do show eddy stirring : ~ 6/10 panels in the following figure.

So it does seem like a single CTD-χpod section has some chance of replicating that result.

A05 is 24.5°N, which might not be the right “place” :( but why? How can we double check this

ed.natre.plot_lines(latlines, col="latitude")
ed.natre.plot_lines(lonlines, col="longitude")

<xarray.plot.facetgrid.FacetGrid at 0x1732266d0>

_images/54436b5cf28dc97a8a7f833d3bbe8fd00b435be7e637990a46194dcd7322567e.png

_images/c213b305062e2536fbe7b74c058a8b9c941411b2b8702c927180ae93ca902c09.png

Compare 24.7°N#

NATRE data along 24.7 line is definitely lower than the NATRE mean, but not as low as CTD-χpod.

Is it possible that this latitude is just a bad place? A05 could have missed the signal, but how can we tell for sure?

A05: There is T-S spread.
A05: Higher χ values are to the east. Adding cast 131 which looks like a Meddy outlier makes a big difference
~Could use Argo to map mean spice field~ Mean spice field has basin-wide high values in 750m-1250m zone

(
    latlines.chib2.hvplot.step(y="pres", by="latitude", muted_alpha=0)
    * natre_binned.chib2.hvplot.step(y="pres", color="k", line_width=4, label="NATRE")
    * a05_binned_old.chib2.hvplot.step(
        y="pres", color="b", line_width=4, label="A05 w/ 131"
    )
    * a05_binned.chib2.hvplot.step(
        y="pres", color="g", line_width=4, label="A05 110-129"
    )
).opts(
    ylim=(2000, 0),
    logx=True,
    frame_height=500,
    frame_width=250,
    xlim=(1e-11, 1e-8),
    show_grid=True,
)

with plt.rc_context({"axes.prop_cycle": dcpy.plots.cmap_color_cycler()}):
    f, ax = plt.subplots(1, 1, constrained_layout=True)
    # 0.5m * 200 ~ 100m resolution
    # (natre.mean("longitude").cf.coarsen(Z=200, boundary="trim").mean().chi / 2).cf.plot(
    #    hue="latitude", xscale="log", zorder=0, lw=1
    # )
    for lat in latlines.latitude:
        dcpy.plots.fill_between_bounds(
            latlines.sel(latitude=lat),
            "chib2",
            y="pres",
            label=f"{lat.data:.2f}°N",
            fill=False,
        )
    dcpy.plots.fill_between_bounds(
        natre_binned, "chib2", y="pres", color="k", label="NATRE mean"
    )
    ax.set_xscale("log")
    ax.legend(bbox_to_anchor=(1, 1))

_images/6c7d92f18b552aa8f8bc729929a3b44fd5e28d692ee35d3eb45b9ad8f0a59c94.png

Compare χ profiles#

2m#

I’m not sure that this is the right “resolution”.

The CTD-χpod is technically 2m; but is quite noisy… It looks more like the 0.5m NATRE data!

kwargs = dict(
    ylim=(2000, 0),
    logx=True,
    muted_alpha=0,
    line_width=0.5,
    aspect=1 / 2,
    frame_width=250,
    xlim=(1e-12, 1e-6),
    grid=True,
)

a05_casts = chipod.cf.sel(profile_id=slice(119, 129)).chi.hvplot.line(
    by="station", y="pressure", title="A05", **kwargs
)
natre_casts = natre_2m.chi.hvplot(y="pres", by="longitude", title="NATRE", **kwargs)

(a05_casts + natre_casts).opts(title="")

0.5m#

kwargs = dict(
    ylim=(2000, 0),
    logx=True,
    muted_alpha=0,
    line_width=0.5,
    aspect=1 / 2,
    frame_width=250,
    xlim=(1e-12, 1e-6),
    grid=True,
)

a05_casts = chipod.cf.sel(profile_id=slice(119, 129)).chi.hvplot.line(
    by="station", y="pressure", title="A05", **kwargs
)
natre_casts = natre.chi.hvplot(y="pres", by="longitude", title="NATRE", **kwargs)

(a05_casts + natre_casts).opts(title="")

natre_line = latlines.sel(latitude=chipod.cf["latitude"].mean().data, method="nearest")

# dcpy.plots.fill_between_bounds(
#    avg_older, "chib2", y="pres", color="r", label="A05 w/ 131"
# )
dcpy.plots.fill_between_bounds(a05_binned, "chib2", y="pres", color="b", label="A05")
dcpy.plots.fill_between_bounds(
    natre_binned, "chib2", y="pres", color="k", label="NATRE"
)
(natre_line.chi / 2).cf.plot.step(lw=2, color="g", y="pres", label="NATRE 24.7°N")
plt.legend(bbox_to_anchor=(1, 1))
plt.xscale("log")
plt.gcf().set_size_inches((3, 5))

natre_line = latlines.sel(latitude=a05_binned.latitude.mean().data, method="nearest")

# dcpy.plots.fill_between_bounds(
#    avg_older, "chib2", y="pres", color="r", label="A05 w/ 131"
# )
dcpy.plots.fill_between_bounds(a05_binned, "chib2", y="pres", color="b", label="A05")
dcpy.plots.fill_between_bounds(
    natre_binned, "chib2", y="pres", color="k", label="NATRE"
)
(natre_line.chi / 2).cf.plot.step(lw=2, color="g", y="pres", label="NATRE 24.7°N")
plt.legend(bbox_to_anchor=(1, 1))
plt.xscale("log")
plt.gcf().set_size_inches((3, 5))

_images/3b5ba3b34c77b3dfa3dc947f92d586895789717106499c09f2cc0fad38cee0ed.png

Number of observations#

This is going to depend on the input data “resolution”

f, ax = plt.subplots(1, 2, sharey=True, constrained_layout=True)
latlines.num_obs.cf.plot(hue="latitude", y="pres", ax=ax[0], lw=1)
lonlines.num_obs.cf.plot(hue="longitude", y="pres", ax=ax[1], lw=1)
for axx in ax:
    a05_binned.num_obs.cf.plot(y="pres", ax=axx, color="k")
ax[0].legend("off")
ax[1].legend("off")

<matplotlib.legend.Legend at 0x1723caac0>

_images/b5100956aba0e9114142805a794f9a97f406f39d0429b995e389778b9f91a869.png

New dataset#

subset = a05.sel(station=slice(115, 130))
clean = subset["chipod"].ds.sel(cleaner="GE", direction="dn", pressure=slice(2000))

kwargs = dict(histtype="step", density=True, bins=np.arange(-12, -3, 0.25))

np.log10(clean.eps.where(np.abs(clean.dThdz) > 5e-3)).plot.hist(**kwargs, label="A05")
np.log10(natre.eps.sel(pres=slice(2000))).plot.hist(**kwargs, label="NATRE");

_images/4b075abb0c52522f6297721aafedfb1448dbb01fdde1f2551c6d098fac66fc45.png

clean.eps.cf.plot(norm=mpl.colors.LogNorm(), robust=True)

<matplotlib.collections.QuadMesh at 0x1806a7d30>

_images/7068c171ed19aca0bf4442df54faeb1941b528269d505814d2cf78c45b744e5e.png

f, axx = plt.subplots(1, 2, sharey=True)
for var, ax in zip(["chi", "eps"], axx):
    plt.sca(ax)
    mask = np.abs(subset["chipod"].ds["dThdz"]) > 5e-3
    (clean[var].coarsen(pressure=50, boundary="trim").mean().mean("station")).cf.plot(
        xscale="log"
    )
    subset["finescale"].ds[var].sel(pressure=slice(2000)).mean("station").cf.plot(
        hue="criteria", xscale="log"
    )
    (
        natre[var]
        .sel(pres=slice(2000))
        .coarsen(pres=200, boundary="trim")
        .mean()
        .mean(["latitude", "longitude"])
        .cf.plot(color="k")
    )

_images/b033f49d52eab5fc80778c99d54e553a60e71dd745dd8bde96cc42e9f035a4c4.png

Comparing χ distributions#

These comparisions are using 0.5m NATRE data.

Why is the error so big around 1000m

Error is sensitive to blocksize; I think this is wrong for large blocksize, we don’t get enough independent samples.
NATRE has 5x more points (after coarsening to 2m)
Ignore this following points: I think these high values are cast 131 that is an outlier in T, S too.
1. but masking out a few high values really reduces error in A05
  1. The high value criterion depends on depth (by comparing with NATRE)
    1. at 1300-ish m we need to throw out χ > 1e-7 (approx.)
    2. at 900-ish m we need to throw out χ > 1e-6 (approx).

A05 looks quite good compared to NATRE at 24.7°N for χ anyway, ε looks hopeless.

Looks like the NATRE data sampled a few different regimes possibly?
And what A05 sampled is plausibly similar to what NATRE sampled at 24.7°N?

All 2m NATRE data#

inds = slice(0, 14)
print(f"depth range: {np.arange(150, 3150, 100)[inds]}")
print(f"density bins: {bins[inds]}")

a05_grouped = a05.sel(cast=a05_binned.cast).groupby_bins("gamma_n", bins=bins[inds])
natre_grouped = natre_2m.groupby_bins("gamma_n", bins=bins[inds])

ed.natre.compare_chi_distributions(a05_grouped, natre_grouped)

depth range: [ 150  250  350  450  550  650  750  850  950 1050 1150 1250 1350 1450]
density bins: [26.43 26.68 26.87 27.03 27.16 27.28 27.4  27.51 27.6  27.67 27.74 27.79
 27.83 27.87]
(26.43, 26.68]
(26.68, 26.87]
(26.87, 27.03]
(27.03, 27.16]
(27.16, 27.28]
(27.28, 27.4]
(27.4, 27.51]
(27.51, 27.6]
(27.6, 27.67]
(27.67, 27.74]
(27.74, 27.79]
(27.79, 27.83]
(27.83, 27.87]

_images/53c7c91ef9644be77c6c205934992bf7cdf4205582e0c554ac10f8213b1fc4e2.png

_images/de27de4da9feff28ff52ae640bf9e438876a819075dcb85cafae4fa3e39b4e93.png

_images/43f6516474a8042fd509479a40b0563f09377a8aec14bcc2498bae93305bd3e3.png

_images/aa141295e493b8d78f45661eb896bef2eea5de6954217d115d79fe5b10879302.png

_images/4023993ac6003c5ff1983322eac9fdf33282a469a04ff2c71cacf9c70a53dcdd.png

_images/65758b4a57b39e7c141a89e579704b167368e14f727740b1b91e7b8e2225dec0.png

_images/b622aa2600de803a8c55c0d25e4aba1ef2fd2b8f0cb9bfdb583b2436e6295170.png

_images/d417440c7a30034e03d13a5db498e8503974bd97de174ac399f766febc2659cd.png

_images/00a5ce8b4b766d7ef601318adb9206830b77396c6307ce21bcdba2272217cf83.png

_images/6ae1dfae1b96ae7d13bcb8b1da01d58a282d6d9c8b8075bd3722a5bc1a230a2c.png

_images/d0469829dbcf8587d8d94f151abaf9811a5e9d48dbe2d729c643f493362e2b9a.png

_images/7d62115bbb845829d86101aa5d23a60a77c47ab26382cb48af3bbf898ce3c82c.png

_images/4c6e39ba4eaca7651fe7e0a701efe2ccb62dcedc05d3af42d1fefaa936b4a474.png

All 0.5m NATRE data#

inds = slice(0, 14)
print(f"depth range: {np.arange(150, 3150, 100)[inds]}")
print(f"density bins: {bins[inds]}")

a05_grouped = a05.sel(cast=a05_binned.cast).groupby_bins("gamma_n", bins=bins[inds])
natre_grouped = natre.groupby_bins("gamma_n", bins=bins[inds])

ed.natre.compare_chi_distributions(a05_grouped, natre_grouped)

depth range: [ 150  250  350  450  550  650  750  850  950 1050 1150 1250 1350 1450]
density bins: [26.43 26.68 26.87 27.03 27.16 27.28 27.4  27.51 27.6  27.67 27.74 27.79
 27.83 27.87]
(26.43, 26.68]
(26.68, 26.87]
(26.87, 27.03]
(27.03, 27.16]
(27.16, 27.28]
(27.28, 27.4]
(27.4, 27.51]
(27.51, 27.6]
(27.6, 27.67]
(27.67, 27.74]
(27.74, 27.79]
(27.79, 27.83]
(27.83, 27.87]

_images/e427841b96bfca06ffb4cf05d1693a01320064dfb2dd6d059f04d216c12de57e.png

_images/6230b5357596563dce04e33ceed8373808abf38152ff49da6f2e41d665bd7516.png

_images/0e3a24b1edbac436beed7d159725339aba55ede9851940528cd58c94bb258fbf.png

_images/045eaa42546e35b723a908799a6566cf220f2fe263274c92fb258d347424ac90.png

_images/4cedc168850c6ae95de2f016dbfb4b8dfae89cd158bccbe38172a7d79941f10f.png

_images/547c926541114bf88920ffbe2135fd415aa42993f253addb649cbaa420471f08.png

_images/03808f3de355e39f57d347004d3eb42de4ddc50427491c6c88ff08abd42b2758.png

_images/f85a07b820d63345ba9442209021f515afcb871e40e64eb332355f4b0434e909.png

_images/5e593fa6ef7dadd09c6ed66a78b92dda9f457ce17fb0a2aabbf60490732d7abf.png

_images/10ada1cedfe23ef0800c0cda379c0fd02357bbf4e8f1f7bcbcfb8cbf25471c13.png

_images/94976c9f4813bd08efd1065f4f208e4e255d23de80a3f71525c93e2cd5acf68f.png

_images/b315b39f927f27a46c031a0b47fdb21dcda8e911893cb52453a151c6e01006d2.png

_images/bf62595363f50693aa5e6306a355a82426e350108396cba42def342b47bad62c.png

2m NATRE at 24.7°N#

inds = slice(0, 14)
print(f"depth range: {np.arange(150, 3150, 100)[inds]}")
print(f"density bins: {bins[inds]}")

a05_grouped = a05.sel(cast=a05_binned.cast).groupby_bins("gamma_n", bins=bins[inds])
natre_grouped = natre_2m.sel(latitude=24.7, method="nearest").groupby_bins(
    "gamma_n", bins=bins[inds]
)

ed.natre.compare_chi_distributions(a05_grouped, natre_grouped)

depth range: [ 150  250  350  450  550  650  750  850  950 1050 1150 1250 1350 1450]
density bins: [26.43 26.68 26.87 27.03 27.16 27.28 27.4  27.51 27.6  27.67 27.74 27.79
 27.83 27.87]
(26.43, 26.68]
(26.68, 26.87]
(26.87, 27.03]
(27.03, 27.16]
(27.16, 27.28]
(27.28, 27.4]
(27.4, 27.51]
(27.51, 27.6]
(27.6, 27.67]
(27.67, 27.74]
(27.74, 27.79]
(27.79, 27.83]
(27.83, 27.87]

_images/65a4521bef016a601f6b6da08fe5989b2e592ebe062f2f698bb659e054a819ab.png

_images/6149dd7c0f54ce395f2a542de7ff4738980308a8b41b29250b08c3e2f30ba620.png

_images/812d7823fa244c1f632e57200c9701be090d21e6d0309acf860f99f6958c4d1d.png

_images/1b3098677769c7a04ad5b59b0075c078ba8808b25ba5221a995c7075f65e07df.png

_images/4940a3b68afecd075d312f0b6d6439e302c4b13c77319d34a27f4705e47affcc.png

_images/156dd6b7ebc10516020b37189fec18e51a4c6b4c5c41a9c7a3bb214dc1b3c1d1.png

_images/fc7791d00a62b0a796f857e60a5a6025a5dde97add852d09bfb37c5d17897807.png

_images/17b4b0014b4f3b38ee7257ad32c5ab223e2a6aa40fcec00abf2b15284a5936f4.png

_images/aea4fcc584f55c244d83da0f2097a887dd14abd5d884b986513c66e1b033db1b.png

_images/2ce0d82acdf5b41ef50d6575e1cce26fe8324e0024f4d962f69bdff0ea26247a.png

_images/570a45e32bfd30406a5913127251c77a4825bf3ab923e04bb748ac2507265122.png

_images/0221f433bec3c97573196eb2bdd537ac6936a16cfecc9baba16e72ddfd405729.png

_images/e2ed12d63d380ee120065753b50766b613e0135dbf3b538609c28a544c0894ea.png

0.5m NATRE at 24.7°N#

inds = slice(0, 14)
print(f"depth range: {np.arange(150, 3150, 100)[inds]}")
print(f"density bins: {bins[inds]}")

a05_grouped = a05.sel(cast=a05_binned.cast).groupby_bins("gamma_n", bins=bins[inds])
natre_grouped = natre.sel(latitude=24.7, method="nearest").groupby_bins(
    "gamma_n", bins=bins[inds]
)

ed.natre.compare_chi_distributions(a05_grouped, natre_grouped)

depth range: [ 150  250  350  450  550  650  750  850  950 1050 1150 1250 1350 1450]
density bins: [26.43 26.68 26.87 27.03 27.16 27.28 27.4  27.51 27.6  27.67 27.74 27.79
 27.83 27.87]
(26.43, 26.68]
(26.68, 26.87]
(26.87, 27.03]
(27.03, 27.16]
(27.16, 27.28]
(27.28, 27.4]
(27.4, 27.51]
(27.51, 27.6]
(27.6, 27.67]
(27.67, 27.74]
(27.74, 27.79]
(27.79, 27.83]
(27.83, 27.87]

_images/969dd4144912eeb48a362ed52f1d408deeadaaed27f6e0ad4e652c8bfc88b63e.png

_images/8abdbf35d4fd3680febd8c557db268c0887b93dbb72b1bf5958e71aac7bb8d04.png

_images/d299e48f870aee9b8e18060f5419a3e7333939921c109552715848e947790d5e.png

_images/3d5562860d6aaef1972bad36c584f7a38d26b720ed9a0ffe9767952cae5f87a2.png

_images/ee0bdcb935013cc005f8dd0af5f8f9cbadbd93cfb16ba7920cca3039173b5d97.png

_images/138b58dea13687080f113fe1a389e4a125438121bdfdc837c82f892c56a6603f.png

_images/3475b435d5ae22d1ed1d59a3359bf8409015caae0e5f29f0848b428035fe9379.png

_images/f07dc79fe9380454a3f4a4a4a4ad8ffbde08d179c9afe28968652659ddee07fe.png

_images/af497b5b2316440edd043be13aede88981999f7243bec17636151fbfe835551d.png

_images/c15d4c19bb3a379c41c3cdcffcfeec251955b11672280c0a262364a123857674.png

_images/575247890c353abd23c563b9a7d2557f7706ddfa05f5b8090bcabbe2282fde5e.png

_images/cd4f2b78732027f329a84c9889fa73f7cecdf6b04ad6b1fc5cc2f5e28219b604.png

_images/ba79712b6371686a2f77c1d21478da6c1fac75be2bccbe83b34f61d574fae04b.png

Comparing some terms#

f, ax = plt.subplots(1, 3, sharey=True, constrained_layout=True)

for var, axx in zip(["chi", "hm", "residual"], ax.flat):
    dcpy.plots.fill_between_bounds(a05_binned, var, y="pres", ax=axx)
    dcpy.plots.fill_between_bounds(natre_binned, var, y="pres", color="k", ax=axx)
    axx.set_ylim((2000, 200))
    axx.set_xlabel(var)
ax[0].set_xscale("log")
ax[0].set_xlim([4e-11, 1e-8])
ax[2].set_xscale("log")

_images/f23585af1e8862b75a7c4a7bb4ab3b960763d34641db12d4d784b6eacf8d6657.png

dcpy.plots.fill_between_bounds(a05_binned, "chi", y="pres", color="k")
dcpy.plots.fill_between_bounds(natre_binned, "chi", y="pres")
plt.ylim((2000, 200))
plt.xscale("log")
plt.gcf().set_size_inches((4, 6))

_images/2b245e6a3515be31afd730cc3cdb69fd1c58a6f356f2c278699061cd12655716.png

a05_binned.gamma_n.plot(y="pres", marker="x")
natre_binned.gamma_n.plot(y="pres", marker="x")

[<matplotlib.lines.Line2D at 0x181932070>]

_images/e993417c749081103826107997a52649519f4d180028d49646127721287dd8a1.png