read.adv.sontek.text {oce} | R Documentation |
Read an adv File
Description
Read an ADV data file, producing an object of type adv
. This
function works by transferring control to a more specialized function,
e.g. read.adp.nortek()
and read.adp.sontek()
,
and in many cases users will find it preferable to either use these or
the several even more specialized functions, if the file type is
known.
Usage
read.adv.sontek.text(
file,
from = 1,
to,
by = 1,
tz = getOption("oceTz"),
originalCoordinate = "xyz",
transformationMatrix,
longitude = NA,
latitude = NA,
encoding = "latin1",
monitor = FALSE,
debug = getOption("oceDebug"),
processingLog = NULL
)
Arguments
file |
a connection or a character string giving the name of the file
to load. It is also possible to give |
from |
index number of the first profile to be read, or the time of
that profile, as created with |
to |
indication of the last profile to read, in a format matching that
of |
by |
an indication of the stride length to use while walking through
the file. This is ignored if |
tz |
character string indicating time zone to be assumed in the data. |
originalCoordinate |
character string indicating coordinate system, one
of |
transformationMatrix |
transformation matrix to use in converting beam
coordinates to xyz coordinates. This will over-ride the matrix in the file
header, if there is one. An example is |
longitude |
optional signed number indicating the longitude in degrees East. |
latitude |
optional signed number indicating the latitude in degrees North. |
encoding |
a character value that indicates the encoding to be used for
this data file, if it is textual. The default value for most functions is
|
monitor |
boolean value indicating whether to indicate the progress
of reading the file, by using |
debug |
a flag that turns on debugging. The value indicates the depth
within the call stack to which debugging applies. For example,
|
processingLog |
if provided, the action item to be stored in the log. This parameter is typically only provided for internal calls; the default that it provides is better for normal calls by a user. |
Details
Files without headers may be created in experiments in which a data logger was set up to monitor the serial data stream from an instrument. The lack of header information places a burden on the user, who must supply such basic information as the times of observations, the instrument orientation, the instrument coordinate system, etc. Example 3 below shows how to deal with such files. Three things should be noted.
The user must choose the appropriate
read.adv
variant corresponding to the instrument in question. (This is necessary becauseoceMagic()
, which is used by the genericread.oce()
routine, cannot determine the type of instrument by examining a file that lacks a header.)The call to the
read
function must include a start time (start
) and the number of seconds between data (deltat
), again, because the instrument data stream may lack those things when the device is set to a serial mode. Also, of course, it is necessary to setheader=FALSE
in the function call.Once the file has been read in, the user will be obliged to specify other information, for the object to be well-formed. For example, the
read
function will have no way of knowing the instrument orientation, the coordinate system being used, the transformation matrix to go from"beam"
to"xyz"
coordinates, or the instrument heading, pitch, and roll, to go from"xyz"
coordinates to"enu"
coordinates. Such things are illustrated in example 3 below.
In ADV data files, velocities are coded to signed 2-byte integers, with a
scale factor being used to convert to velocity in metres per second. These
two facts control the maximum recordable velocity and the velocity
resolution, values that may be retrieved for an ADV object name d
with d[["velocityMaximum"]]
and d[["velocityResolution"]]
.
Value
An adv object that contains measurements made with an ADV device.
The metadata
contains information as given in the following table.
The Nortek name'' is the name used in the Nortek System Integrator Guide (reference 1) and the
Sontek name” is the name used in the relevant
Sontek documentation. References are given in square brackets.
metadata name | Nortek name | Sontek name | Meaning |
manufacturer | - | - | Either "nortek" or "sontek" |
instrumentType | - | - | Either "vector" or "adv" |
filename | - | - | Name of data file(s) |
latitude | - | - | Latitude of mooring (if applicable) |
longitude | - | - | Longitude of mooring (if applicable) |
numberOfSamples | - | - | Number of data samples in file |
numberOfBeams | NBeams (reference 1, p18) | - | Number of beams (always 3) |
numberOfBeamSequencesPerBurst | NPings | - | number of beam sequences per burst |
measurementInterval | MeasInterval (reference 1 p31) | - | |
samplingRate | 512/(AvgInterval) (reference 1 p30; reference 4) | - #' | |
The data
list contains items with names corresponding to adp
objects, with an exception for Nortek data. Nortek instruments report some
things at a time interval that is longer than the velocity sampling, and
these are stored in data
as timeSlow
, headingSlow
,
pitchSlow
, rollSlow
, and temperatureSlow
; if burst
sampling was used, there will also be items recordsBurst
and
timeBurst
.
The processingLog
is in the standard format.
Nortek files
Sampling-rate and similar issues
The data format is inferred from the System Integrator Guide (reference 1A) and System
Integrator Manual (reference 1B). These document lacks clarity in spots, and so
read.adv.nortek
contains some assumptions that are noted here, so
that users will be aware of possible problems.
A prominent example is the specification of the sampling rate, stored in
metadata$sampingRate
in the return value. Repeated examination of
the System Integrator Guide (reference 1) failed to indicate where this value is
stored in the various headers contained in Vector datasets. After some
experimentation with a few data files, read.adv.nortek
was set up to
calculate metadata$samplingRate
as 512/AvgInterval
where
AvgInterval
is a part of the User Configuration'' header (reference 1 p30), where the explanation is
average interval in seconds”). This formula was
developed through trial and error, but it was confirmed later on the Nortek
discussion group, and it should appear in upcoming versions of (reference 1).
Another basic issue is the determination of whether an instrument had
recorded in continuous mode or burst mode. One might infer that
TimCtrlReg
in the User Configuration'' header (reference 1 p30) determines this, in bits 1 and 2. However, this was the case in test files available to the author. For this reason, `read.adv.nortek` infers the mode by reverse engineering of data files of known configuration. The present version of `read.adv.nortek` determines the sampling mode from the ```NRecords`'' item of the
Vector Velocity Data” header, which seems
to be 0 for data collected continuously, and non-zero for data collected in
bursts.
Taking these things together, we come upon the issue of how to infer
sampling times for Nortek instruments. There do not seem to be definitive
documents on this, and so read.adv.nortek
is based partly on
information (of unknown quality) found on Nortek discussion boards. The
present version of read.adv.nortek
infers the times of velocity
observations differently, depending on whether the instrument was set to
record in burst mode or continuous mode. For burst mode, times stated in
the burst headers are used, but for continuous mode, times stated in the
“vector system data” are used. On the advice found on a Nortek discussion
board, the burst-mode times are offset by 2 seconds to allow for the
instrument warm-up period.
Handling IMU (inertial measurement unit) data
Starting in March
2016, read.adv.nortek
has offered some support for handling IMU
(inertial measurement unit) data incorporated into Nortek binary files. This
is not described in the Nortek document named System Integrator Guide'' (reference 1A) but it appeared in
System Integrator Manual” (reference 1B;
reference 1C). Confusingly, 1B described 3 varieties of data, whereas 1C does not
describe any of these, but describes instead a fourth variety. As of March
2016, read.adv.nortek
handles all 4 varieties, because files in the
various schemes appear to exist. In oce
, the varieties are named
after the byte code that flags them. (Variety c3
is the one
described in (reference 1C); the others were described in (reference 1B).)
The variety is stored in the metadata
slot of the returned object as
a string named IMUtype
.
For each variety, the reader is cautioned that strong tests have not been
performed on the code. One way to test the code is to compare with textual
data files produced by the Nortek software. In March 2016, an oce
user shared a dataset of the c3
variety, and this permitted detailed
comparison between the text file and the values inferred by
read.adv.nortek
. The test suggested agreement (to within the
resolution printed in the text file) for velocity (v
in the
data
slot), signal amplitude (a
), correlation (q
),
pressure (p
), the three components of IMU delta angle
(IMUdeltaAngleX
etc), and all components of the rotation matrix
(IMUrotation
). However, the delta velocity signals did not match,
with IMUdeltaVelocityX
disagreeing in the second decimal place,
IMUdeltaVelocityY
component disagreeing in the first, and
IMUdeltaVelocityZ
being out by a factor of about 10. This is github
issue 893 (https://github.com/dankelley/oce/issues/893
).
Variety
c3
(signalled by byte 5 of a sequence being0xc3
) provides information on what Nortek calls DeltaAngle, DeltaVelocity and Orientation Matrix. (Apart from the orientation matrix, Nortek provides no documentation on what these quantities mean.) In the object returned byread.adv.nortek
, these are stored in thedata
slot asIMUdeltaAngleX
,IMUdeltaAngleY
,IMUdeltaAngleZ
,IMUdeltaVelocityX
,IMUdeltaVelocityY
,IMUdeltaVelocityZ
, andIMUrotation
, all vectors except the last, which is a 3D array. In addition to these,IMUtimestamp
is a time-stamp, which is not defined in the Nortek documents but seems, from IMU documents (reference 5), to be defined based on a clock that ticks once per 16 microseconds. Caution may be required in dealing with this timestamp, since it seemed sensible in one test case (varietyd3
) but kept reseting to zero in another (varietyc3
). The lack of Nortek documentation on most of these quantities is a roadblock to implementingoce
functions dealing with IMU-enabled datasetsVariety
cc
(signalled by byte 5 of a sequence being0xcc
) provides information on acceleration, angular rotation rate, magnetic vector and orientation matrix. Each is a timeseries. Acceleration is stored in thedata
slot asIMUaccelX
,IMUaccelY
,IMUaccelz
. The angular rotation components areIMUngrtX
,IMUngrtY
andIMUngrtz
. The magnetic data are inIMUmagrtx
,IMUmagrty
andIMUmagrtz
. Finally,IMUmatrix
is a rotation matrix made up from elements namedM11
,M12
, etc in the Nortek documentation. In addition to all of these,IMUtime
stores time in seconds, with an origin whose definition is not stated in reference 1B.Variety
d2
(signalled by byte 5 being0xd2
) provides information on gyro-stabilized acceleration, angular rate and magnetometer vectors. The data storedMUaccelX
,IMUangrtX
,IMUmagrtX
, with similar forY
andZ
. Again, time is inIMUtime
. This data type has not been tested as of mid-March 2016, because the developers do not have a test file with which to test.Variety
d3
(signalled by byte 5 being0xd3
) provides information on DeltaAngle, DeltaVelocity and magnetometer vectors, stored inIMUdeltaAngleX
,IMUdeltaVelocityX
, andIMUdeltaMagVectorX
, with similar forY
andZ
. Again, time is inIMUtime
. This data type has not been tested as of mid-March 2016, because the developers do not have a test file with which to test.
Note on file name
The file
argument does not actually name a file. It names a basename
for a file. The actual file names are created by appending suffix
.hd1
for one file and .ts1
for another.
How the binary file is decoded
This file type, like other acoustic-Doppler types, is read with a hybrid R/C++ system, for efficiency. The processing steps are sketched below, for users who want to inspect the code or build upon it.
In R,
readBin()
is used to insert the file contents into a vector of typeraw
.In C++, this raw vector is scanned byte by byte, to find the starting indices of data "chunks", or subsections of the data that correspond to individual sampling times. Checksum computations are also done at this stage, to detect possible data corruption. Warnings are issued for any bad chunks, and they are skipped in further processing. The valid starting points are then passed back to R as a vector of type
integer
.In R,
readBin()
is used to read the components of each chunk. For speed, this is done in a vectorized fashion. For example, all the velocities in the whole file are read in a single call toreadBin()
. This process is done for each of the data fields that are to be handled. Importantly, thesereadBin()
calls are tailored to the data, using values of thesize
,endian
andsigned
parameters that are tailored to the structure of the given component. Scaling factors are then applied as required, to convert the components to physical units.Finally, in R, the acquired items are inserted into the
data
ormetadata
slot of the return value, according to oce convention.
Author(s)
Dan Kelley
References
1A. Nortek AS. System Integrator Guide (paradopp family of products). June 2008. (Doc No: PSI00-0101-0608). (Users may find it helpful to also examine newer versions of the guide.)
1B. Nortek AS. System Integrator Manual. Dec 2014.
(system-integrator-manual_Dec2014_jan.pdf
)
1C. Nortek AS. System Integrator Manual. March 2016.
(system-integrator-manual_Mar2016.pdf
)
SonTek/YSI ADVField/Hydra Acoustic Doppler Velocimeter (Field) Technical Documentation (Sept 1, 2001).
Appendix 2.2.3 of the Sontek ADV operation Manual, Firmware Version 4.0 (Oct 1997).
Nortek Knowledge Center (http://www.nortekusa.com/en/knowledge-center)
A document describing an IMU unit that seems to be close to the one named in (references 1B and C) as being an adjunct to Nortek Vector systems is at
http://files.microstrain.com/3DM-GX3-35-Data-Communications-Protocol.pdf
See Also
Other things related to adv data:
[[,adv-method
,
[[<-,adv-method
,
adv-class
,
advSontekAdrFileTrim()
,
adv
,
applyMagneticDeclination,adv-method
,
beamName()
,
beamToXyz()
,
enuToOtherAdv()
,
enuToOther()
,
plot,adv-method
,
read.adv.nortek()
,
read.adv.sontek.adr()
,
read.adv.sontek.serial()
,
read.adv()
,
rotateAboutZ()
,
subset,adv-method
,
summary,adv-method
,
toEnuAdv()
,
toEnu()
,
velocityStatistics()
,
xyzToEnuAdv()
,
xyzToEnu()
Examples
## Not run:
library(oce)
# A nortek Vector file
d <- read.oce("/data/archive/sleiwex/2008/moorings/m05/adv/nortek_1943/raw/adv_nortek_1943.vec",
from=as.POSIXct("2008-06-26 00:00:00", tz="UTC"),
to=as.POSIXct("2008-06-26 00:00:10", tz="UTC"))
plot(d, which=c(1:3,15))
## End(Not run)