Upper Midwest Environmental Sciences Center

Upper Midwest Environmental Sciences Center - Macroinvertebrate Data - File Format Information

Macroinvertebrate File Format Information


Last updated on February 23, 1999

The detailed information in these sections provides the data users with information about file name signifance and a description of the data. The file naming convention explanation section describes the type of data, field station that collected the data, year of collection and data batch number, while the data descriptions include field column positions, historical summary, and field descriptions. Data from 1992 or earlier is sorted by date, sample grid, UTM Easting, and UTM Northing in that order. Data from 1993 or later is sorted by barecode only. The detailed data description section described below is available in downloadable ASCII form.

File Naming Convention Explanation

The first three characters denote the data component;
FSH = fish
INV = macroinvertebrate
VEG = vegetation transect
VGI = vegetation informal
WTR = water quality

 

The fourth character is the field station that collected the data;
1 = Lake City, MN (Mississippi River, Pool 4)
2 = Onalaska, WI (Mississippi River, Pool 8)
3 = Bellevue, IA (Mississippi River, Pool 13)
4 = Alton, MO (Mississippi River, Pool 26)
5 = Cape Girardeau, MO (Mississippi River, Open River)
6 = Havana, IL (Illinois River, LaGrange Pool)

The fifth and sixth characters denotes the year the data is from.

The seventh and eighth characters denotes that years sequential batch of data.

The FNL file name prefix denotes final data.

Data Descriptions

Field Column Position

The "Format" column contains the following format specifiers:
a = alpha character (always UPPER case)
b = a blank space
blank = a missing value
c = alpha character/symbol/digit
c(xx) = character string where xx represents the maximum number of characters allowed
9 = any digit, 0 - 9
MM/DD/YYYY = date format
HH:MI = time format
Field Name Field Position Format Value Range/
Default Value
BARCODE
1:8
99999999
0 - 99999999
SAMPLE GRID
9:13
99999
-
DATE
14:21
MM/DD/YYYY
05/01/1992 -
Current Date
22:26
a-999
-
FIELD STATION #
27:27
9
0,1,2,3,4,5,6
HABITAT CLASS
28:32
aaa-a
(See Appendix)
RIVER MILE
33:37
c99.9
-
SUBSAMPLE TAKEN
38:38
c
N,A,B
UTM ZONE (ZONE 15)
39:40
99
15,16
UTM EASTING (ZONE 15)
41:46
999999
-
UTM NORTHING (ZONE 15)
47:53
9999999
-
UTM ZONE (EMTC)
54:55
99
15,16
UTM EASTING (EMTC)
56:61
999999
-
UTM NORTHING (EMTC)
62:68
9999999
-
UTM ZONE (FIELD)
69:70
99
15,16
UTM EASTING (FIELD)
71:76
999999
-
UTM NORTHING (FIELD)
77:83
9999999
-
ACCURACY
84:87
99.9
0 - 99.9
DEPTH
88:91
99.9
0 - 99.9
# CONTAINERS
92:93
99
1 - 99
# CONTAINERS PER REPLICATE - A
94:94
9
1 - 9
# CONTAINERS PER REPLICATE - B
95:95
9
1 - 9
# CONTAINERS PER REPLICATE - C
96:96
9
1 - 9
97:97
9
1,2,3
GEAR CODE
98:98
a
(See Appendix)
99:99
a
Y,N
SUBSTRATE SAMPLE TAKEN
100:100
a
Y,N
% SUBMERSED AND FLOATING LEAF VEGETATION (Ponar)
101:101
9
0,1,2,3,4
PREDOMINANT SUBSTRATE
102:102
9
0,1,2,3,4,5,6
SHELLS
103:103
c
Y,N
DETRITUS
104:104
c
Y,N
FINE ORGANIC MATERIAL
105:105
c
Y,N
SUBMERSED VEGETATION
106:106
9
0,1,2,3,4
FLOATING LEAF
107:107
9
0,1,2,3,4
108:108
9
0,1,2,3,4
OPEN WATER
109:109
9
0,1,2,3,4
MAYFLIES
110:113
9999
0 - 9999
FINGERNAIL CLAMS
114:117
9999
0 - 9999
CORBICULA
118:121
9999
0 - 9999
MIDGES
122:125
9999
0 - 9999
ZEBRA MUSSELS
126:129
9999
0 - 9999
ODONATA
130:130
c(1)
A,P
PLECOPTERA
131:131
c(1)
A,P
TRICHOPTERA
132:132
c(1)
A,P
DIPTERA
133:133
c(1)
A,P
UNIONIDAE
134:134
c(1)
A,P
OLIGOCHAETA
135:135
c(1)
A,P
DECAPODA
136:136
c(1)
A,P
AMPHIPODA
137:137
c(1)
A,P
GASTROPODA
138:138
c(1)
A,P
COMMENTS
139:218
c(80)
-
RECORDER CODE
219:222
9999
000 - 9999
ENTRY OPERATOR
223:225
ccc
000 - 999
VERIFY OPERATOR
217:219
ccc
000 - 999
BATCH NUMBER
220:223
9999
-

Field Descriptions

BARCODE
Identifying barcode for macroinvertebrate data sheets.
uparrow
SAMPLE GRID
Sample grid area for a collection site. Grid numbers are unique descriptors for each sample site location, and are derived from a pre-determined UTM grid system developed at EMTC.
uparrow
DATE
The month, day, and year the site was sampled. Format is mm/dd/yyyy.
uparrow
PROJECT CODE
Alphanumeric LTRMP project code. Format is A-nnn, where A is a letter describing project type and nnn is a special project number. Project types are as follows:
M = RTA standardized resource monitoring
B = HREP biological response monitoring
E = Ad hoc exploratory sampling
R = Special research project
uparrow
FIELD STATION #
Numeric code identifying the field station number. Listed below are the valid codes and their corresponding field station.
1 = Lake City, MN
2 = Onalaska, WI
3 = Bellevue, IA
4 = Wood River, IL
5 = Cape Girardeau, MO
6 = Havana, IL
uparrow
HABITAT CLASS
Alphabetic code LTRMP habitat class description. Valid codes are as follows:
BWC = Backwater contiguous
IMP = Impounded
MC = Main channel
MCB = Main channel border
SC = Side channel
TDL = Tributary delta lake (Lake Pepin only)
uparrow
RIVER MILE
The location of the site to the nearest tenth of a river mile.
uparrow
SUBSAMPLE TAKEN
Code if a subsample was taken (replaced by SUBSTRATE SAMPLE TAKEN in 1993). Valid codes are as follows:
Y = Yes.
N = No
uparrow
UTM ZONE (ZONE 15)
Numeric field to record Global Positioning System (GPS) zone. LTRMP data resides in either zone 15 or 16. If the GPS zone = 16 the GPS zone is converted to zone 15.
uparrow
UTM EASTING (ZONE 15)
East UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is computed at EMTC and is then used by the collection. This is the coordinate recorded in the field and converted to zone 15.
uparrow
UTM NORTHING (ZONE 15)
North UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is computed at EMTC and is then used by the collection. This is the coordinate recorded in the field and converted to zone 15.
uparrow
UTM ZONE (EMTC)
Numeric field to record Global Positioning System (GPS) zone. LTRMP data resides in either zone 15 or 16. This zone is derived at EMTC and is then used by the collection crews to locate where they are to do their sampling.
uparrow
UTM EASTING (EMTC)
East UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is derived at EMTC and is then used by the collection crews to locate where they are to do their sampling.
uparrow
UTM NORTHING (EMTC)
North UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is derived at EMTC and is then used by the collection crews to locate where they are to do their sampling.
uparrow
UTM ZONE (FIELD)
Numeric field to record Global Positioning System (GPS) zone. LTRMP data resides in either zone 15 or 16. This zone is derived in the field when the collection crew can not sample the (EMTC derived) collection site. This is the actual location of the collection site used by the collection crews.
uparrow
UTM EASTING (FIELD)
East UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is derived in the field when the collection crew can not sample the (EMTC derived) collection site. This is the actual location of the collection site used by the collection crews.
uparrow
UTM NORTHING (FIELD)
North UTM (Universal Transverse Mercator) coordinates of collection sites. This coordinate is derived in the field when the collection crew can not sample the (EMTC derived) collection site. This is the actual location of the collection site used by the collection crews.
uparrow
ACCURACY
Numeric field to record a measure of positioning accuracy. Record Percent Dilution of Precision (PDOP) from GPS devices and the following codes for base map cross-reference and revisits to marked fixed sites:
00.1 = Almost certain accuracy within 100 m because of immediate proximity to uniquely identifiable features (undisturbed marker, wing dam, day mark, etc.).
00.2 = High confidence of accuracy within 100 m because of agreement between general site appearance and identifiable features on a base map.
00.3 = Other than above.
uparrow
DEPTH
Water depth recorded in meters.
uparrow
# CONTAINERS
The total number of sample containers collected at the site and taken back to the lab (replaced # containers per replicate A, B, C in 1993).
uparrow
# CONTAINERS PER REPLICATE - A
The number of sample containers collected at the site and taken back to the lab for the first sample taken at a site (applies to 1992 only).
uparrow
# CONTAINERS PER REPLICATE - B
The number of sample containers collected at the site and taken back to the lab for the second sample taken at a site (applies to 1992 only).
uparrow
# CONTAINERS PER REPLICATE - C
The number of sample containers collected at the site and taken back to the lab for the third sample taken at a site (applies to 1992 only).
uparrow
SUMMARY CODE
A numeric code representing whether or not the sample was kept and the circumstances surrounding the decision to accept or reject. Valid codes are as follows:
1 = Sample taken
2 = Sample taken but not kept due to hard substrate
3 = Sample taken but not kept due to detritus, shells, etc.
4 = Other
uparrow
GEAR CODE
A code representing the gear type used to collect the sample. Currently, there is only one code for macroinvertebrates (P = Ponar).
uparrow
LAB SAMPLE TAKEN
Code if the sample was chosen for lab analysis. Valid codes are as follows:
Y = Yes, sample chosen for lab analysis.
N = No, sample not chosen for lab analysis.
uparrow
SUBSTRATE SAMPLE TAKEN
A code describing if a subsample is taken, and if so, what size of sampler was used (replaced SUBSAMPLE TAKEN in 1993). Valid codes are as follows:
N = No subsample taken.
A = Large subsample (268 square centimeters)
B = Small subsample (162 square centimeters)
uparrow
% SUBMERSED AND FLOATING LEAF VEGETATION (Ponar)
This visual observation is a subjective measurement based on the column of water and sediment the Ponar falls through. A numeric code is recorded representing the percentage of submersed and floating-leaf vegetation. This field will provide data on whether or not macroinvertebrates associated with vegetation are in the sample and the relationship of sites with and without vegetation.
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
PREDOMINANT SUBSTRATE
A code to represent the qualitative observation of sediments based on visual and tactile observations at the site. Valid codes are as follows:
1 = Hard clay
2 = Silt/Clay
3 = Mostly silt/clay with sand
4 = Mostly sand
5 = Sand
6 = Gravel/rock
uparrow
SHELLS
A code representing the percentage of dead shells in the sample wash frame. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
DETRITUS
A code representing the percentage of detritus (fibrous material on the surface and within the sediment) in the sample wash frame. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
FINE ORGANIC MATERIAL
A code indicating if fine organic material (characterized by black coloration and odor) is present in the sample. Valid codes are as follows:
Y = Yes, fine organic material present
N = No, fine organic material not present
uparrow
SUBMERSED VEGETATION
A code representing the percentage of submersed vegetation within a 15-m radius from the boat. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
FLOATING LEAF
A code representing the percentage of floating leaf vegetation within a 15-m radius from the boat. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
EMERGENT VEGETATION
A code representing the percentage of emergent vegetation within a 15-m radius from the boat. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
OPEN WATER
A code representing the percentage of open water within a 15-m radius from the boat. Valid codes are as follows:
0 = 0%
1 = 1%-20%
2 = 21%-50%
3 = 51%-90%
4 = 91%-100%
uparrow
MAYFLIES
The number of mayfly larvae found in the collection sample for the site.
uparrow
FINGERNAIL CLAMS
The number of fingernail clams found in the collection sample for the site.
uparrow
CORBICULA
The number of corbicula found in the collection sample for the site.
uparrow
MIDGES
The number of midges found in the collection sample for the site.
uparrow
ZEBRA MUSSELS
The number of zebra mussels found in the collection sample for the site (not started recording until 1995).
uparrow
COMMENTS
Eighty-character field to record miscellaneous comments and observations.
uparrow
RECORDER CODE
This number uniquely identifies the person who coded the data sheet.
uparrow
ENTRY OPERATOR
Alphanumeric field to identify the operator responsible for entering (keying) the macroinvertebrate data into an electronic format.
uparrow
VERIFY OPERATOR
Alphanumeric field to identify the operator responsible for verifying the electronic keyed macroinvertebrate data.
uparrow
BATCH NUMBER
Administrative field used to track keyed data batches.
uparrow

Disclaimer

These data have been generated by State agencies under contract with the Department of the Interior and the Upper Midwest Environmental Sciences Center (UMESC). Although these data have been processed successfully on a computer system at the UMESC, no warranty, expressed or implied, is made regarding the accuracy or utility of the data on any other system or for general or scientific purposes, nor shall the act of distribution constitute any such warranty. However, error correction is a continuing process, and the UMESC routinely makes changes to data as errors are discovered. The file date is referred to as the release date and should be mentioned in all references to data. These data have been provided as a unit package to ensure that you receive a valid dataset collected in a consistent manner. Methods, purposes, and conditions can vary dramatically among data-collecting efforts and these data should not be combined with other datasets from the UMESC or other sources unless the user fully understands the procedures and purposes used to create each set of data and can verify that data are comparable across sources.



Page Last Modified: April 17, 2018