Prepared by
Forrest M. Council
Carolyn Williams
University of North Carolina
Highway Safety Research Center
730 Airport Road, Suite 300
Chapel Hill, NC 27599-3430
Yusuf M. Mohamedshah
LENDIS Corporation
6300 Georgetown Pike
McLean, VA 22101
Prepared for
Federal Highway Administration
Office of Safety and Traffic Operations Research and Development
U.S. Department of Transportation
Washington, DC 20590
August, 2000
(NOTE: Changes from the previous edition of the Guidebook are shown in bold and italics.)
Utah's basic highway information system is one of the most complete data base management systems found in any of the HSIS States in terms of the number of files included in the system and the flexibility of output. Since the mid-1980's, the system has either been known as the AHighway Information System@ or the ASR/Route and Traffic System.@ It has been a relational system built on a data base management system named ADABAS. The flexibility of ADABAS allows Utah to "spin off" any number of variables into different file formats. In 2000, the older IBM-based system is being converted to a Client/Server system, run on a UNIX platform, with Informix as the relational database.
The raw files requested for the HSIS include the following:
Accident data:
The decision to request subfiles rather than one large accident file which would capture all accident, vehicle, and occupant data together was made in order to be consistent with other HSIS State formats. For additional consistency, variables related to drivers which did not originally appear on the Vehicle subfile have been copied from the Occupant subfile into that subfile. For ease of use, the three accident subfiles, the Roadlog File, the Horizontal Curve File, and the Grade File have been converted to SAS files.
Raw file data are provided to the Highway Safety Research Center where they are retained as backup information. The documentation (variable listings, definitions, etc.) for these raw files and for the SAS files that are developed from them are available at FHWA offices.
Beginning in 1994, the HSIS system was converted to a relational database for internal use. This database, using a SYBASE system, stores the data received from Utah and other states, and the data files for a given state are linked and manipulated using SQL language. However, this conversion from the original SAS-based system to the newer relational system is somewhat transparent to the end-user of the data since the output files produced by SYBASE for modeling and analysis will be SAS formatted. As in the past, we have continued to produce SAS format libraries for each of the variables in each of the files. Because it is envisioned that the majority of analyses will utilize these SAS files and formats, this Guidebook will concern these SAS files -- their formats, completeness, and quality. While single-variable tables for key variables from each file were published in the past as “Volume II”, this is no longer the case. Users may request specific single- or multiple-variable tables through the HSIS “Data Request” procedure found on this web site.
As noted above, the accident data are in three separate files. The Accident Subfile (including crashes from 1985-97), containing basic information on accident type, location, environment, etc. can be linked with the Vehicle Subfile (which contains information on each vehicle in the crash and each driver) and the Occupant Subfile (which contains information on each occupant in each vehicle and non-occupants such as pedestrians, pedalcyclists, etc.) through use of the accident case number, vehicle number, and year. The Accident and other major files can be linked through use of a route-milepoint system and year with the milepoints for a given route beginning at the State line.
The Roadlog File contains yearly characteristics of 14,000 miles of the road system including shoulder and median information, pavement type and width, lane information, etc. Thus, a separate file exists for 1985, 1986, 1988, 1989, and 1991-1994, and 1996-1997 allowing correct roadway information to be linked with accidents occurring during a given year. (Note that there is no 1995 file due to data linkage problems that will be described later in this narrative.) This file is composed of homogeneous segments of all state and federal-aid roadways which are classified as collectors and above, with a new segment defined each time any of the recorded variables changes. In addition, unlike some other State data bases which use separate files, this Roadlog file also contains traffic information (AADT).
The Horizontal Curve and Vertical Grade Files were generated to produce information on specifics of curvature. This information contains degree of curve, direction of curve, and percent and direction of grade. While a new section in the Roadlog File is begun each time any variable changes (e.g., pavement type), new sections in these files begin only at the beginning and end points of curves and grades. These data can be merged with the Roadlog and Accident Files through the same route-milepoint key. (As described in more detail in the later sections, these files are no longer updated by Utah after 1994.)
Details of the three Accident subfiles, the Roadlog File, the Horizontal Curve File, and the Grade File are presented in the following section.
As with the other states in the HSIS system, the Utah accidents that are captured in the system are only those that can be linked with the Roadlog File (the roadway inventory data). In Utah, approximately 85-90 percent of the total reportable crashes in the state are linkable to the inventory information. Non-linkable crashes primarily consist of crashes on roads not mileposted or inventoried where the crash was given a Azone@ location, and crashes which can be located to routes, but not to specific mileposted locations on the routes. Most of these locations (and crashes) are on lower-order county and municipal roads.
The accident data in the State of Utah are coded and edited by staff at Utah DOT. This group locates each of the accidents based on the inputs provided on the report by the investigating officer. Currently, there are thirteen years of accident data in the Utah HSIS files -- 1985-1997. This data set includes between 35,000 and 50,000 accidents per year between 1985 and 1997 , 63,000 -95,000 vehicles per year, and 91,000 - 140,000 occupants/pedestrians per year. Unlike certain other HSIS States, the Occupant subfile contains information on each occupant in the crash, whether injured or not.
To insure data quality, coders within the DOT Division of Safety input the data received from police officers in both urban and rural locations. The same report form is used statewide with a reporting level of $750 total damage and/or personal injury. The coders in Utah DOT headquarters correct the information based on what they see on the form, and actually call the police officer back if they have some question on the accident location or other data. Thus, the feeling is that location information and other variables are approximately 95 percent correct in the rural areas. Re-verification checks for samples of these variables are conducted to check the accuracy. Information provided by the local police is not quite as accurate as that provided by the state police, but it is still considered to be quite accurate.
Based on examination of single- versus multi-vehicle crash percentages and on the fact that city police agencies investigate over 50 percent of the crashes, it appears that the Utah files are slightly weighted toward urban crashes, although not as much as certain other HSIS States.
To investigate data completeness and accuracy, Utah staff was questioned about their experience with files, and a series of single variable tabulations of more than 70 key variables were produced and examined for each year of data in the system. This allows HSIS staff to look for both significant percentages of uncoded or Aunknown@ data within each variable and for changes in individual variable codes across time. When problems are found, Utah staff is contacted to provide an explanation or to conduct further internal checks. The variables examined ranged from various accident descriptors including accident type, day of week, number of vehicles involved, accident severity, road surface, object struck, and investigating agency; to vehicle-related variables involving travel and impact speed, contributing factors and vehicle type; to occupant variables related to safety equipment use, age, sex, and alcohol use.
The quality control checks indicated some problems with a number of variables. However, the problems identified were generally minor in nature, concerned variables which will not be used very often in FHWA analyses, or were problems which can be overcome to some extent by knowledge of how the data behave.
The problems identified can be categorized into a number of categories including (1) variables which have a relatively high percentage of uncoded data; (2) changes in the 1986 accident report form and thus the coding of accident variables; (3) variables which appear to have inconsistent coding across years (often the result of the officers switching from leaving the variable uncoded to a Ano@ or Anone@ code); (4) variables which have problems that could well be related to the nature of the codes available to the investigating officer (e.g., the lack of a Anot injured@ code in the AInjured Body Area@ variable); (5) variables which appear to have problems due to possible errors in computation programs within the Utah system (which have subsequently been corrected); and (6) variables that simply represent poor data. In each of these cases where a problem is found, a ANOTE@ is included under the variable description in the SAS format section which follows. As might be expected, in some cases the data in certain variables improve over time. These instances are noted also.
With respect to the second issue, discussions with Utah staff concerning inconsistencies in the data across years brought to light the fact that there were major changes in the accident report form and thus the way certain variables were coded in 1986. For example, within the "Accident Type" variable (and the related Asubsequent event@ variables), there were no instances of "ran-off-road left or right" in 1985, but a suspiciously large number of instances of "ran-off through median" codes for that year. The explanation was that the code now used for "ran-off through median" (09) represented all ran-off-road crashes prior to 1986. The coding was changed early in 1986 to allow categorizing the ran-off-road accidents into right, left, and through median. In other cases (e.g., ATraffic Control Type"), the variable was expanded in 1986 to include additional variables.
To further investigate the accuracy of the data, comparisons were made of pairs of variables in the file which should have been somewhat similar according to their definitions. In addition, the single-variable tabulations were compared to the other HSIS States to see if large differences existed. The comparisons within the files usually indicated a high degree of consistency between similar variables.
Finally, as noted earlier, the majority of driver-related variables appear both on the Vehicle and Occupant subfiles. However, due to the layout of the original raw files, a limited number of variables (e.g., "Driver License Type," "Years Driving Experience,@ "Drivers License Restriction") appear only on the Vehicle subfile. It is noted that since there are some vehicles which would not have drivers or driver data (e.g., parked vehicles or hit-and-run vehicles), the totals at the bottom of tables for the driver variables copied from the Occupant subfile will be slightly lower than the totals for the driver variables in the original Vehicle subfile. For the latter Vehicle-subfile variables, there will be slightly more "not coded" or "unknown" driver codes.
In summary, it appears that there are a number of variables in the Utah accident files in which there are potential errors that the analyst must be aware of. Fortunately, the majority of these variables will not be primary variables in much of the analysis done by FHWA. However, the problems must be kept in mind in any analyses conducted and, as with other Guidebooks, these errors are noted within the format statements for each variable affected.
Supplemental data on a vehicles characteristics (e.g., engine information, air bag presence, wheelbase, etc.) can be developed by decoding the VIN (Vehicle Identification Number) collected on crash forms by police agencies in some states. For Accident/Vehicle Files from 1987- 94, formats for the decoded VIN files were included in the original Guidebooks, and separate VIN files were developed for each year of data for the states of Utah, Illinois, Michigan, and North Carolina. When a vehicle in the Vehicle File had a legitimate VIN, this decoding was done using the VINDICATOR program distributed by the Highway Loss Data Institute of IIHS (the Insurance Institute of Highway Safety). This detailed information could then be linked back with the vehicle file using the Accident Case Number and the Vehicle Number.
Because of the very limited use of the VIN data by researchers and difficulties we encountered with the decoding process after 1994, the VIN variable listing found in the original Guidebooks for these states has been removed from this version. However, HSIS continues to capture the VIN, and has the capabilities of decoding the VIN for users. For Utah, decoded VIN information is available for 1987-96, and VINs for later years can be decoded upon request. For more information on this data, contact the HSIS staff.
Of the approximately 50,000 miles of highway in Utah, approximately 14,000 are included in the HSIS Roadlog File for roadway inventory and accident linkage purposes. Annual files exist for 1985, 1986, 1988, 1989, 1991- 1994, and 1996 -1997. Inventory information on either the 1989 or 1991 files are felt to be suitable for use with 1990 accident data. There is no 1995 file in HSIS. Utah staff Are-mileposted@ crashes for 1995, but did not begin the corresponding Roadlog File re-mileposting until 1996. Thus, the 1995 crashes did not link accurately with the 1995 Roadlog File. Inventory information on the 1996 file is felt to be suitable for linkage with the 1995 crash data. (In some multi-year analyses conducted in the past with HSIS data, multiple years B e.g., three years B of crash data have been linked with one year of inventory data, most often the Amiddle year.@ However, this remileposting means that in such analyses, the researcher should NOT link 1994 and earlier crash data with post-1994 inventory data, or 1995 and later crash data should not be linked with pre-1996 roadway inventory data.)
As noted below, Utah staff indicates highest confidence in the accuracy of the inventory data on the 8,200 miles of roadway on the Federal Aid system and non-Federal Aid "State" or AOther Federal@ system. This includes approximately 950 miles of Interstate roadways, approximately 4,900 miles of other State-system roadways (almost all of which is also on the Federal Aid system), approximately 1,760 miles of non-State, Federal-Aid roadway (primarily Federal-Aid county and municipal roadways) and 640 miles of Aother federal@ roads (primarily US Forest roads). Of the 35,000 - 50,000 accidents that occur each year in Utah, it is estimated that approximately 75 percent occur on roads that are on the Federal Aid System or the non-Federal Aid, State system. The table below provides a categorization of all two-way paved mileage in the system (i.e., unpaved mileage and uncoded mileage is included in the AOther@ category).
Much of the information of the inventory system in Utah is an outgrowth of effort related to collecting data for the Highway Performance Monitoring System (HPMS) B both the AUniverse@ and the ASample@ files. The HPMS Sample-file represents approximately 18 percent of the functionally classifiable roadways in the State. This inventory is updated on a three-year cycle. Using the photologging system, some of the variables that are on the HPMS samples are expanded to cover part or all of the universe of roads within the HSIS Roadlog File. This expansion is being done on a year-to-year priority basis, where the priority is based on the importance of a particular variable for use. And as would be expected, variable coding completeness is related to functional classification of roadways, with the lower functional classes (e.g., minor collectors and local roads) exhibiting less complete coding.
| Roadway Category | Mileage |
|---|---|
| Urban freeways | 175.87 |
| Urban freeways < 4 Lanes | 4.01 |
| Urban multilane divided non-freeways | 383.27 |
| Urban multilane undivided non-freeways | 80.25 |
| Urban 2 lane highways | 1222.90 |
| Rural freeways | 770.69 |
| Rural freeways < 4 Lns | 0.18 |
| Rural multilane divided non-freeways | 119.11 |
| Rural multilane undivided non-freeways | 194.14 |
| Rural 2 lane highways | 6451.80 |
| Other | 3880.30 |
| Total | 13282.52 |
Through examinations of the data across years and by working with the Utah staff, the HSIS staff has attempted to define those variables which do and do not fully cover the entire HSIS system. This is difficult at times, since the expansion is a continuing process and the nature and degree of the expansion differs among variables. In some cases, the expansion may be adding additional miles of coding for a given variable. In other, new variables are added to our files in a certain year (e.g., approximately 30 variables were added in HSIS database in 1991). In addition, there are variables in which uncoded data for some HSIS sections (i.e., variables which are only collected on HPMS-sample sections) have been coded as A0" in the files we receive from Utah. Since A0" is sometimes a legitimate value, this can lead to interpretation problems. Where possible, we have attempted to recode the data based on information concerning the location of HPMS-Sample sections (i.e., replace A0" with blanks). Finally, because the HPMS Sample sections are often longer than the HSIS sections, some variables in the HPMS file would not accurately apply to each HSIS sub-section unless corrected. For example, "Number of Signalized Intersections" for an HPMS sample section which covered three Roadlog sections would not apply accurately to each of the Roadlog sections. Fortunately, for most variables of this type, the Utah staff has corrected the values so that they are accurate for the HSIS subsections.
Since this expansion from HPMS data does lead to possible interpretational and use problems with some variables, HSIS staff has examined each variable in the data across all years of the files and has worked with the Utah staff in an attempt to define possible issues. These issues are then documented in a ANOTE@ in the variable description found in the later SAS format section.
In general, variables which are essentially fully coded for all functional classifications include such key variables as county, section length, rural/urban designation, functional system, AADT, number of through lanes, access control, median type, surface/pavement type, shoulder type, national highway system indicator, and others. Most of the remainder of the variables are available for interstate, primary arterial, major collector, and urban-collector across the years, and have less full coding for minor collectors and local roads. Finally, there is a third set which is coded only for HSIS sections which fall within HPMS Sample sections. Specific information on level of coding is noted in the format section for individual variables.
As in the other HSIS states, each record on the HSIS Roadlog File represents a homogeneous section defined by a change in any of the variables existing on the file. Thus, for example, if pavement width changes for one or more sections, new section "beginnings" may be produced or old sections may be combined from one year to the next.
Approximately 70 percent of the computer records (i.e., homogeneous sections)on the file are on the federal-aid system. These federal-aid records represent approximately 56 percent of the total mileage covered in the file. Thus, the sections in the federal-aid system are somewhat shorter than the sections in the non-federal-aid system. This is logical, in that the more detailed inventory (and thus, more Abreaks@ in homogeneous sections) is found on the federal-aid system. Approximately 85 percent of the mileage in the file is rural in nature. Approximately 85 percent of the 14,000 inventoried miles are two-lane sections, and 12 percent of the mileage have four or more lanes. Approximately half of the roadways on the file are low volume in nature with 51 percent of the roadway mileage having AADT's of 500 or less vehicles per day.
To assess data accuracy and completeness, HSIS staff again relied on past analyses with the data, conversations with Utah staff, and computerized and manual comparisons across years of single-variable tabulations for key variables. As noted above, most of the variables on the 14,000 mile inventory file are coded very completely with very few missing values. However, there are high percentages of unknowns for some variables, particularly those which are only coded for HPMS Sample sections, or some expansion of that coding.
Two new variables, RODWYCLS and MVMT, have been created by HSIS staff in the roadway segment file of each of the HSIS states. The RODWYCLS (Roadway Class) variable is based on the combination of rural/urban, access control, number of lanes and median type variables. This variable classifies each roadway segment into one of ten roadway types described in the later AFormat@ section. This variable is also included as a accident-file variable by matching each crash to its corresponding roadway segment. The MVMT variable (Million Vehicle Miles of Travel) is calculated for each segment in the roadway file by multiplying the segment length, AADT and 365 days in a year, and dividing by one million. Both these variables were created in response to inquires from data users, whose most frequent questions have concerned either crash frequencies or rates (per MVMT) for one or more of these roadway classes.
Finally, added coverage of information occurs in the 1991 file where a number of new variables were added to the Roadlog File. Some refer to engineering details (e.g., thickness of slab), while others appear to be new information on traffic or other variables (e.g., number of signalized intersections). Where available, details are included in the SAS format description.
With respect to data accuracy, as noted above, the inventoried roadways in Utah are composed of three overlapping systems -- the State-controlled system (approximately 5,800 miles, of which 5,600 are also under the Federal-Aid system), additional Federal-Aid highways not on the State-controlled system (approximately 1,700 additional miles, predominantly urban roadways or secondary roadways under township or county control), and approximately 5,700 miles of inventoried roadway which falls neither on the State system nor the Federal Aid system, and which will be referred to as "non-system" roadways. As indicated earlier, the inventory data for these three types of roadways is either based on inventory efforts tied to the requirements of the HPMS system, or on older files in which data were extracted from design drawings or photologs.
Utah staff has the highest confidence in the inventory data which is based on the HPMS effort or the expansion thereof. Thus, all variables on the Federal Aid system (e.g., Interstate, Federal Aid Primary, Federal Aid Secondary and Federal Aid Urban), and all coded variables on the additional State-system routes are felt to be coded very accurately. The inventory data for the remaining 5,700 miles of "non-system" roadway are also felt to be quite good, but not as accurate as the data on the system roads simply because the inventory is not updated as often.
It is noted that as in most state inventory files, the design of the file may lead to some "gaps" in the data in route-oriented analysis. Because the file is section-based, with each section of pavement being represented only once, analyses of roadway types (e.g., all Interstate mileage, all 2-lane mileage) can be conducted accurately, with appropriate accidents being merged to the appropriate sections. However, for analyses involving a specific route, there may be gaps in the data where the route coincides with a second route. Here, the inventory data will only be captured on one of the routes (usually the higher-order route) to prevent double-counting of mileage. For example, since some sections of Interstate I-15 and I-80 coincide, it has been found that the inventory data (and linked accidents) only appear on I-15 in the file. Thus, the analysts doing a route-based analysis on I-80 will have to determine the milepoints at which the routes coincide (gaps in the I-80 file segments which can be linked to a map), and pick up the appropriate data from the I-15 file. Even larger problems would be expected to occur if route-based analysis are attempted for lower-order routes (e.g., state routes which may coincide with US routes). Again, the HSIS files were primarily designed for non-route-based analyses, and will be accurate in the majority of runs conducted.
As noted earlier, traffic data related to AADT and truck percentages are found on the Roadlog file. These data are based on Utah=s traffic count program. In this program, there are 85 permanent Automatic Traffic Recorders (ATR's) on Interstate and Utah State roads in Utah which are in operation 365 days/year. Of these, 53 ATR's capture volume and vehicle classification counts and 32 ATR's count volume only. These ATR's conform with FHWA's HPMS guidelines. In addition, there are approximately 10 additional ATR=s on roads inside National Parks in Utah that are operated by the National Park Service.
In addition to these permanent counts, Utah collects 48-hour (or more) coverage counts at approximately 1,300 locations per year. Counts on the State-system roadway are done on a 3 to 5 year cycle. Approximately 100 traffic counting machines are utilized to collect traffic data for 7,100 miles of State-system roads in Utah. In terms of coverage, Utah tends to have a better sample coverage of high volume roads compared to lower functional categories. From a purely statistical perspective, a larger sample might be more appropriate for the lower functional classes of roads. However, Utah believes that limited resources for counting should be devoted to the roads that carry a bulk of the traffic. In addition to these coverage counts, approximately 200-300 short-term vehicle classification counts are conducted each year. Finally, to help develop seasonal factors, additional seasonal counts are captured for a week per month or per season.
Short term counts are expanded to AADT estimates using ATR data for roads with similar characteristics, functional class, and volume group. For a year in which no count is made, the previous year=s count for a section is modified by a Agrowth factor@ which is based on changes in counts in the same area or on changes measured by an Aassigned@ (similar) ATR station. In this manner, volume assignments are made to each section of State system roadway each year. Finally, Utah staff also develops estimates of truck percentages and equivalent single axle loadings (ESAL=s) for Aon-system@ roadways based on the classification counts, weigh-in-motion data, interaction with trucking companies, and staff knowledge of trucking routes. Traffic information is entered into the Traffic file as it is being collected, but is transferred to the computerized system, and thus to the Roadlog File, at the end of the year.
With respect to the accuracy of the traffic information, Utah staff indicated that the data are currently being corrected such that errors would probably not be greater than +10 percent for almost all of the State-system sections. (Counts on Anon-system@ county and municipal roads would be less accurate.)
The Horizontal Curvature file has been discontinued by the State of Utah after 1994. According to the State DOT staff, the data are no longer being updated due other priorities in some districts. In addition, the staff has some questions about the accuracy of the original data input into the file. They are currently in the process of verifying data based on as-built plans. To date, even though the staff has questions about the accuracy, the verification activity has not indicated major problems with the data. However, given their concerns, and the fact that the verification process is not complete, we have chosen to leave the pre-1994 data in the HSIS files, and to await the development of the verified file. The following narrative describes the earlier files. However, we feel that the data in the existing (1994 and earlier) files should be used with some caution.
The existing horizontal curvature file contains information on horizontal curves in the State of Utah. It contains 12,976 records in 1994. This file was expanded to the full State-system in 1986 and contains information on curve and tangent sections. (Note that curve-related information does not exist for all Federal-Aid system roads, since some of these roads fall within urban areas not under State DOT control.) The annual files included in the HSIS are for 1987 and 1991-94. It is noted that the sections on this file are divided only at the beginning and end of curves, and thus the number of sections will not match the sections on the Roadlog File which are defined by changes in many other variables.
Approximately 87 percent of the 5,900 miles on this file are tangent sections. This file includes information on the degree of curve, the direction of curve, and the type of roadway on which the tangent and curve sections fall. When compared to Washington and other similar State files, the percentage of curve by degree appears logical. As noted earlier, Utah staff does have some questions concerning the accuracy of these data, and is currently in a data verification process.
Like the Horizontal Curve File, the Vertical Grade file has been discontinued by the State of Utah after 1994, due to the same updating and data verification issues. (See discussion above.) Again, the data are currently being verified using as-built plans. We have again chosen to leave the 1994 and earlier files in the HSIS system, but again note to the user that the data should be used with caution. The existing file is described in the following narrative.
The vertical curvature file contains information on the percent grade and the direction of grade (either plus or minus). Like the Horizontal Curve File, data are only available for the State-system roads -- approximately 5900 miles of roadway. Currently, the HSIS system contains six files -- 1987, 1989, and 1991-94.
As with the horizontal curvature file, the sections on this file are defined only by breaks in grade. Of the 5900 miles of roadway on this file, approximately 18 percent are characterized by zero percent grade and approximately six percent are characterized by grades steeper than five percent, which suggests that the distribution of grade is quite logical. As noted earlier, Utah staff does have some questions concerning the accuracy of these data, and is currently in a data verification process.
Because the individual Utah files are all extracted from one ADABAS database, all are linkable. As noted above, the accident data are subdivided into three subfiles -- accident, vehicle, and occupant. These subfiles can be linked together using the "case number" variable (i.e., CASENO) present in each of the three files. This variable includes the accident year. When linking the occupant subfile, the additional linking variable "vehicle number" (i.e., VEHNO) must match so that the occupants are associated with the vehicle in which they were traveling. To link the Vehicle subfile with the Accident alone, first sort both subfiles by case number. To link the Occupant file with the other two subfiles, first sort both the Vehicle subfile and Occupant subfile by case number and vehicle number. Next sort the Accident subfile by case number. Alternatively, the separate subfiles can be linked by specifying an SQL JOIN operation with the constraining condition that case number and vehicle number from each table are equal. SQL processing does not require the data to be presorted and the output will not be in any particular sort order unless ORDER BY is specified.
The Accident subfile can then be linked with the Roadlog File using information related to route number and milepost on the route. The actual linkage variables on the Accident file which are used in the merging operation are A_ROUTE (route number) and MILEPOST. The linkage variables on the Roadlog File are BEGMP, ENDMP, and RTE_NBR.
To prepare the Accident subfile for linking with the Roadlog File using a SAS data step process, the analyst must sort both the Accident and the Roadway File into location order by A_ROUTE and MILEPOST on the Accident file and by RTE_NBR and BEGMP on the Roadlog File. Similar sorts would be done with other files to be merged. For the alternative SQL join, the analyst must specify an exact match on A_ROUTE and RTE_NBR from the Accident and Roadlog files and a range match where MILEPOST occurs between BEGMP and ENDMP.
To link the Horizontal Curvature or Grade File with the Accident File, use exactly the same variables as used in linking the Accident and Roadlog Files. Merging the Curve or Grade Files with the Roadlog File is a slightly more complex section-to-section match.
Finally, where appropriate and possible, a format which defines categories within a given variable has been developed for HSIS SAS variables. These categories are shown in the pages below. If you are a SAS user and wish to receive a formatting program which includes these SAS formats (with linkage to the pertinent variable name), please request these from the HSIS staff who provides the data file to you.
State computer files and accident information -- Rick Julio (801-965-4268) -- Mr. Julio is our main contact within the State of Utah when questions arise concerning both the data files in general and specific information related to the accident and roadway variables. He is the Roadway Geometric Engineer in the Utah DOT, and has oversight responsibilities for roadway and accident files.
Traffic Engineering information B Mack Christensen (801-965-4264) is the Traffic and Safety Studies Engineer with the Utah DOT. He can provide additional information on uses of the Utah data in safety studies, particularly as related to traffic engineering studies.
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| A_ROUTE | ACCIDENT ROUTE NUMBER | Accident | CHA(6) | I-25 |
| A_RTCODE | ROUTE CODE | Accident | CHA(1) | I-25 |
| AADT | AADT (BOTH DIRECTIONS) | Roadlog | NUM | I-69 |
| AADTGRP | AADT VOLUME GROUP | Roadlog | CHA(2) | I-69 |
| ACC_PNTS | NUMBER OF MAJOR ACCESS POINTS | Roadlog | CHA(2) | I-70 |
| ACCESS | ACCESS CONTROL | Roadlog | CHA(1) | I-70 |
| ACCTYPE | TYPE OF ACCIDENT (FIRST EVENT) | Accident | CHA(1) | I-25 |
| ACCYR | ACCIDENT YEAR | Accident | CHA(4) | I-26 |
| AGE | DRIVER/OCCUPANT AGE | Occupant | NUM | I-61 |
| AGENCY | INVESTIGATING OFFICER | Accident | CHA(1) | I-26 |
| ALT_VEH | ALTERED VEHICLE | Vehicle | CHA(1) | I-41 |
| APP_WD | APPROACH WIDTH | Roadlog | NUM | I-70 |
| BEGMP | BEGIN MILEPOST | Roadlog | NUM | I-70 |
| BEGMP | BEGIN MILEPOST | Curve | NUM | I-91 |
| BEGMP | BEGIN MILEPOST | Grade | NUM | I-95 |
| CAPACITY | CAPACITY | Grade | NUM | I-95 |
| CARGO | DESCRIPTION OF CARGO | Vehicle | CHA(1) | I-41 |
| CARR_ADD | CARRIER ADDRESS | Vehicle | CHA(30) | I-41 |
| CARR_CTY | CARRIER CITY | Vehicle | CHA(20) | I-42 |
| CARR_NAM | CARRIER NAME | Vehicle | CHA(54) | I-42 |
| CARR_STE | CARRIER STATE | Vehicle | CHA(2) | I-42 |
| CARR_ZIP | CARRIER ZIP CODE | Vehicle | CHA(10) | I-42 |
| CASENO | ACCIDENT YEAR + CASE NUMBER | Accident | CHA(9) | I-26 |
| CASENO | ACCIDENT YEAR + CASE NUMBER | Vehicle | CHA(9) | I-42 |
| CASENO | ACCIDENT YEAR + CASE NUMBER | Occupant | CHA(9) | I-61 |
| COL_TYPE | COLLISION TYPE BASED ON VEHICLE MOTIONS | Accident | CHA(2) | I-26 |
| COMM_TRK | COMMERCIAL TRUCK | Vehicle | CHA(1) | I-42 |
| COMVEH | TRUCK/COMMERCIAL VEHICLE ROUTES | Roadlog | CHA(1) | I-71 |
| CONTRIB1 | FIRST CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| CONTRIB2 | SECOND CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| CONTRIB3 | THIRD CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| COUNTY | FIPS COUNTY CODE | Roadlog | CHA(3) | I-71 |
| DAMAGE2 | PART DAMAGED #2 | Vehicle | CHA(1) | I-44 |
| DAMAGE3 | PART DAMAGED #3 | Vehicle | CHA(1) | I-44 |
| DAMAGE4 | PART DAMAGED #4 | Vehicle | CHA(1) | I-44 |
| DAMAGE5 | PART DAMAGED #5 | Vehicle | CHA(1) | I-44 |
| DAMAGE6 | PART DAMAGED #6 | Vehicle | CHA(1) | I-44 |
| DAMAGE7 | PART DAMAGED #7 | Vehicle | CHA(1) | I-44 |
| DAMAGE8 | PART DAMAGED #8 | Vehicle | CHA(1) | I-44 |
| DAMAGE9 | PART DAMAGED #9 | Vehicle | CHA(1) | I-44 |
| DAYMTH | DAY OF MONTH | Accident | NUM | I-27 |
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| DEG_CURV | DEGREE OF CURVATURE | Curve | NUM | I-91 |
| DESG_SPD | WEIGHTED DESIGN SPEED | Roadlog | NUM | I-72 |
| DIR_CURV | DIRECTION OF CURVE | Curve | CHA(1) | I-91 |
| DIR_FACT | DIRECTIONAL FACTOR | Roadlog | CHA(3) | I-72 |
| DIR_GRAD | DIRECTION OF GRADE | Grade | CHA(1) | I-95 |
| DIR_TRVL | DIRECTION OF TRAVEL | Vehicle | CHA(1) | I-44 |
| DR_BDATE | DRIVER DATE OF BIRTH | Vehicle | CHA(8) | I-44 |
| DR_EDUC | DRIVER EDUCATION | Vehicle | CHA(1) | I-45 |
| DR_EJECT | DRIVER EJECTION Vehicle | CHA(1) | I-45 | II-90 |
| DR_INJAR | DRIVER INJURED BODY AREA | Vehicle | CHA(1) | I-45 |
| DR_INJCS | DRIVER INJURY CAUSE | Vehicle | CHA(1) | I-45 |
| DRAINAGE | DRAINAGE ADEQUACY | Roadlog | CHA(1) | I-72 |
| DRV_AGE | DRIVER AGE | Vehicle | NUM | I-46 |
| DRV_BAC | DRIVER ALCOHOL PERCENT | Vehicle | NUM | I-46 |
| DRV_INJ | DRIVER INJURY TYPE | Vehicle | CHA(1) | I-46 |
| DRV_REST | DRIVER SAFETY EQUIPMENT | Vehicle | CHA(1) | I-47 |
| DRV_SEX | DRIVER SEX | Vehicle | CHA(1) | I-47 |
| DRV_STAT | DRIVER LICENSE STATE | Vehicle | CHA(2) | I-47 |
| EJECT | OCCUPANT EJECTION | Occupant | CHA(1) | I-61 |
| ENDMP | END MILEPOST | Roadlog | NUM | I-72 |
| ENDMP | END MILEPOST | Curve | NUM | I-91 |
| ENDMP | END MILEPOST | Grade | NUM | I-95 |
| EVENT2 | FIRST SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| EVENT3 | SECOND SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| EVENT4 | THIRD SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| FAID_URB | FAID URBAN AREA | Roadlog | CHA(5) | I-72 |
| FED_AID | FEDERAL-AID SYSTEM | Roadlog | CHA(1) | I-73 |
| FED_STAS | FEDERAL-AID SYSTEM STATUS | Roadlog | CHA(1) | I-73 |
| FUNC_CLS | FUNCTIONAL CLASSIFICATION | Roadlog | CHA(2) | I-73 |
| FUT_ADT | FUTURE ADT | Roadlog | NUM | I-73 |
| HOR_ACC | HORIZONTAL ALIGNMENT ADEQUACY | Roadlog | CHA(1) | I-74 |
| HOUR | TIME OF ACCIDENT (HOUR ONLY) | Accident | NUM | I-28 |
| IMPT_SPD | IMPACT SPEED | Vehicle | NUM | I-48 |
| INJ | INJURY TYPE | Occupant | CHA(1) | I-62 |
| INJ_AREA | INJURY BODY AREA | Occupant | CHA(1) | I-62 |
| INJCAUSE | INJURY CAUSE | Occupant | CHA(1) | I-62 |
| INSPECT | SAFETY INSPECTION | Vehicle | CHA(1) | I-48 |
| INT_TYPE | INTERSECTION TYPE | Accident | CHA(1) | I-29 |
| K_FACTOR | K FACTOR | Roadlog | CHA(2) | I-74 |
| LANEWID | AVER THROUGH LANE WIDTH | Roadlog | NUM | I-74 |
| LEVL_CNT | GOVERNMENTAL OWNERSHIP | Roadlog | CHA(2) | I-75 |
| LICRESTR | DRIVER LICENSE RESTRICTION | Vehicle | CHA(2) | I-48 |
| LICTYP | DRIVER LICENSE TYPE | Vehicle | CHA(1) | I-49 |
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| LIGHT | LIGHT CONDITIONS | Accident | CHA(1) | I-29 |
| LOCALITY | KIND OF LOCALITY | Accident | CHA(1) | I-29 |
| LOCATN | LOCATION | Accident | CHA(1) | I-29 |
| MED_TYPE | TYPE OF MEDIAN | Roadlog | CHA(1) | I-75 |
| MEDWID | PREDOMINANT MEDIAN WIDTH | Roadlog | NUM | I-76 |
| MILEPOST | ACCIDENT MILEPOST | Accident | NUM | I-30 |
| MISCACT1 | DRIVER INTENT | Vehicle | CHA(2) | I-49 |
| MONTH | MONTH | Accident | NUM | I-30 |
| MVMT | MILLION VEHICLE MILES TRAVELED | Roadlog | NUM | I-76 |
| NBR_INT | NUM. OF GRADE-SEPARATED INTERCHANGES | Roadlog | NUM | I-76 |
| NHS_IND | NATIONAL HIGHWAY SYSTEM INDICATOR | Roadlog | CHA(1) | I-76 |
| NO_LANES | TOTAL NUM OF TRAFFIC LANES | Roadlog | NUM | I-76 |
| NO_OTR | NUM. INTERSECTION, NO CONTROL | Roadlog | NUM | I-77 |
| NO_SGNL | NUM. SIGNALIZED INTERSECTIONS | Roadlog | NUM | I-77 |
| NO_SIGN | NUM. INTERSECTION, STOP SIGN | Roadlog | NUM | I-77 |
| NOCC_BAC | NON-OCCUPANT ALCOHOL PERCENT | Occupant | NUM | I-63 |
| NOCC_FAT | TOTAL NON-OCCUPANTS KILLED | Accident | NUM | I-30 |
| NOCC_INJ | TOTAL NON-OCCUPANTS INJURED | Accident | NUM | I-30 |
| NOCC_TST | NON-OCCUPANT ALCOHOL TEST TYPE | Occupant | CHA(1) | I-63 |
| NOCCSEV | SUM OF NON-OCCUPANT SEVERITY | Accident | NUM | I-30 |
| NUM_OCCS | NUMBER OF OCCUPANTS IN ACCIDENT | Vehicle | NUM | I-50 |
| NUMVEHS | NUMBER OF VEHICLES IN CRASH | Accident | NUM | I-31 |
| OBJECT1 | OBJECT STRUCK | Accident | CHA(1) | I-31 |
| OBJECT1 | OBJECT STRUCK | Vehicle | CHA(1) | I-50 |
| OCCSEV | SUM OF OCCUPANT SEVERITY CODES | Accident | NUM | I-32 |
| ONEWAY | ONE-WAY OR TWO-WAY FACILITY | Roadlog | CHA(1) | I-77 |
| PAV_SEC | PAVEMENT SECTION | Roadlog | CHA(1) | I-77 |
| PAV_WDL | PAVED LEFT SHOULDER WIDTH | Roadlog | NUM | I-78 |
| PAV_WIDR | PAVED RIGHT SHOULDER WIDTH | Roadlog | NUM | I-78 |
| PAVECOND | PRESENT SERVICE RATING | Roadlog | CHA(2) | I-78 |
| PCT_GRAD | PERCENT OF GRADIENT | Grade | NUM | I-95 |
| PCT_GREN | PERCENT GREEN TIME | Roadlog | NUM | I-78 |
| PCT_TRK | OFF-PEAK PERCENT TRUCKS | Roadlog | NUM | I-79 |
| PCTSIGHT | PERCENT PASSING SIGHT DISTANCE | Roadlog | NUM | I-79 |
| PEAK_PRK | PEAK PARKING | Roadlog | CHA(1) | I-79 |
| PEAK_TRK | PEAK PERCENT TRUCKS | Roadlog | NUM | I-80 |
| PEAKCAPT | PEAK CAPACITY | Roadlog | CHA(5) | I-80 |
| PEDACT | PEDESTRIAN/BICYCLIST ACTION | Accident | CHA(2) | I-32 |
| PEDACT | PEDESTRIAN/BICYCLIST ACTION | Occupant | CHA(2) | I-63 |
| PROPDAM | TOTAL PROPERTY DAMAGE | Accident | CHA(1) | I-33 |
| RAMP_NO | RAMP NUMBER | Accident | CHA(2) | I-33 |
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| RD_CHAR1 | ROAD CHARACTER | Accident | CHA(1) | I-33 |
| RD_DEF | ROAD DEFECT/CONDITION | Accident | CHA(1) | I-33 |
| RDSURF | ROAD SURFACE CONDITIONS | Accident | CHA(1) | I-34 |
| RESIDLOC | RESIDENCE OF DRIVER | Vehicle | CHA(1) | I-51 |
| REST1 | SAFETY EQUIPMENT | Occupant | CHA(1) | I-64 |
| RODWYCLS | ROADWAY CLASSIFICATION | Accident | CHA(2) | I-34 |
| RODWYCLS | ROADWAY CLASSIFICATION | Roadlog | CHA(2) | I-80 |
| ROW | RIGHT OF WAY WIDTH | Roadlog | NUM | I-80 |
| RSHL_TYP | SHOULDER TYPE (RIGHT SIDE) | Roadlog | CHA(1) | I-81 |
| RTE_ID | TRAFFIC ROUTE ID | Roadlog | CHA(5) | I-81 |
| RTE_NBR | UNIQUE ROUTE/BLOC DESIGN | Roadlog | CHA(6) | I-82 |
| RTE_NBR | UNIQUE ROUTE/BLOC DESIGN | Curve | CHA(6) | I-91 |
| RTE_NBR | UNIQUE ROUTE/BLOC DESIGN | Grade | CHA(6) | I-95 |
| RTE_TYPE | ROUTE TYPE | Roadlog | CHA(1) | I-82 |
| RTTYP_AC | ROUTE TYPE - ACCIDENT | Accident | CHA(1) | I-35 |
| RURURB | RURAL/URBAN DESIGNATION | Roadlog | CHA(1) | I-82 |
| SAM_SUB | HPMS SAMPLE SUBDIVISION | Roadlog | CHA(1) | I-83 |
| SAMP_NB | HPMS SAMPLE NUMBER | Roadlog | CHA(12) | I-83 |
| SEATPOS | OCCUPANT PLACEMENT | Occupant | CHA(3) | I-65 |
| SEG_LNG | SECTION LENGTH IN MILES | Roadlog | NUM | I-83 |
| SEG_LNG | SECTION LENGTH IN MILES | Curve | NUM | I-91 |
| SEG_LNG | SECTION LENGTH IN MILES | Grade | NUM | I-95 |
| SEVERITY | SEVERITY | Accident | CHA(1) | I-35 |
| SEX | DRIVER/OCCUPANT SEX | Occupant | CHA(1) | I-66 |
| SIGNAL | PREVAILING SIGNAL TYPE | Roadlog | CHA(1) | I-83 |
| SLAB_THK | STRUCT-NO-OR-SLAB THICKNESS | Roadlog | CHA(2) | I-83 |
| SOB_TEST | DRIVER ALCOHOL TEST TYPE | Vehicle | CHA(1) | I-52 |
| SPD_LIMT | POSTED DAYLIGHT SPEED LIMIT | Roadlog | NUM | I-84 |
| SPDLIMIT | POSTED SPEED LIMIT | Vehicle | NUM | I-52 |
| SURF_TYP | PAVEMENT TYPE | Roadlog | CHA(2) | I-84 |
| TAKEBY | INJURED TAKEN BY | Accident | CHA(1) | I-35 |
| TERRAIN | PREDOMINANT TERRAIN TYPE | Roadlog | CHA(1) | I-85 |
| TIME | TIME OF ACCIDENT | Accident | CHA(4) | I-35 |
| TIMEARR | TIME OF ARRIVAL ON SCENE | Accident | CHA(4) | I-35 |
| TIMECALL | TIME OF CALL | Accident | CHA(5) | I-36 |
| TIMENOTE | TIME WHEN DISPATCH WAS NOTIFIED | Accident | CHA(4) | I-36 |
| TOLL | TOLL | Roadlog | CHA(1) | I-85 |
| TOT_INJ | TOTAL OCCUPANTS INJURED | Accident | NUM | I-36 |
| TOT_KILL | TOTAL OCCUPANTS KILLED | Accident | NUM | I-36 |
| TOT_OCC | TOTAL NUMBER OF PEOPLE IN ACCIDENT | Accident | NUM | I-36 |
| TREATBY | FIRST AID GIVEN BY | Accident | CHA(2) | I-36 |
| TRF_CNTL | TRAFFIC CONTROL TYPE | Accident | CHA(1) | I-37 |
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| TRVL_SPD | TRAVEL SPEED | Vehicle | NUM | I-53 |
| TYPE_DEV | TYPE DEVELOPMENT | Roadlog | CHA(1) | I-85 |
| TYPE_IMP | TYPE IMPROVEMENT | Roadlog | CHA(2) | I-85 |
| URB_LOC | URBAN LOCATION | Roadlog | CHA(1) | I-86 |
| USE_FLG | USE FLAG | Roadlog | CHA(1) | I-86 |
| V_DAMAGE | VEHICLE DAMAGE | Vehicle | CHA(1) | I-53 |
| VEHNO | VEHICLE NUMBER | Vehicle | NUM | I-53 |
| VEHNO | VEHICLE NUMBER | Occupant | NUM | I-66 |
| VEHTYPE | BODY STYLE/VEHICLE TYPE | Vehicle | CHA(2) | I-54 |
| VEHYR | VEHICLE MODEL YEAR | Vehicle | CHA(4) | I-55 |
| VER_CODE | VERIFICATION CODE | Accident | CHA(1) | I-37 |
| VERT_ALN | VERTICAL ALIGNMENT ADEQUACY | Roadlog | CHA(1) | I-87 |
| VIN | VEH IDENTIFICATION NUMBER | Vehicle | CHA(17) | I-56 |
| VIOL | VIOLATION CODE | Vehicle | CHA(3) | I-56 |
| VISION | DRIVER VISION OBSCURED | Vehicle | CHA(2) | I-56 |
| WD_FEAS | WIDENING FEASIBILITY | Roadlog | CHA(1) | I-87 |
| WEATHER | WEATHER | Accident | CHA(1) | I-37 |
| WEEKDAY | DAY OF WEEK | Accident | NUM | I-38 |
| XSECT | INTERSECTION CODE | Accident | CHA(3) | I-38 |
| YR | YEAR (4 DIGITS) | Roadlog | CHA(4) | I-87 |
| YR | YEAR (4 DIGITS) | Curve | CHA(4) | I-92 |
| YR | YEAR (4 DIGITS) | Grade | CHA(4) | I-96 |
| YRS_DRV | YEARS DRIVING EXPERIENCE | Vehicle | NUM | I-57 |
| 2 BY 2 TABLE CODE | ||||
| RODWYCLS BY ACCTYPE | Accident | |||
| RODWYCLS BY LIGHT | Accident | |||
| RODWYCLS BY SEVERITY | Accident | |||
| RODWYCLS BY WEATHER | Accident |
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| A_ROUTE | ACCIDENT ROUTE NUMBER | Accident | CHA(6) | I-25 |
| A_RTCODE | ROUTE CODE | Accident | CHA(1) | I-25 |
| ACCTYPE | TYPE OF ACCIDENT (FIRST EVENT) | Accident | CHA(1) | I-25 |
| ACCYR | ACCIDENT YEAR | Accident | CHA(4) | I-26 |
| AGENCY | INVESTIGATING OFFICER | Accident | CHA(1) | I-26 |
| CASENO | ACCIDENT YEAR + CASE NUMBER | Accident | CHA(9) | I-26 |
| COL_TYPE | COLLISION TYPE BASED ON VEHICLE MOTIONS | Accident | CHA(2) | I-26 |
| DAYMTH | DAY OF MONTH | Accident | NUM | I-27 |
| EVENT2 | FIRST SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| EVENT3 | SECOND SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| EVENT4 | THIRD SUBSEQUENT EVENT | Accident | CHA(1) | I-27 |
| HOUR | TIME OF ACCIDENT (HOUR ONLY) | Accident | NUM | I-28 |
| INT_TYPE | INTERSECTION TYPE | Accident | CHA(1) | I-29 |
| LIGHT | LIGHT CONDITIONS | Accident | CHA(1) | I-29 |
| LOCALITY | KIND OF LOCALITY | Accident | CHA(1) | I-29 |
| LOCATN | LOCATION | Accident | CHA(1) | I-29 |
| MILEPOST | ACCIDENT MILEPOST | Accident | NUM | I-30 |
| MONTH | MONTH | Accident | NUM | I-30 |
| NOCC_FAT | TOTAL NON-OCCUPANTS KILLED | Accident | NUM | I-30 |
| NOCC_INJ | TOTAL NON-OCCUPANTS INJURED | Accident | NUM | I-30 |
| NOCCSEV | SUM OF NON-OCCUPANT SEVERITY | Accident | NUM | I-30 |
| NUMVEHS | NUMBER OF VEHICLES IN CRASH | Accident | NUM | I-31 |
| OBJECT1 | OBJECT STRUCK | Accident | CHA(1) | I-31 |
| OCCSEV | SUM OF OCCUPANT SEVERITY CODES | Accident | NUM | I-32 |
| PEDACT | PEDESTRIAN/BICYCLIST ACTION | Accident | CHA(2) | I-32 |
| PROPDAM | TOTAL PROPERTY DAMAGE | Accident | CHA(1) | I-33 |
| RAMP_NO | RAMP NUMBER | Accident | CHA(2) | I-33 |
| RD_CHAR1 | ROAD CHARACTER | Accident | CHA(1) | I-33 |
| RD_DEF | ROAD DEFECT/CONDITION | Accident | CHA(1) | I-33 |
| RDSURF | ROAD SURFACE CONDITIONS | Accident | CHA(1) | I-34 |
| RODWYCLS | ROADWAY CLASSIFICATION | Accident | CHA(2) | I-34 |
| RTTYP_AC | ROUTE TYPE - ACCIDENT | Accident | CHA(1) | I-35 |
| SEVERITY | SEVERITY | Accident | CHA(1) | I-35 |
| TAKEBY | INJURED TAKEN BY | Accident | CHA(1) | I-35 |
| TIME | TIME OF ACCIDENT | Accident | CHA(4) | I-35 |
| TIMEARR | TIME OF ARRIVAL ON SCENE | Accident | CHA(4) | I-35 |
| TIMECALL | TIME OF CALL | Accident | CHA(5) | I-36 |
| TIMENOTE | TIME WHEN DISPATCH WAS NOTIFIED | Accident | CHA(4) | I-36 |
| TOT_INJ | TOTAL OCCUPANTS INJURED | Accident | NUM | I-36 |
| TOT_KILL | TOTAL OCCUPANTS KILLED | Accident | NUM | I-36 |
| TOT_OCC | TOTAL NUMBER OF PEOPLE IN ACCIDENT | Accident | NUM | I-36 |
| TREATBY | FIRST AID GIVEN BY | Accident | CHA(2) | I-36 |
| TRF_CNTL | TRAFFIC CONTROL TYPE | Accident | CHA(1) | I-37 |
| VER_CODE | VERIFICATION CODE | Accident | CHA(1) | I-37 |
| WEATHER | WEATHER | Accident | CHA(1) | I-37 |
| WEEKDAY | DAY OF WEEK | Accident | NUM | I-38 |
| XSECT | INTERSECTION CODE | Accident | CHA(3) | I-38 |
| RODWYCLS BY ACCTYPE | Accident | |
| RODWYCLS BY LIGHT | Accident | |
| RODWYCLS BY SEVERITY | Accident | |
| RODWYCLS BY WEATHER | Accident |
NOTE: SAS variable names and explanatory names are shown above each listing (See Discussion for information on SAS formats.).
NOTE: (1) This variable represents the first event in the crash sequence. For subsequent events see also EVENT2 (first subsequent event), EVENT3 (second subsequent event), and EVENT4 (third subsequent event). It is noted that EVENT3 (second subsequent event) contains little information - 90% - 100% of the data are 'NOT CODED'. For similar reasons, EVENT4 was deleted after 1994.
(2) Coding for this variable changed early in 1986, when the "letter" codes were added. Thus, the 1985/early 1986 data for code '9' (RAN OFF THRU MED) are in error. In 1985 and part of 1986, this code represents the total number of ran-off-road crashes.
*New codes added in 1995.
NOTE: It appears that coding for certain categories in this variable changed in 1993 and later years. Categories 04-07 increased while 08-09 decreased.
NOTE: (1) These variables represent the second, third, and fourth events in the crash sequence. For first event, see ACC_TYP. However, Subsequent EVENT4 is not collected after 1994, and the included data may be erroneous. Do not use as an analysis variable.
(2) Coding for this variable changed in 1986, when the "letter" codes were added. Thus, the 1985 data for code '9' (RAN OFF THRU MED) are in error. In 1985, this code represents the total number of ran-off-road crashes.
NOTE: The value '24' is an invalid value. According to Utah staff, the small number of cases could possible be combined with data coded as '0'.
NOTE: New variable added in 1995.
NOTE: Prior to 1988, code 2 was "DAWN OR DUSK". Code 5 added in 1989.
NOTE: New variable added in 1995.
NOTE: This variable contains certain cases coded as "0" vehicles. These are errors.
1986 and Later Codes
Pre-1986 Codes
NOTE: (1) The coding for this variable changed in 1986. As shown above, the pre-1986 codes were numeric and values will be preceded by an asterisk when they appear in tables. In addition, code 'U' (End treatment for concrete barrier) was added in 1992.
(2) This variable is an Accident File variable through 1994, and then is switched to a Vehicle File variable starting in 1995.
*New codes added in 1994.
NOTE: This variable is in the Accident File through 1994, and is converted to an Occupant File variable for 1995 and later.
NOTE: All defects increase in 1987. See Discussion.
NOTE: Created variable added to HSIS accident and roadway inventory files in all states in 1999. See Discussion.
NOTE: (1) When an accident occurs on private property (code "5"), no vehicle or occupant information is computerized for cost reasons.
(2) 1985 data for codes '2' and '3' are not consistent with similar data from later years.
(3) Data are no longer coded for 1995 and later years.
NON-LABELED VARIABLE -- FORMAT HHMM WHERE
NON-LABELED VARIABLE -- FORMAT HHMM WHERE
NON-LABELED VARIABLE -- FORMAT HHMM WHERE
NOTE: More than 10 percent of "0" codes in 1987 data. This could be due to a Utah program error.
NOTE: The 1985 data have leading zeros.
NOTE: Letter codes added in 1986; thus 1985 data are not consistent with later years.
NOTE: Codes 8 and 9 new in 1992.
NOTE: The above coding is based on the intersection sketch provided by the officer. The orientations are relative to North. Thus, B00 is a T-intersection with the base of the "T" toward the west (left). The data are not available after 1995.
| SAS VARIABLE NAME | DESCRIPTION | FILE | SAS VARIABLE TYPE | FORMAT PAGE NO. |
|---|---|---|---|---|
| ALT_VEH | ALTERED VEHICLE | Vehicle | CHA(1) | I-41 |
| CARGO | DESCRIPTION OF CARGO | Vehicle | CHA(1) | I-41 |
| CARR_ADD | CARRIER ADDRESS | Vehicle | CHA(30) | I-41 |
| CARR_CTY | CARRIER CITY | Vehicle | CHA(20) | I-42 |
| CARR_NAM | CARRIER NAME | Vehicle | CHA(54) | I-42 |
| CARR_STE | CARRIER STATE | Vehicle | CHA(2) | I-42 |
| CARR_ZIP | CARRIER ZIP CODE | Vehicle | CHA(10) | I-42 |
| CASENO | ACCIDENT YEAR + CASE NUMBER | Vehicle | CHA(9) | I-42 |
| COMM_TRK | COMMERCIAL TRUCK | Vehicle | CHA(1) | I-42 |
| CONTRIB1 | FIRST CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| CONTRIB2 | SECOND CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| CONTRIB3 | THIRD CONTRIBUTING CIRCUMSTANCES | Vehicle | CHA(2) | I-42 |
| DAMAGE2 | PART DAMAGED #2 | Vehicle | CHA(1) | I-44 |
| DAMAGE3 | PART DAMAGED #3 | Vehicle | CHA(1) | I-44 |
| DAMAGE4 | PART DAMAGED #4 | Vehicle | CHA(1) | I-44 |
| DAMAGE5 | PART DAMAGED #5 | Vehicle | CHA(1) | I-44 |
| DAMAGE6 | PART DAMAGED #6 | Vehicle | CHA(1) | I-44 |
| DAMAGE7 | PART DAMAGED #7 | Vehicle | CHA(1) | I-44 |
| DAMAGE8 | PART DAMAGED #8 | Vehicle | CHA(1) | I-44 |
| DAMAGE9 | PART DAMAGED #9 | Vehicle | CHA(1) | I-44 |
| DIR_TRVL | DIRECTION OF TRAVEL | Vehicle | CHA(1) | I-44 |
| DR_BDATE | DRIVER DATE OF BIRTH | Vehicle | CHA(8) | I-44 |
| DR_EDUC | DRIVER EDUCATION | Vehicle | CHA(1) | I-45 |
| DR_EJECT | DRIVER EJECTION | Vehicle | CHA(1) | I-45 |
| DR_INJAR | DRIVER INJURED BODY AREA | Vehicle | CHA(1) | I-45 |
| DR_INJCS | DRIVER INJURY CAUSE | Vehicle | CHA(1) | I-45 |
| DRV_AGE | DRIVER AGE | Vehicle | NUM | I-46 |
| DRV_BAC | DRIVER ALCOHOL PERCENT | Vehicle | NUM | I-46 |
| DRV_INJ | DRIVER INJURY TYPE | Vehicle | CHA(1) | I-46 |
| DRV_REST | DRIVER SAFETY EQUIPMENT | Vehicle | CHA(1) | I-47 |
| DRV_SEX | DRIVER SEX | Vehicle | CHA(1) | I-47 |
| DRV_STAT | DRIVER LICENSE STATE | Vehicle | CHA(2) | I-47 |
| IMPT_SPD | IMPACT SPEED | Vehicle | NUM | I-48 |
| INSPECT | SAFETY INSPECTION | Vehicle | CHA(1) | I-48 |
| LICRESTR | DRIVER LICENSE RESTRICTION | Vehicle | CHA(2) | I-48 |
| LICTYP | DRIVER LICENSE TYPE | Vehicle | CHA(1) | I-49 |
| MISCACT1 | DRIVER INTENT | Vehicle | CHA(2) | I-49 |
| NUM_OCCS | NUMBER OF OCCUPANTS IN ACCIDENT | Vehicle | NUM | I-50 |
| OBJECT1 | OBJECT STRUCK | Vehicle | CHA(1) | I-50 |
| RESIDLOC | RESIDENCE OF DRIVER | Vehicle | CHA(1) | I-51 |
| SOB_TEST | DRIVER ALCOHOL TEST TYPE | Vehicle | CHA(1) | I-52 |
| SPDLIMIT | POSTED SPEED LIMIT | Vehicle | NUM | I-52 |
| TRVL_SPD | TRAVEL SPEED | Vehicle | NUM | I-53 |
| V_DAMAGE | VEHICLE DAMAGE | Vehicle | CHA(1) | I-53 |
| VEHNO | VEHICLE NUMBER | Vehicle | NUM | I-53 |
| VEHTYPE | BODY STYLE/VEHICLE TYPE | Vehicle | CHA(2) | I-54 |
| VEHYR | VEHICLE MODEL YEAR | Vehicle | CHA(4) | I-55 |
| VIN | VEH IDENTIFICATION NUMBER | Vehicle | CHA(17) | I-56 |
| VIOL | VIOLATION CODE | Vehicle | CHA(3) | I-56 |
| VISION | DRIVER VISION OBSCURED | Vehicle | CHA(2) | I-56 |
| YRS_DRV | YEARS DRIVING EXPERIENCE | Vehicle | NUM | I-57 |
NOTE: (1) SAS variable names and explanatory names are shown above each listing (See Discussion for information on SAS formats.).
(2) For consistency with other State's files and ease of handling, driver-related variables have been included in this Vehicle Subfile as well as in the Occupant Subfile.
NOTE: New variable added in 1995.
NOTE: The proportions of 'not stated' and 'NONE' vary inconsistently from year to year. Officers may be failing to enter the 'none' code when no factor is present. Codes '42'-'45' are added in 1992. Codes '47'-'52' are added in 1995. CONTRIB3 was not collected after 1989.
NOTE: See V_DAMAGE for initial damage. Note that '0' can mean either that the vehicle was undamaged or totaled, so use with caution. DAMAGE7-DAMAGE9 were added in 1995.
NOTE: More than 5% is uncoded. It appears that officers may not be entering the 'unknown' code.
NOTE: Variable uncoded or 'UNKNOWN' in approximate 80% of cases. However, a large proportion of this is due to the lack of a 'NO INJURY' code within this variable.
NOTE: Variable uncoded or 'UNKNOWN' in approximate 80% of cases. However, a large proportion of this is due to the lack of a 'NO INJURY' code within this variable.
NOTE: The total proportion of positive BAC's and the portion of '>= 0.31' BAC's are both suspect.
*New codes added in 1994.
NOTE: The proportion of belts used appears to be higher than expected, particularly in 1987. As in other States, this could be the effect of occupants providing incorrect information due to the mandatory belt law.
NOTE: Conflicting raw file documentation. Above coding is correct according to Utah.
NOTE: Approximately 50% of the cases are coded as "00" = "NOT MOVING". This could be a valid code for parked or stopped vehicles. However, it is also the default code for uncoded cases. Use with caution.
NOTE: Uncoded data has increased to 99% in 1987. Do not use as analysis variable. The variable is deleted in 1995.
NOTE: The proportions of 'NO RESTRICTIONS' and >NOT CODED= data vary dramatically across years. It appears that both mean "no restrictions." In addition, the codes for certain categories were changed and/or combined sometime between 1990 and 1992. While we have attempted to accurately format the data, be aware that some of the coding in those years is suspect.
NOTE: Over 5% of the cases are uncoded in 1986 and 1987.