Airline Industry Datasets
The following datasets are freely available from the US Department of Transportation.
Origin and Destination Survey (DB1B)
The Airline Origin and Destination Survey Databank 1B (DB1B) is a 10% random sample of airline passenger tickets. It consists of three tables: Coupon, Market, and Ticket.
Frequency: Quarterly
Range: 1993–Present
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://www.transtats.bts.gov/TableInfo.asp?DB_ID=125
The columns listed for each table below reflect the columns available in the prezipped CSV files avaliable at TranStats.
DB1B Coupon
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://www.transtats.bts.gov/Fields.asp?Table_ID=289.
This table provides coupon-specific information for each domestic itinerary of the Origin and Destination Survey, such as the operating carrier, origin and destination airports, number of passengers, fare class, coupon type, trip break indicator, and distance.
Columns:
Column | Description | Notes |
---|---|---|
ItinID | Itinerary ID | Foreign key to DB1BTicket |
MktID | Market ID | Foreign key to DB1BMarket |
SeqNum | Coupon Sequence Number | Itinerary-level |
Coupons | Number of Coupons in the Itinerary | Also in DB1BTicket |
Year | Year | Also in DB1BTicket |
Quarter | Quarter (1-4) | Also in DB1BTicket |
Origin | Origin Airport Code | IATA Airport Code |
OriginAptInd | Origin Airport, Multiple Airports Indicator | - |
OriginCityNum | Origin Airport, City Code | - |
OriginCountry | Origin Airport, Country Code | - |
OriginStateFips | Origin Airport, State FIPS Code | - |
OriginState | Origin Airport, State Code | - |
OriginStateName | Origin State Name | - |
OriginWac | Origin Airport, World Area Code | - |
Dest | Destination Airport Code | IATA Airport Code |
DestAptInd | Destination Airport, Multiple Airports Indicator | - |
DestCityNum | Destination Airport, City Code | - |
DestCountry | Destination Airport, Country Code | - |
DestStateFips | Destination Airport, State FIPS Code | - |
DestState | Destination Airport, State Code | - |
DestStateName | Destination State Name | - |
DestWac | Destination Airport, World Area Code | - |
Break | Trip Break Code | - |
CouponType | Coupon Type Code | - |
TkCarrier | Ticketing Carrier Code | - |
OpCarrier | Operating Carrier Code | - |
RPCarrier | Reporting Carrier Code | - |
Passengers | Number of Passengers | - |
FareClass | Fare Class Code | - |
Distance | Coupon Distance | - |
DistanceGroup | Distance Group, in 500 Mile Intervals | - |
Gateway | Gateway Indicator (1=Yes) | - |
ItinGeoType | Itinerary Geography Type | - |
CouponGeoType | Coupon Geography Type | - |
Notes:
- FareClass does not follow the same coding as the class column of the T100 Domestic Segment table. The US Department of Transportation also warns not to use FareClass for analysis because different carriers treat the categories differently.
DB1B Market
This table contains directional market characteristics of each domestic itinerary of the Origin and Destination Survey, such as the reporting carrier, origin and destination airport, prorated market fare, number of market coupons, market miles flown, and carrier change indicators.
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://www.transtats.bts.gov/Fields.asp?Table_ID=247
Columns:
Number | Column | Description | Notes |
---|---|---|---|
1 | ItinID | Itinerary ID | Foreign key to DB1BTicket |
2 | MktID | Market ID | Primary key |
3 | MktCoupons | Number of Coupons in the Market | - |
4 | Year | Year | - |
5 | Quarter | Quarter (1-4) | - |
6 | Origin | Origin Airport Code | - |
7 | OriginAptInd | Origin Airport, Multiple Airports Indicator | - |
8 | OriginCityNum | Origin Airport, City Code | - |
9 | OriginCountry | Origin Airport, Country Code | - |
10 | OriginStateFips | Origin Airport, State FIPS Code | - |
11 | OriginState | Origin Airport, State Code | - |
12 | OriginStateName | Origin State Name | - |
13 | OriginWac | Origin Airport, World Area Code | - |
14 | Dest | Destination Airport Code | - |
15 | DestAptInd | Destination Airport, Multiple Airports Indicator | - |
16 | DestCityNum | Destination Airport, City Code | - |
17 | DestCountry | Destination Airport, Country Code | - |
18 | DestStateFips | Destination Airport, State FIPS Code | - |
19 | DestState | Destination Airport, State Code | - |
20 | DestStateName | Destination State Name | - |
21 | DestWac | Destination Airport, World Area Code | - |
22 | AirportGroup | Airport Group | - |
23 | WacGroup | World Area Code Group | - |
24 | TkCarrierChange | Ticketing Carrier Change Indicator | - |
25 | TkCarrierGroup | Ticketing Carrier Group | - |
26 | OpCarrierChange | Operating Carrier Change Indicator | - |
27 | OpCarrierGroup | Operating Carrier Group | - |
28 | RPCarrier | Reporting carrier code | #1 |
29 | TkCarrier | Ticketing carrier code | #2 |
30 | OpCarrier | Operating carrier code | #2 |
31 | BulkFare | Bulk fare indicator | - |
32 | Passengers | Number of passengers | - |
33 | MktFare | Market fare (ItinYield*MktMilesFlown) | - |
34 | MktDistance | Market distance | Includes ground transport |
35 | MktDistanceGroup | Distance group, in 500 mile intervals | - |
36 | MktMilesFlown | Market miles flown (track miles) | - |
37 | NonStopMiles | Non-Stop Market Miles (Radian Measure) | - |
38 | ItinGeoType | Itinerary Geography Type | - |
39 | MktGeoType | Market geography type | - |
Notes:
- Carriers report complete itineraries so this should be equal for all markets and coupons in an itinerary.
- For on-line itineraries (otherwise equal to 99).
DB1B Ticket
This table contains summary characteristics of each domestic itinerary on the Origin and Destination Survey, including the reporting carrier, itinerary fare, number of passengers, originating airport, roundtrip indicator, and miles flown.
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://www.transtats.bts.gov/Fields.asp?Table_ID=272
Columns:
Number | Column | Description | Notes |
---|---|---|---|
1 | ItinID | Itinerary ID | Primary key |
2 | Coupons | Number of coupons in the itinerary | - |
3 | Year | Year | - |
4 | Quarter | Quarter (1-4) | - |
5 | Origin | Origin airport code | - |
6 | OriginAptInd | Origin airport, multiple airports indicator | - |
7 | OriginCityNum | Origin airport, city code | - |
8 | OriginCountry | Origin airport, country | - |
9 | OriginStateFips | Origin airport, state FIPS | - |
10 | OriginState | Origin airport, state | - |
11 | OriginStateName | Origin state name | - |
12 | OriginWac | Origin airport, world area code | - |
13 | RoundTrip | Round trip indicator | - |
14 | OnLine | OnLine carrier indicator | - |
15 | DollarCred | Dollar credibility indicator | - |
16 | ItinYield | Total itinerary yield (ItinFare/MilesFlown) | - |
17 | RPCarrier | Reporting carrier | - |
18 | Passengers | Number of passengers | - |
19 | ItinFare | Total itinerary fare | - |
20 | BulkFare | Bulk fare indicator | - |
21 | Distance | Itinerary distance | Including Ground Transport |
22 | DistanceGroup | Distance group, in 500 mile intervals | - |
23 | MilesFlown | Itinerary miles flown (track miles) | - |
24 | ItinGeoType | Itinerary geography type | - |
Air Carrier Statistics (Form 41 Traffic)
Source: http://www.transtats.bts.gov/Tables.asp?DB_ID=111
T100 Domestic Market
This table contains domestic market data reported by both U.S. and foreign air carriers, including carrier, origin, destination, and service class for enplaned passengers, freight and mail when both origin and destination airports are located within the boundaries of the United States and its territories. Foreign carrier data is not available until 3 months after U.S. carrier data is released.
Frequency: Monthly
Range: 1990–Present
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://transtats.bts.gov/Fields.asp?Table_ID=310.
Columns:
Number | Column | Description | Notes |
---|---|---|---|
1 | Year | Year | - |
2 | Quarter | Quarter | - |
3 | Month | Month | - |
4 | Origin | Origin Airport | - |
5 | OriginCityName | Origin City | - |
6 | OriginCityNum | Origin City Code | - |
7 | OriginState | Origin State Code | - |
8 | OriginStateFips | Origin State FIPS | - |
9 | OriginStateName | Origin Airport, State Name | - |
10 | OriginWac | Origin Airport, World Area Code | - |
11 | Dest | Destination Airport | - |
12 | DestCityName | Destination City | - |
13 | DestCityNum | Destination City Code | - |
14 | DestState | Destination State Code | - |
15 | DestStateFips | Destination State FIPS | - |
16 | DestStateName | Destination Airport, State Name | - |
17 | DestWac | Destination Airport, World Area Code | - |
18 | AirlineID | US DOT carrier ID number | - |
19 | UniqueCarrier | Unique carrier code | e.g., PA, PA(1), PA(2) |
20 | UniqueCarrierName | Unique carrier name | e.g., Air Caribbean, Air Caribbean (1) |
21 | UniqCarrierEntity | Unique entity for a carrier’s operation region | - |
22 | CarrierRegion | Carrier’s operation region | - |
23 | Carrier | IATA carrier code | Not unique over time |
24 | CarrierName | Carrier Name | - |
25 | CarrierGroup | Carrier Group Code | Deprecated |
26 | CarrierGroupNew | Carrier Group New | - |
27 | Distance | Distance | miles |
28 | DistanceGroup | Distance Intervals | Every 500 miles |
29 | Class | Service Class | - |
30 | Passengers | On-Flight Market Passengers Enplaned | - |
31 | Freight | On-Flight Market Freight Enplaned | pounds |
32 | On-Flight Market Mail Enplaned | pounds |
T100 Domestic Segment
The T100 Domestic Segment is a table in the Form 41 Traffic database provided by the US Department of Transportation. The table description from the Form 41 Traffic database profile follows:
This table contains domestic non-stop segment data reported by U.S. air carriers, including carrier, origin, destination, aircraft type and service class for transported passengers, freight and mail, available capacity, scheduled departures, departures performed, and aircraft hours when both origin and destination airports are located within the boundaries of the United States and its territories.
Frequency: Monthly
Range: 1990–Present
Source: TranStats, US Department of Transportation, Bureau of Transportation Statistics: http://transtats.bts.gov/Fields.asp?Table_ID=311.
Columns:
Number | Column | Description | Notes |
---|---|---|---|
1 | year | Year the flight was provided | - |
2 | quarter | Quarter the flight was provided | - |
3 | month | Month the flight was provided | - |
4 | origin | Origin IATA airport code | - |
5 | origin_city_name | Origin city name | - |
6 | origin_city_num | Origin city number | - |
7 | origin_state_abr | Origin state abbreviation | - |
8 | origin_state_fips | Origin state FIPS code | - |
9 | origin_state_nm | Origin state name | - |
10 | origin_wac | Origin world area code | - |
11 | dest | Destination IATA airport code | - |
12 | dest_city_name | Destination city name | - |
13 | dest_city_num | Destination city number | - |
14 | dest_state_abr | Destination state abbreviation | - |
15 | dest_state_fips | Destination state FIPS code | - |
16 | dest_state_nm | Destination state name | - |
17 | dest_wac | Destination world area code | - |
18 | airline_id | DOT airline ID number | - |
19 | unique_carrier | DOT unique carrier code | - |
20 | unique_carrier_name | DOT unique carrier name | - |
21 | unique_carrier_entity | DOT unique carrier entity ID | - |
22 | region | Carrier region | See values below |
23 | carrier | IATA carrier code | - |
24 | carrier_name | Carrier name | - |
25 | carrier_group | Carrier group | - |
26 | carrier_group_new | New carrier group | - |
27 | distance | Distance in miles | - |
28 | distance_group | Distance group | In 500 mile intervals |
29 | class | Service class | - |
30 | aircraft_group | Aircraft class | - |
31 | aircraft_type | Aircraft type | - |
32 | aircraft_config | Aircraft configuration | - |
33 | departures_scheduled | Scheduled departures | - |
34 | departures_performed | Departures performed | - |
35 | payload | Available payload | pounds |
36 | seats | Number of seats | - |
37 | passengers | Passengers transported | - |
38 | freight | Freight transported | pounds |
39 | Mail transported | pounds | |
40 | ramp_to_ramp | Ramp to ramp time | minutes |
41 | air_time | Total airborne time | minutes |
Notes
Carrier Region may take the following values:
- A - Atlantic
- D - Domestic
- I - International
- L - Latin America
- P - Pacific
- S - System
Note that this does not correspond to the coding in the Aviation Support Tables, which simply lists carrier regions by name (e.g. “Atlantic”). One could match on the first character of these fields if needed.
Airline On-Time Performance Data
This table contains on-time arrival data for non-stop domestic flights by major air carriers, and provides such additional items as departure and arrival delays, origin and destination airports, flight numbers, scheduled and actual departure and arrival times, cancelled or diverted flights, taxi-out and taxi-in times, air time, and non-stop distance.
Source: http://www.transtats.bts.gov/Tables.asp?DB_ID=120
Range: 1987–Present
Frequency: Monthly
Columns:
Number | Column | Description |
---|---|---|
1 | Year | Year |
2 | Quarter | Quarter (1-4) |
3 | Month | Month |
4 | AirlineID | US DOT ID number |
5 | UniqueCarrier | Unique carrier code |
6 | Carrier (#1) | IATA carrier code |
7 | FlightDate | Flight date (yyyymmdd) |
8 | DayofMonth | Day of month |
9 | DayOfWeek | Day of week |
10 | Flights | Number of flights |
11 | FlightNum | Flight number |
12 | TailNum | Tail number |
13 | AirTime | Flight time (minutes) |
14 | ArrDel15 | Arrival delay indicator, 15+ min. |
15 | ArrDel30 | Arrival delay indicator, 30+ min. |
16 | ArrDelSys15 (#2) | Arrival delay indicator, 15+ min. |
17 | ArrDelSys30 (#2) | Arrival delay indicator, 30+ min. |
18 | ArrDelay | Arrival delay (minutes) |
19 | ArrTime | Actual arrival time (hhmm) |
20 | ArrTimeBlk | CRS arrival time block, hourly intervals |
21 | CRSArrTime | CRS arrival time (hhmm) |
22 | DepDel15 | Departure delay indicator, 15+ min. |
23 | DepDel30 | Departure delay indicator, 30+ min. |
24 | DepDelSys15 (#2) | Departure delay indicator, 15+ min. |
25 | DepDelSys30 (#2) | Departure delay indicator, 30+ min. |
26 | DepDelay | Departure Delay (minutes) |
27 | DepTime | Actual departure time (hhmm) |
28 | DepTimeBlk | CRS departure time block, hourly intervals |
29 | CRSDepTime | CRS departure time (hhmm) |
30 | Origin | Origin airport |
31 | OriginCityName | Origin airport, city name |
32 | OriginState | Origin airport, state code |
33 | OriginStateFips | Origin airport, state fips |
34 | OriginStateName | Origin airport, state name |
35 | OriginWac | Origin airport, world area code |
36 | Dest | Destination airport |
37 | DestCityName | Destination airport, city name |
38 | DestState | Destination airport, state code |
39 | DestStateFips | Destination airport, state fips |
40 | DestStateName | Destination airport, state name |
41 | DestWac | Destination airport, world area code |
42 | Distance | Non-stop distance (using radian measure) |
43 | DistanceGroup | Distance intervals (250 miles) |
44 | TaxiIn | Taxi in time (minutes) |
45 | TaxiOut | Taxi out time (minutes) |
46 | Off | Off time (hhmm) |
47 | On | On time (hhmm) |
48 | Cancelled | Cancelled flight indicator |
49 | CancellationCode | Reason for cancellation |
50 | Diverted | Diverted flight indicator |
51 | CarrierDelay | Carrier delay (minutes) |
52 | WeatherDelay | Weather delay (minutes) |
53 | NASDelay | NAS delay (minutes) |
54 | SecurityDelay | Security delay (minutes) |
55 | LateAircraftDelay | Late aircraft delay (minutes) |
Notes:
- Not unique over time.
- Includes cancelled or diverted flights.
Aviation Support Tables
The Aviation Support Tables provide comprehensive information about U.S. and foreign air carriers, carrier entities, worldwide airport locations, and other geographic data. These data also include information on various aircraft types, their manufacturer and model names. This information is developed and maintained by the Office of Airline Information, Bureau of Transportation Statistics, and is updated on an on-going basis.
Source: http://www.transtats.bts.gov/Tables.asp?DB_ID=595
Aircraft Types Table
This table identifies the different types of aircraft being reported to the DOT, by DOT assigned aircraft type codes. The information includes the DOT assigned aircraft code, the aircraft manufacturer, and the aircraft model names.
Source: http://www.transtats.bts.gov/Fields.asp?Table_ID=300
Columns:
ac_typeid - Aircraft type identification number. This number is related to the aircraft group number and falls within the range of a group number.
ac_group - Aircraft type group. This number gives the group or classification of aircraft engine and type of aircraft.
ssd_name - Aircraft name.
manufacturer - Manufacturing company name.
long_name - Complete name of the aircraft.
short_name - Abbreviated name of the aircraft.
begin_date - The date when the aircraft was added to the database.
end_date - The date through which aircraft type remains in effect.
Sample Record:
"691",6,"A300-600/R/CF/RCF","AIRBUS INDUSTRIE","AIRBUS INDUSTRIE A300-600/R/CF/RCF","A300-6",1990-01-01,
Carrier Decode Table
This table provides a list of domestic and foreign air carrier codes, by T–100 and OAG designations, charter carrier codes, carrier descriptions, entity codes, service region, and beginning and ending dates of service.
Source: http://www.transtats.bts.gov/Fields.asp?Table_ID=304
Columns:
airline_id - An identification number assigned by US DOT to identify a unique airline (carrier). A unique airline (carrier) is defined as one holding and reporting under the same DOT certificate regardless of its Code, Name, or holding company/corporation.
carrier - Code assigned by IATA and commonly used to identify a carrier. As the same code may have been assigned to different carriers over time, the code is not always unique.
carrier_entity
carrier_name
unique_carrier - Unique carrier code. When the same code has been used by multiple carriers, a numeric suffix is used for earlier users, for example, PA, PA(1), PA(2). Use this field for analysis across a range of years.
unique_carrier_entity - Unique entity for a carrier’s operation region.
unique_carrier_name - Unique carrier name. When the same name has been used by multiple carriers, a numeric suffix is used for earlier users, for example, Air Caribbean, Air Caribbean (1).
wac - World Area Code.
carrier_group - Carrier group code. Used in legacy analysis.
carrier_group_new - Carrier group new.
region - Carrier’s operation region. Carriers report data by operation region.
start_date_source - Starting date of carrier code.
thru_date_source - Ending date of carrier code (active = NULL).
Notes:
- The dates are not in the usual ISO format (YYYY-MM-DD) used by most software so be careful when importing this table.
Master Coordinate Table
This table contains a list of domestic and foreign airport codes and their associated city codes, world area codes, city or airport names, state or country names, and latitude and longitude information.
Source: http://www.transtats.bts.gov/Fields.asp?Table_ID=288
Columns:
airport - Airport code.
tr_airport_name - Airport name with country/state information.
tr_city_name - City name with country/state information.
tr_state_name - State name.
tr_country_name - Country name.
tr_wac - World Area Code.
begin_date_source - Starting date of airport code.
end_date - Ending date of airport code (active = NULL).
airport_region - Airport region.
lat_degrees - Latitude, degrees.
lat_minutes - Latitude, minutes.
lat_seconds - Latitude, seconds.
lat_hemisphere - Latitude, hemisphere.
latitude - Latitude (decimal).
lon_degrees - Longitude, degrees.
lon_minutes - Longitude, minutes.
lon_seconds - Longitude, seconds.
lon_hemisphere - Longitude, hemisphere.
longitude - Longitude.
airport_id - Unique airport numeric value.
Notes:
Some entries in the wac column are zero! This is an invalid value according to the DOT World Area Code lookup table.
The region column is very inconsistent (e.g., Wisconsin, Wis, Wis.) and is basically a duplication of the information in the state column.
The coordinates are stored in two formats: minutes-degrees-seconds and decimal degrees.