Publication result detail

Collection of Datasets with DNS over HTTPS Traffic

JEŘÁBEK, K.; HYNEK, K.; ČEJKA, T.; RYŠAVÝ, O.

Original Title

Collection of Datasets with DNS over HTTPS Traffic

English Title

Collection of Datasets with DNS over HTTPS Traffic

Type

WoS Article

Original Abstract

The DNS over HTTPS (DoH) is becoming a default option for domain resolution in modern privacy-aware software. Therefore, researchers have already focused on various aspects; however, a comprehensive dataset from an actual production network is still missing. This paper presents a collection of novel datasets comprising multiple PCAP files of DoH and HTTPS traffic. The captured traffic is generated towards multiple DoH providers to cover differences of various DoH server implementations and configurations. In addition to generated traffic, we also provide real network traffic captured on high-speed backbone lines of a large Internet Service Provider with around half a million users. Even though the network identifiers (excluding network identifiers of DoH resolvers) in the real network traffic (e.g., IP addresses and transmitted content) were anonymized, the essential characteristics of the traffic can still be obtained from the data. Therefore, the dataset can be used in whole network traffic analysis areas such as traffic classification research.

English abstract

The DNS over HTTPS (DoH) is becoming a default option for domain resolution in modern privacy-aware software. Therefore, researchers have already focused on various aspects; however, a comprehensive dataset from an actual production network is still missing. This paper presents a collection of novel datasets comprising multiple PCAP files of DoH and HTTPS traffic. The captured traffic is generated towards multiple DoH providers to cover differences of various DoH server implementations and configurations. In addition to generated traffic, we also provide real network traffic captured on high-speed backbone lines of a large Internet Service Provider with around half a million users. Even though the network identifiers (excluding network identifiers of DoH resolvers) in the real network traffic (e.g., IP addresses and transmitted content) were anonymized, the essential characteristics of the traffic can still be obtained from the data. Therefore, the dataset can be used in whole network traffic analysis areas such as traffic classification research.

Keywords

DNS over HTTPS, DNS, HTTPS, Computer, Network,Monitoring,Network Traffic

Key words in English

DNS over HTTPS, DNS, HTTPS, Computer, Network,Monitoring,Network Traffic

Authors

JEŘÁBEK, K.; HYNEK, K.; ČEJKA, T.; RYŠAVÝ, O.

RIV year

2023

Released

01.06.2022

ISBN

2352-3409

Periodical

Data in Brief

Volume

2022

Number

42

State

Kingdom of the Netherlands

Pages from

1

Pages to

13

Pages count

14

URL

BibTex

@article{BUT178119,
  author="Kamil {Jeřábek} and Karel {Hynek} and Tomáš {Čejka} and Ondřej {Ryšavý}",
  title="Collection of Datasets with DNS over HTTPS Traffic",
  journal="Data in Brief",
  year="2022",
  volume="2022",
  number="42",
  pages="1--13",
  doi="10.1016/j.dib.2022.108310",
  issn="2352-3409",
  url="https://www.sciencedirect.com/science/article/pii/S2352340922005121"
}

Documents