Network, Browser and Website Fingerprint

From Whonix
Jump to navigation Jump to search

About Whonix Network, Bridge/Guard, Internet Service Provider (ISP) Fingerprint and Website Traffic Fingerprinting.

Introduction[edit]

In this chapter, the term fingerprint refers to the specific way Whonix behaves on the Internet. Those specificities could be used to determine whether a particular user is running Whonix or not.

As explained on the Warning page, the default Whonix configuration does not hide Tor use from network observers. However, the Whonix design attempts to make Whonix users indistinguishable from the rest of the Tor population, particularly Tor Browser (TB) [1] users. If Whonix and TBB [1] users have distinct fingerprints, then this information significantly degrades anonymity because the Whonix user base is far smaller than the broader Tor population. [2]

This section briefly addresses possible Whonix fingerprinting issues and how adversaries might use this information to verify Whonix is in use.

Fingerprinting Information[edit]

Info Tip: A unique fingerprint makes the user pseudonymous rather than anonymous.

Various types of information can be leaked about the user's browser, (host) operating system and hardware depending on the external party in question.

Entry Guards or Bridges[edit]

As noted in the Guard Fingerprinting chapter, using persistent Tor guards or bridges can threaten anonymity under certain circumstances:

While natural guard rotation is recommended, there are some corner cases in which an adversary could fingerprint the entry guards and de-anonymize a user. For instance:

  • The same entry guards are used across various physical locations and access points.
  • The same entry guards are used after permanently moving to a different physical location.

For example, if an adversary monitoring network activity observes a user connecting from multiple physical locations with persistent guards/bridges, then it can be reasonably assumed that all connections stem from the same person. Mitigating this risk requires techniques like using new Tor entry guards or configuring alternate bridges for different places.

Nick Mathewson, co-founder of The Tor Project suggests additional precautions when moving networks: [3]

  • Spoof the MAC address with randomized values on each move.
  • Absolutely prevent non-Tor connections.
  • Ensure a unique set of Tor entry guards (bridges) is utilized for each network you connect from. [4]
  • Minimize the threat of stored Tor state files which record every network visited.

"Absolutely prevent non-Tor connections." is useful anyhow for other reasons as well, see also Do Not Connect to a Server Anonymously and Non-anonymously at the Same Time. [5]

ISP or Local Network Administrators[edit]

Table: Fingerprinting Domains

Domain Description
Network Stack Hardening Whonix has implemented various security hardeningarchive.org measures like disabling TCP timestamps, ICMP redirections, firewalling invalid packages, and more. Unfortunately these measures can increase the risk of ISP or Local Network fingerprinting. Despite this, security hardening has been prioritized.
Random ISN Generation Whonix prevents de-anonymization of Tor onion servicesarchive.org through Tirdad kernel module for random ISN generationarchive.org. De-anonymization resistance unfortunately reduces ISP or Local Network fingerprinting resistance. Despite this, resistance against de-anonymization has been prioritized.
Tor Enforcement

Whonix solely generates Tor activity on the network. All traffic from both the Whonix-Workstation (whonix-workstation-17) and Whonix-Gateway (whonix-gateway-17) such as TBB [1], updates, and sdwdate pass through Tor. Establishing an online connection is the task of the host, so the host [6] is most likely using DHCP to obtain a local IP address.

In contrast, usually TBB [1] users have additional network activity outside of Tor, either from another web browser or other applications. This means the proportion or volume of Tor activity might be feasible identifiers to predict whether a user is running Whonix or the TBB [1]. It is probably harder for the ISP to determine whether a single user is solely generating Tor traffic (and potentially using Whonix) if:

  • The Internet connection is shared with other users that do not run Whonix.
  • A browser is also used on the host. [7]
Tor Entry Guards Whonix uses an unmodified version of Tor, [8] so entry guards are used as the default mechanism to connect to the Tor network. [9] Consequently, a Tor user will maintain the same relay as the first hop for an extended period, [10] which is a security feature.

One addition which is unique to Whonix is its Tor configuration [11] for the Stream Isolation[12] security feature.

Time Synchronization When Whonix is started, the system clock is synchronized to make sure it slightly differs from the host clock via sdwdate. systemcheckarchive.org also issues some network traffic to check for updates and news, which all passes through different circuits. This behavior might be specific to Whonix. [13]
Website Traffic Fingerprinting Website traffic fingerprinting is also an open Tor research question, which is unspecific to Whonix. [14] A related and unresearched issue is whether fingerprinting risks also apply to other traffic, such as apt traffic.
Whonix-Host Once Whonix-Host is available it will exclusively generate Tor traffic in the default configuration. [15] This is very different from other operating systems, for example a Microsoft Windows host running Tor Browser.

In conclusion, the ability of the ISP or local network administrator to distinctly identify users who utilize Tor Browser Bundle, Whonix, Tails, a custom transparent Tor proxy or similar project, depends on how differently a system is configured. Anonymity/security/privacy-focused operating systems will inevitably differ from other Tor users using popular operating systems. For example, somebody using Tails or Whonix will have a different network fingerprint than a Microsoft Windows user utilizing Tor Browser Bundle since that will be paired with standard clearnet traffic. This is a necessary tradeoff to advance the state of technological privacy relative to the status quo.

Improved fingerprinting resistance in the near term is unlikely. This would require choosing and emulating the most popular host operating system in combination with the most widespread method of accessing Tor, which is probably Tor Browser on Microsoft Windows. Without running this setup, it is infeasible the Microsoft Windows network fingerprint could be sufficiently emulated in order to fool an ISP or local network observer. Putting aside the many reasons to not use Windows, even after lots of effort it could not be proven effective.

A related topic is the potential to Hide Tor use from the Internet Service Provider. It has been concluded it is possible to circumvent censorship, but hiding Tor is impractical. This is especially true with respect to global passive adversaries who can record and store all relevant traffic.

It is impossible to hide Tor use from the Internet Service Provider (ISP). It has been concluded this goal is difficult beyond practicality.

Visited Websites[edit]

Destination websites can retrieve a lot of information about a user's browser and system, while advanced adversaries have even greater capabilities. [16] This information can include:

  • the browser name and version
  • CSS media queries:
    • window dimensions
    • desktop size
    • widget size
    • display type
    • DPI
  • a list of available extensions
  • timezone [17]
  • available fonts
  • user agent
  • video card in use [18]
  • CPU and interpreter speed
  • browser history [19]
  • via exploited plugins:
    • leak the non-Tor IP address
    • interface addresses and other machine information
    • list all plugins to fingerprint the user
    • retrieve unique plugin identifiers
  • read / store identifiers related to HTTP auth, DOM storage, cached scripts, client certificates and TLS session IDs
  • browser cache
  • giving the user a unique favicon [20] [21]

To make it difficult to distinguish Whonix and TBB [1] users, TBB [1] is included on the platform. Therefore, Whonix should provide the same information as TBB [1] in order to generate very similar fingerprints.

Website Traffic Fingerprinting[edit]

"Website fingerprinting" is an attack where the adversary observes a user's encrypted data traffic, and uses traffic timing and quantity to guess what website is being visited. In order to find potential target matches, the adversary utilizes a database of web pages -- which are regularly downloaded in order to record their traffic timing and quantity characteristics -- for comparison against encrypted traffic. This attack is carried out by adversaries external to the Tor network who observe traffic between a user and the Tor relay or bridge. [22] [23] The observer will not know the exact contents of the page (such as user names, passwords, etc.), but they can know with reasonable certainty that a specific website was visited, such as google.com.

This Tor Project blog postarchive.org suggests this attack on clearnet sites visited through Tor is impractical. The reason is the sheer variety of traffic and the vast number of pages visited through Tor results in a very high rate of false positives. [24] However, the attack might be more practical against users visiting Onion services because the classifier leverages real-time ad network bidding and DNS to further narrow down the possible set of pages accessed to the Onion-space and further still, picking out which one. Onion service padding is deployed to mitigate this threat.

This attack should not be confused with end-to-end correlation attacks performed by malicious Entry guards set up by Tor-relay level adversaries.

Whonix Fingerprint Comparison[edit]

Further Reading[edit]

See Also[edit]

References[edit]

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 TBB stands for Tor Browser Bundlearchive.org. It is included in Whonix, see Tor Browser.
  2. Certainly less than 50,000 users, although an exact figurearchive.org is yet to be published.
  3. tor-dev: entry guards and linkabilityarchive.org
  4. Note: this is not a recommendation for non-persistent guards because a hostile DHCP server might provide new IPs until a hostile guard is chosen.
  5. Qubes feature request: document how to route all traffic over Tor / how to disable Qubes default clearnet trafficarchive.org
  6. In case of Default/Download version, it is the host's task to establish an online connection. In the case of Physical Isolation, it is the gateway's task to establish an online connection.
  7. This comes with the attendant risk of the user confusing one browser with another.
  8. Whonix uses an unmodified version of Tor
  9. https://support.torproject.org/#about_entry-guardsarchive.org
  10. Typically the entry guards are rotated after a few months.
  11. https://github.com/Whonix/anon-gw-anonymizer-configarchive.org
  12. Stream Isolation
  13. It is unknown if an ISP can detect whether a user has many different Tor circuits open. On the other hand, Tor seems to only open X entry guards and maintain them for a period, thus not opening as many entry guards as streams.
  14. See Tor Browser Designarchive.org for further exploration of this issue.
  15. Should Whonix host be fully torified by default?archive.org
  16. Information that is leaked depends on the browser in use, JavaScript settings, Tor Browser's security slider settings, whether it is a malicious attack or not, and other factors.
  17. Via the date object.
  18. Via WebGL.
  19. For example, using CSS and JavaScript to perform history disclosure attacks.
  20. Note that in the Tor Browser, favicons are wiped after pressing the 'New Identity' button or exiting the current session. The Tor Browser cannot protect you from this method while you are in a session though. This method is usually only used to detect if a user of the site that previously browsed the site at a different connection is accessing the site from the same browser regardless if they try to obfuscate where they are connecting from. There are no recorded attacks of this being used against Whonix or regular users in the wild, but practical implementations do exist.
  21. An example of an implementation is the supercookie project at https://github.com/jonasstrehle/supercookiearchive.org
  22. Quote research paper by University of Waterloo, Website Fingerprinting: Attacks and Defensesarchive.org:

    Website fingerprinting attacks allow a local, passive eavesdropper to determine a client's web activity by leveraging features from her packet sequence. These attacks break the privacy expected by users of privacy technologies, including low-latency anonymity networks such as proxies, VPNs, or Tor. As a discipline, website fingerprinting is an application of machine learning techniques to the diverse field of privacy. To perform a website fingerprinting attack, the eavesdropping attacker passively records the time, direction, and size of the client's packets. Then, he uses a machine learning algorithm to classify the packet sequence so as to determine the web page it came from.

  23. https://forums.whonix.org/t/new-low-cost-traffic-analysis-attacks-and-mitigations/8708archive.org forum discussion
  24. https://blog.torproject.org/critique-website-traffic-fingerprinting-attacksarchive.org

License[edit]

Whonix Fingerprint wiki page Copyright (C) Amnesia <amnesia at boum dot org>
Whonix Fingerprint wiki page Copyright (C) 2012 - 2023 ENCRYPTED SUPPORT LP <

This program comes with ABSOLUTELY NO WARRANTY; for details see the wiki source code.
This is free software, and you are welcome to redistribute it under certain conditions; see the wiki source code for details.

We believe security software like Whonix needs to remain open source and independent. Would you help sustain and grow the project? Learn more about our 12 year success story and maybe DONATE!