The GDPR governs the processing of personal data. In the context of health and human genetic data processing, determining when data are personal data is of fundamental importance and, at the same time, a challenge.
As portrayed in the subentry on ‘Data Protection Main Principles’, the intrinsic nature and characteristics of human genetic data raise complex challenges not only for data classification but also for the interpretation and application of data protection principles and rules as a whole.
Concerning the processing of human genetic data within the clinical, health and pharmaceutical ecosystems, including for research purposes, it becomes increasingly harder to underpin when genomic data is to be considered personal data. As technical and scientific developments unfold, the analysis of human genetic data becomes increasingly specialized and complex, which may result in the segmentation of human genetic data to an extent which may hinder the task of classifying it as genetic information, health information and personal or non-personal data.
Furthermore, human genetic data is often pseudonymised or processed through automated means which seek to take out the personal or identifiable elements of such data. Very often, such measures do not ensure the irreversible anonymisation of personal data given the subsistence of a link, albeit indirect, with the data subject to whom such data concerns.
At the same time, genomic data may be seen as inherently identifying personal data (even in cases where there are no further links to or impacts on the data subject) since these are, in themselves, elements which inextricably integrate part of a person’s identity.
Moreover, another challenge is the interpretation of the concept of human genetic data as defined in the law. The scientific and legal ambiguities associated with this particular kind of data, as well as the current tendency for legal fragmentation and the lack of a homogenous approach, jeopardize a tangible definition. This generates risks and threats related to the existing data protection principles and requirements, as well as the disruption of the potential for specific regulations, guidelines or best practices on the processing of human genetic data across the EU.
Although the concepts of genetic and health data may overlap, genetic data may provide insight into a data subject’s future health condition. Additionally, genetic data may unveil information concerning groups of individuals (such as family). This also brings about intricacies in the application of data protection principles and provisions as most legal frameworks take an individualistic approach to data protection rights.
Any processing of personal data must comply with the GDPR, deemed to be the most stringent law on data protection in the world. When the data at stake are sensitive data - such as health and genetic data – the process becomes even more complicated, for a start because two cumulative legal grounds must be met to guarantee the lawfulness of the data processing: a general legal ground for every kind of personal data processing, set forth in Article 6(1) of the GDPR, and a specific legal ground, required only for sensitive data, established in Article 9(2) of the GDPR.
Failure to comply with the norms of the GDPR might involve heavy administrative fines. Article 83 GDPR provides for two tiers of fines: the less severe ones can go up to €10 million, or in the case of an undertaking, up to 2% of the worldwide annual revenue from the preceding financial year, whichever amount is higher (Article 83(4) GDPR); whereas the most severe infringement can result in a fine up to €20 million, or in the case of an undertaking, up to 4% of the worldwide annual revenue from the preceding financial year, whichever amount is higher (Article 83(5) GDPR). In addition, the data controllers and/or data processors that, due to the infringement of the GDPR, caused material or non-material damages to a natural person, might be asked to pay compensation for damages, as per Article 82 of the GDPR.
The GDPR does not exclude the provision of criminal penalties for its violation, as provided for in the national laws of Member States.
A way to avoid being ruled by the GDPR - and thus to avoid its severe sanctions – is to operate with anonymised data instead. However, full anonymisation is extremely difficult to reach, as even the smallest detail can reveal the natural person’s identity.[2] The challenges posed by anonymization are especially stringent in regard to health data and genetic data. It has been said that anonymisation is impossible to achieve, as it is always possible to reverse the process and identify the data subject. This is especially true for genetic data, which are related to core identifying features of the human person.
GDPR’s Recital 34 extends the definition of genetic data to a general concept, which not only includes chromosomal, DNA or RNA analysis, but all other types of analysis that enable the obtainment of equivalent information. Thus, although it may be suggested that (i) not all genetic information is genetic data; (ii) not all genetic information is personal data; (iii) a genetic sample is not personal data until it enables the drawing of conclusions through analysis and data production; as well as that (iv) data resulting from the analysis of genetic data may only constitute personal data to the extent such genetic data is associated to an identifiable person; the definition of personal data included in the GDPR is broad to the extent that encompasses any identification by reference to factors specific to the genetic identity of an individual, independently of the means of analysis at stake.
Consequently, the practical effect of such interpretation leads us to the conclusion that the processing of human genetic data (which includes core genetic markers uniquely related to a person), as well as the results stemming from such processing, will most likely constitute personal data. Moreover, the processing of certain special categories of personal data, in particular genetic data, must comply with Article 9 of the GDPR.
Additionally, all results stemming from the analysis of genetic data which are linked to a specific biological sample are usually personal data, even if the results themselves are not unique to the individual, because the sample is by its nature specific to an individual and provides the link back to his/her specific genetic identity.
Nevertheless, it should be noted that in some cases genetic data may not constitute personal data. Suppose, for instance, anonymised information (for instance, genetic testing results) that may no longer, in any circumstance, be associated with a specific person, provided that there are no further records of genetic identity nor any other identifier. However, as seen, this scenario is not common in practical terms.
In a 2021 document, the European Data Protection Board (EDPB) recognised the fallibility of anonymisation techniques regarding genetic data and recommended data controllers always treat those data as if they were personal data, even when allegedly anonymised: "The EDPB points out that the possibility to anonymise genetic data remains an unresolved issue. As yet, it remains open to be demonstrated whether any combination of technical and organisational means can be effectively employed to remove genetic information from the material scope of the GDPR (...) it is strongly advised that such genetic data is treated as personal data and that the processing thereof is conducted with the implementation of appropriate technical and organisational measures to ensure compliance with the Regulation" [3]
[2] Finck, M., & Pallas, F. They who must not be identified—distinguishing personal from non-personal data under the GDPR. International Data Privacy Law, 10(1), 11-36, 2020.
[3] European data Protection Board, EDPB Document on response to the request from the European Commission for clarifications on the consistent application of the GDPR, focusing on health research, 2 February, 2021, https://edpb.europa.eu/sites/default/files/files/file1/edpb_replyec_questionnaireresearch_final.pdf?fbclid=IwAR1Vq9hjGYWjgfRHZdT6K326uQCEWHqaSdppoBHBVrm0rcUCSr7vIAh4PLU