Summary
In information science, profiling refers to the process of construction and application of s generated by computerized data analysis. This is the use of algorithms or other mathematical techniques that allow the discovery of patterns or correlations in large quantities of data, aggregated in databases. When these patterns or correlations are used to identify or represent people, they can be called profiles. Other than a discussion of profiling technologies or population profiling, the notion of profiling in this sense is not just about the construction of profiles, but also concerns the application of s to individuals, e. g., in the cases of credit scoring, price discrimination, or identification of security risks . Profiling is being used in fraud prevention, ambient intelligence, and consumer analytics. Statistical methods of profiling include Knowledge Discovery in Databases (KDD). The technical process of profiling can be separated in several steps: Preliminary grounding: The profiling process starts with a specification of the applicable problem domain and the identification of the goals of analysis. Data collection: The target dataset or database for analysis is formed by selecting the relevant data in the light of existing domain knowledge and data understanding. Data preparation: The data are preprocessed for removing noise and reducing complexity by eliminating attributes. Data mining: The data are analysed with the algorithm or heuristics developed to suit the data, model and goals. Interpretation: The mined patterns are evaluated on their relevance and validity by specialists and/or professionals in the application domain (e.g. excluding spurious correlations). Application: The constructed profiles are applied, e.g. to categories of persons, to test and fine-tune the algorithms. Institutional decision: The institution decides what actions or policies to apply to groups or individuals whose data match a relevant profile. Data collection, preparation and mining all belong to the phase in which the profile is under construction.
About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related publications (15)
Related concepts (3)
Digital footprint
Digital footprint or digital shadow refers to one's unique set of traceable digital activities, actions, contributions, and communications manifested on the Internet or digital devices. Digital footprints can be classified as either passive or active. The former is composed of a user's web-browsing activity and information stored as cookies. The latter is often released deliberately by a user to share information on websites or social media. While the term usually applies to a person, a digital footprint can also refer to a business, organization or corporation.
Targeted advertising
Targeted advertising is a form of advertising, including online advertising, that is directed towards an audience with certain traits, based on the product or person the advertiser is promoting. These traits can either be demographic with a focus on race, economic status, sex, age, generation, level of education, income level, and employment, or psychographic focused on the consumer values, personality, attitude, opinion, lifestyle and interest.
Internet privacy
Internet privacy involves the right or mandate of personal privacy concerning the storage, re-purposing, provision to third parties, and display of information pertaining to oneself via the Internet. Internet privacy is a subset of data privacy. Privacy concerns have been articulated from the beginnings of large-scale computer sharing and especially relate to mass surveillance enabled by the emergence of computer technologies. Privacy can entail either personally identifiable information (PII) or non-PII information such as a site visitor's behaviour on a website.