Publication

Clustering citation histories in the Physical Review

Abstract

We investigate publications through their citation histories – the history events are the citations given to the article by younger publications and the time of the event is the date of publication of the citing article. We propose a methodology, based on spectral clustering, to group citation histories, and the corresponding publications, into communities and apply multinomial logistic regression to provide the revealed communities with semantics in terms of publication features. We study the case of publications from the full Physical Review archive, covering 120 years of physics in all its domains. We discover two clear archetypes of publications – marathoners and sprinters – that deviate from the average middle-of-the-roads behaviour, and discuss some publication features, like age of references and type of publication, that are correlated with the membership of a publication into a certain community.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related concepts (17)
Multinomial logistic regression
In statistics, multinomial logistic regression is a classification method that generalizes logistic regression to multiclass problems, i.e. with more than two possible discrete outcomes. That is, it is a model that is used to predict the probabilities of the different possible outcomes of a categorically distributed dependent variable, given a set of independent variables (which may be real-valued, binary-valued, categorical-valued, etc.).
Linear regression
In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is called simple linear regression; for more than one, the process is called multiple linear regression. This term is distinct from multivariate linear regression, where multiple correlated dependent variables are predicted, rather than a single scalar variable.
Citation
A citation is a reference to a source. More precisely, a citation is an abbreviated alphanumeric expression embedded in the body of an intellectual work that denotes an entry in the bibliographic references section of the work for the purpose of acknowledging the relevance of the works of others to the topic of discussion at the spot where the citation appears. Generally, the combination of both the in-body citation and the bibliographic entry constitutes what is commonly thought of as a citation (whereas bibliographic entries by themselves are not).
Show more
Related publications (32)

Beyond the average consumer: Mapping the potential of demand-side management among patterns of appliance usage

Claudia Rebeca Binder Signer, Selin Yilmaz, Matteo Barsanti

To support the decarbonisation of the power sector and offset the volatility of a system with high levels of renewables, there is growing interest in residential Demand-Side Management (DSM) solutions. Traditional DSM strategies require consumers to active ...
2024

Spatial Distributions of Diarrheal Cases in Relation to Housing Conditions in Informal Settlements: A Cross-Sectional Study in Abidjan, Côte d’Ivoire

Jérôme Chenal, Vitor Pessoa Colombo, Jürg Utzinger

In addition to individual practices and access to water, sanitation, and hygiene (WASH) facilities, housing conditions may also be associated with the risk of diarrhea. Our study embraced a broad approach to health determinants by looking at housing depriv ...
2023

Diverse parameters of ambulatory knee moments differ with medial knee osteoarthritis severity and are combinable into a severity index

Julien Favre

Objective: To characterize ambulatory knee moments with respect to medial knee osteoarthritis (OA) severity comprehensively and to assess the possibility of developing a severity index combining knee moment parameters. Methods: Nine parameters (peak amplit ...
FRONTIERS MEDIA SA2023
Show more
Related MOOCs (2)
Selected Topics on Discrete Choice
Discrete choice models are used extensively in many disciplines where it is important to predict human behavior at a disaggregate level. This course is a follow up of the online course “Introduction t
Selected Topics on Discrete Choice
Discrete choice models are used extensively in many disciplines where it is important to predict human behavior at a disaggregate level. This course is a follow up of the online course “Introduction t

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.