Andrea Mannocci

Institute of Information Science and Technologies (ISTI), Italian National Research Council (CNR), Pisa, Italy
andrea.mannocci [at]

In a nutshell: Data scientist & researcher; scholarly knowledge mining, science of science, analysis of the research landscape in the broader socioeconomic and geopolitical context, (research) data infrastructures.

The full CV can be found here.


Research Fellow

Institute of Information Science and Technologies (ISTI), Italian National Research Council (CNR), Pisa, Italy
  • Member of the InfraScience laboratory
    • Participation in the activities of the EU-funded project OpenAIRE NEXUS.
    • Data analysis of the OpenAIRE Research Graph and other scholarly data sources such as ORCID, CORDIS, CrossRef.
    • Comparison and evaluation of different scholarly data sources and the analysis use cases they can support.
    • Quality and completeness assessment of the information contained in the OpenAIRE Research Graph.
    • Research on novel metrics and indicators to measure the openness of science and evaluate the impact of scientific research.
April 2019 - Present

Research Associate

The Knowledge Media Institute (KMi), The Open University, Milton Keynes, UK
  • Member of the Scholarly Knowledge Modelling, Mining and Sense Making (SKM3) team
    • Research focusing on the geopolitical and socioeconomic impact of academic research; analysis of scholarly data in the broader context.
    • Analysis of trends and dynamics within conference venues. Analysis of authors' affiliations and their distribution across the research landscape.
    • Analysis of the evolution and forecast of technology diffusion across different disciplines and research areas.
    • Creation and maintenance of the Computer Science Ontology (automatically generated from scientific literature) and automatic classification of papers.
    • Re-engineering of the SKM3 infrastructure for scholarly knowledge analytics: moving from a single dataset hosted on a relational database to a scalable aggregation and analytics infrastructure built on the university's Hadoop-based Big Data Cluster.
    • Exploration of the major scholarly datasets and sources of scholarly data such as CrossRef, Microsoft Academic, Dimensions, Scopus, SciGraph, SemanticScholar and DBLP.
    • Management of the SKM3 Azure infrastructure for a collaboration with Microsoft Academic.
  • Technical support and consultancy for the project CityLabs. Consultancies delivered to several local SMEs such as Medietas, UK postings and Design Buildings Ltd.
February 2017 - April 2019

Research Assistant

Institute of Information Science and Technologies (ISTI), The National Research Council of Italy (CNR), Pisa, Italy
  • OpenAIRE2020 task leader: data flows and dynamics monitoring services
  • EAGLE technical manager: aggregation workflows and data quality
  • Collaboration in the projects OpenAIRE2020, OpenAIREplus, HOPE, EFG and iCORDI
  • Realisation of service-oriented architectures (SOA) for data integration infrastructures
  • Development of the D-NET software toolkit
  • Design of distributed systems for collaborative research environments
  • Data modelling, data management, curation and publishing
November 2012 - January 2017

Research Assistant

IMDEA Networks, Madrid, Spain
  • Optimisation and performance evaluation of 802.11 MAC protocols
  • Collaboration within the framework of the EU FP7 research project FLAVIA
  • Research on social networks and on predictions based on social media
September 2010 - September 2011

Research Assistant

The Mærsk Mc-Kinney Møller Institute, The University of Southern Denmark, Odense, Denmark
  • Research in dynamic updating of Java software
  • Realisation of a case study for Javeleon, a dynamic software updating system developed by a research group at the institute
August 2009 - April 2010


Ph.D. in Information Engineering

University of Pisa, Italy

Thesis: “Data flow quality monitoring in data infrastructures”
Supervisors: Dr. Paolo Manghi (ISTI-CNR), Prof. Marco Avvenuti (University of Pisa)

2013 - 2016

M.Sc. in Telematic Engineering

University Carlos III of Madrid, Spain

Thesis: “Control Theoretic Optimization of 802.11 WLANs: Implementation and Experimental Evaluation”
Supervisors: Prof. Albert Banchs Roca (IMDEA Networks & UC3M), Prof. Vincenzo Mancuso (IMDEA Networks)

2010 - 2011

M.Sc. in Computer Engineering

University of Pisa, Italy

Thesis: “Stepwise Evolution of Java Applications using Dynamic Updates”
Supervisors: Prof. Bo Nørregaard Jørgensen (Syddansk Universitet), Prof. Marco Avvenuti (University of Pisa)

2007 - 2010

B.Sc. in Computer Engineering

University of Pisa, Italy

Thesis: “Design and realisation of a software module for PLC/PAC remote controlling”
Supervisor: Prof. Aldo Balestrino (University of Pisa)

2003 - 2007


Programming Languages & Tools
  • Programming skills: Python, Java, JavaScript, C/C++, Bash scripting. Proficiency with Apache Maven and VCSs (SVN & Git). Advanced knowledge of the Visual Studio Code, IntelliJ, Eclipse and NetBeans IDEs.
  • Data science frameworks & tools: Python notebooks, Pandas; big data: Hadoop ecosystem, Hive, Spark, MapReduce; complex networks: NetworkX, igraph; machine learning: scikit-learn, TensorFlow/Keras; NLP: Gensim, NLTK; visualisation: pyplot, plotly.
  • Java frameworks: proficiency with Spring framework (Core, MVC, Security), JPA and Hibernate.
  • Web development & JS frameworks: jQuery, Bootstrap, AngularJS, CSS, Django, JSP.
  • Application monitoring: ELK stack (Logstash, Elasticsearch, Kibana), Prometheus.
  • Markup languages: HTML, XHTML, XML (XML Schema, XPath, XQuery, XSL), LaTeX.
  • Web services and web applications: SOAP, WSDL, WADL, JAX-WS, REST, JAX-RS, Apache CXF, Jersey, Apache HTTP server and Apache Tomcat, Jetty, ServiceMix, OSGi.
  • Data management systems: proficiency with both SQL and NoSQL paradigms; configuration and optimisation of databases; relational databases: PostgreSQL, MySQL, SQLite; document stores: eXist, MongoDB; triple stores: GraphDB, BlazeGraph; k-v stores: Redis, Voldemort, EhCache; time series databases: InfluxDB, Druid; full-text indexes: Lucene/Solr, Elasticsearch.
  • Networking: networking and Internet protocols; wireless networking and mobility; inter/intra-domain routing; network security; Cisco CCNA certification.


Love inline skating and freestyle slalom (instructor license), windsurfing & SUP, skiing, travelling, playing electric guitar, folding origami, shooting pictures, reading Scandinavian crime novels and fine cooking.

Awards, Scholarships & Committees

  • Best Paper Award at The Web Conference 2018 (SAVE-SD workshop)
  • $20,000 Azure4research grant (Microsoft Research), 2018
  • GYM: Grants for Young Mobility (ISTI-CNR), 2016
  • Early Career Scientists programme (Research Data Alliance, RDA), 2014

  • Invited member of the steering committee
  • Program committee member for several international conferences and workshops such as K-CAP 2017-2019, ESWC 2019, The Web Conference 2018-2019, ISWC 2017, Re:coding Black Mirror 2017-2019 workshop series, EAGLE.