Ir para o conteúdo
GovBR

Navigating technical, legal, and ethical hurdles to scraping LinkedIn data for academic research

  • In an era where professional career data is critical for analyzing occupational trends and organizational dynamics, LinkedIn data offers a rich corpus for academic research due to its expansive user base and frequent updates. This paper examines technical, legal, and ethical challenges associated with scraping LinkedIn profiles for research, arguing that scraping is the most effective method for acquiring comprehensive LinkedIn data compared to direct cooperation, purchasing data, or APIs. Despite prohibitive measures and potential legal issues outlined by LinkedIn, recent court decisions provide favorable precedents for the lawful scraping of public profiles. The paper also compiles prior research studies that leveraged LinkedIn data, highlighting various acquisition methods and their applicability to academic research. It explores strategies to ethically and legally navigate scraping, providing recommendations on how researchers can responsibly collect LinkedIn data, ensuring compliance with evolving privacy laws and ethical standards. Finally, technical considerations are discussed, emphasizing the use of tools like Selenium to overcome LinkedIn's sophisticated anti-scraping measures.

    Instituto Brasileiro de Informação em Ciência e Tecnologia (Ibict)

    Brasília, DF, Brasil
    Setor de Autarquias Sul (SAUS), Quadra 5, Lote 6, Bloco H
    70070-912
    www.ibict.br
    Rio de Janeiro, RJ, Brasil
    Programa de Pós-graduação em Ciência da Informação
    Rua Lauro Muller, 455 - 4º Andar - Botafogo
    22290-160
    www.ppgci.ufrj.br

    Contato

    Christine Alvarez

    • +55-21-3873-9454
    • liinc@ibict.br

    Liinc em Revista ISSN 1808-3536

    Liinc em Revista é licenciada sob CC BY 4.0

    Política de privacidade

    Platform and workflow by OJS/PKP

    Desenvolvido por Commscientia