70% of GitHub is duplicate code – Study

24 November 2017

A new study has found that around 70% of the code on GitHub is duplicated, The Register reported.

The researchers originally set out to measure how much files change between different clones, but they found such a high rate of file-level duplication that they changed direction.

The research, conducted by an international team of eight researchers and led by the University of California, Irvine, ultimately found that out of 428 million files on GitHub, only 85 million are unique.
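
To illustrate what file-level duplication means, the following is a minimal Python sketch that treats two files as duplicates when their contents are byte-for-byte identical, using a SHA-256 hash as the fingerprint. It is not the study's method, which is not described here. The function names (group_by_content, duplicate_share) and the tiny in-memory corpus are hypothetical and exist only for illustration.

```python
import hashlib
from collections import defaultdict


def group_by_content(files):
    """Group file names by the SHA-256 hash of their contents.

    `files` maps a file name to its raw bytes. Files that land in the
    same group are byte-for-byte identical.
    """
    groups = defaultdict(list)
    for name, content in files.items():
        digest = hashlib.sha256(content).hexdigest()
        groups[digest].append(name)
    return groups


def duplicate_share(files):
    """Return the fraction of files that are copies of another file."""
    if not files:
        return 0.0
    unique = len(group_by_content(files))
    return 1 - unique / len(files)


# Hypothetical corpus: three identical copies of one file plus two others.
corpus = {
    "repo_a/util.js": b"function add(a, b) { return a + b; }\n",
    "repo_b/util.js": b"function add(a, b) { return a + b; }\n",
    "repo_c/lib/util.js": b"function add(a, b) { return a + b; }\n",
    "repo_a/main.js": b"console.log(add(1, 2));\n",
    "repo_b/index.js": b"console.log('hello');\n",
}

print(f"Unique files: {len(group_by_content(corpus))} of {len(corpus)}")
print(f"Share of files that are copies: {duplicate_share(corpus):.0%}")
```

Exact hashing only catches identical copies. A file that differs by a single space or comment counts as unique under this sketch, so it gives a lower bound on how much code is shared.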

The report stated that these findings have significant implications for research that relies on data from GitHub, as such research would need to take this duplication into account.

