Sourcegraph
{{Short description|Code intelligence platform}}{{Notability|Companies|date=January 2023}}{{Infobox company
| name = Sourcegraph
| logo = Sourcegraph-logo-light.svg
| logo_upright =
| logo_alt =
| logo_caption =
| type = Private
| industry = Information technology
| predecessor =
| founded = 2013
| founder = Quinn Slack and Beyang Liu
| defunct =
| fate =
| successor =
| hq_location_city =
| hq_location_country =
| area_served =
| key_people =
| products = Cody, Code Search
| owner =
| num_employees =
| num_employees_year =
| parent =
| website = {{URL|https://about.sourcegraph.com}}
}}
Sourcegraph Inc. is a company developing code search and code intelligence tool that semantically indexes and analyzes large codebases so that they can be searched across commercial, open-source, local, and cloud-based repositories.{{Cite web |last=Hoyt |first=Ben |date=2020-08-17 |title=Searching code with Sourcegraph |url=https://lwn.net/Articles/828748/ |access-date=2022-10-03 |website=LWN.net}}
The company has two products available: Cody and Code Search. Code Search was initially released in 2013 under the name Sourcegraph, but was rebranded to Code Search when the company unveiled Cody in 2023. Both products support all major programming languages.{{Cite web |url=https://about.sourcegraph.com/blog/sourcegraph-3.0 |title=Announcing Sourcegraph 3.0 |last=Slack |first=Quinn |date=2019-02-08 |website=Sourcegraph official website |access-date=2022-11-18}}
History
Sourcegraph Inc. was founded in by Stanford graduates Quinn Slack and Beyang Liu to drive the development of a code search and code intelligence tool, formerly called Sourcegraph. It was first released in 2013{{Cite web |last=Sawers |first=Paul |date=2020-03-03 |title=Sourcegraph raises $23 million to bring universal code search to all developers |url=https://venturebeat.com/business/sourcegraph-raises-23-million-to-bring-universal-code-search-to-all-developers/ |access-date=2022-11-18 |website=VentureBeat}}{{Cite web |last=Salter |first=Jim |date=2020-10-01 |title=Sourcegraph: Devs are managing 100x more code now than they did in 2010 |url=https://arstechnica.com/gadgets/2020/10/sourcegraph-devs-are-managing-100x-more-code-now-than-they-did-in-2010/ |access-date=2022-11-18 |website=Ars Technica}} but was rebranded to Code Search in 2023. It was partly inspired by Liu's experience using Google Code Search while he was a Google intern,{{Cite podcast |url=https://changelog.com/podcast/217 |title=Sourcegraph the 'Google for Code' |website=Changelog |host=Adam Stacoviak |date=2016-08-16 |access-date=2022-11-21 }} It was designed to "tackle the big code problem" by enabling developers to manage large codebases that span multiple repositories, programming languages, file formats, and projects.{{Cite web |last=Sawers |first=Paul |date=2020-12-03 |title=Sourcegraph raises $50 million to tackle 'big code' problems with universal search |url=https://venturebeat.com/business/sourcegraph-raises-50-million-to-tackle-big-code-problems-with-universal-search/ |access-date=2022-11-21 |website=VentureBeat}}
Code Search was initially self-hosted by each customer on their own infrastructure.{{Cite web |last=Sawers |first=Paul |date=2021-08-19 |title=Sourcegraph plans to index the entire open source web |url=https://venturebeat.com/business/sourcegraph-plans-to-index-the-entire-open-source-web/ |access-date=2022-10-03 |website=VentureBeat |language=en-US}} Early customers included Uber, Dropbox, and Lyft.{{Cite web |url=https://about.sourcegraph.com/blog/enterprise-cloud |title=Sourcegraph Cloud: secure, scalable, dedicated instances for enterprises |last=Slack |first=Quinn |date=2022-09-27 |access-date=2022-11-21 |website=Sourcegraph Blog}} In 2016, Code Search was criticized{{Cite web |url=https://www.techrepublic.com/article/fair-source-licensing-is-the-worst-thing-to-happen-to-open-source-definitely-maybe/ |title=Fair Source licensing is the worst thing to happen to open source-definitely maybe |last=Asay |first=Matt |date=2016-04-01 |access-date=2022-11-21 |website=TechRepublic|archive-url=https://web.archive.org/web/20211107170119/https://www.techrepublic.com/article/fair-source-licensing-is-the-worst-thing-to-happen-to-open-source-definitely-maybe/ |archive-date=2021-11-07 }} for being provided with a Fair Source License with the developers explaining{{cite report |first=Nadia |last=Eghbal |date=2016 |title=Roads and bridges. The Unseen labor behind our digital infrastructure |url=http://brochures.sisalp.fr/roads-and-bridges-the-unseen-labor-behind-our-digital-infrastructure.pdf |pages=94–95 |access-date=2022-12-14}}{{Cite web |title=Fair Source License |url=https://fair.io/?a |access-date=2022-11-21 |website=Fair Source License official website}} that "all of Sourcegraph's source code is publicly available and hackable"{{Cite web |title=The Sourcegraph developer release: A better way to discover and understand code |url=https://about.sourcegraph.com/blog/the-sourcegraph-developer-release-a-better-way-to-discover-and-understand-code |date=2016-05-30 |access-date=2022-11-21 |website=Sourcegraph Blog |language=en}} and was intended to "help open sourcers strike a balance between getting paid and preserving their values".{{Cite magazine |url=https://www.wired.com/2016/03/former-open-sourcers-ask-companies-pay-fair-share/ |title=One Startup's Heretical Plan to Turn Open Source Code Into Cash |last=Finley |first=Klint |date=2016-03-29 |access-date=2022-11-21 |magazine=Wired}} In 2018, Code Search was licensed under the Apache License 2.0,{{Cite web |last=Schmidt |first=Julia |date=2018-10-02 |title=Sourcegraph pulls back the curtain, becomes open source project |url=https://devclass.com/2018/10/02/sourcegraph-becomes-open-source-project/ |access-date=2022-11-21 |website=DevClass}}{{Cite podcast |url=https://futureofcoding.org/episodes/032.html |title=Basic Developer Human Rights: Quinn Slack |website=Future of Coding |host=Steve Krouse |date=2019-10-24 |access-date=2022-11-21}} and Sourcegraph OSS has since been released under the Apache License 2.0. The commercial version, Code Search Enterprise, has been released under its own license.{{Cite web |title=Licensing |url=https://handbook.sourcegraph.com/departments/engineering/product/process/gtm/licensing/#sts=Talking%20about%20license,%20plans,%20users |access-date=2022-11-21 |website=Sourcegraph Handbook}} In 2023, Code Search was criticized{{Cite web |title=Seriously, don't sign a CLA |url=https://drewdevault.com/2023/07/04/Dont-sign-a-CLA-2.html |access-date=2023-07-05 |website=drewdevault.com}} for dropping the Apache license for most of its code, leaving it public but only available under its Enterprise license.{{Cite web |title=relicense all paths other than MIT licensed code, client/cody*, jetbr… · sourcegraph/sourcegraph@3cd931e |url=https://github.com/sourcegraph/sourcegraph/commit/3cd931ef54407c966fc3a5940a06f0b95a7aadd0 |access-date=2023-07-05 |website=GitHub |language=en}}
In 2019, Code Search was integrated into the GitLab codebase, giving GitLab users access to a browser-based developer platform.{{Cite web |title=Native code intelligence is coming to GitLab |url=https://about.gitlab.com/blog/2019/11/12/sourcegraph-code-intelligence-integration-for-gitlab/ |access-date=2022-10-03 |website=GitLab |language=en}} In 2021, a browser-based portal became available, allowing users to browse open-source projects and personal private code for free.
In 2022, Sourcegraph Cloud, a commercial single-tenant cloud solution for organizations with more than 100 developers, was launched.{{cite web |last=Slack |first=Quinn |date=2022-08-27 |title=Sourcegraph Cloud: secure, scalable, dedicated instances for enterprises |url=https://about.sourcegraph.com/blog/enterprise-cloud |access-date=2022-12-05 |website=Sourcegraph}}
Sourcegraph has raised a total of almost $225 million in financing to date. Its most recent $125 million Series D investment in 2021 valued the company at $2.625 billion, a 300% growth from its previous valuation in 2020.{{Cite web |last=Miller |first=Ron |date=2021-07-13 |title=Sourcegraph raises $125M Series D on $2.6B valuation for universal code search tool |url=https://techcrunch.com/2021/07/13/sourcegraph-raises-125m-series-d-on-2-6b-valuation-for-universal-code-search-tool/ |access-date=2022-10-03 |website=TechCrunch |language=en-US}}
class="wikitable"
! Date ! Funding Type ! Money Raised (USD) ! No. of Investors ! Lead Investor |
July 2021
| Series D round | 4 |
December 2020
| Series C round | 1 |
July 2020
| Series B round | 1 | Felicis Ventures |
March 2020
| Series B round | 3 |
October 2017
| Series A round | 3 | Goldcrest Capital, Redpoint |
In 2023 Sourcegraph Inc. unveiled their new product Cody, and rebranded Sourcegraph to Code Search.
Products
The company has two major products: Cody and Code Search.
= Sourcegraph Cody =
Cody is a free and open-source AI coding assistant that can help users write, fix, and maintain your code. It works by understanding an entire codebase and using that knowledge to provide context-aware assistance. including code generation, debugging, commenting, documentation, explaining, and answering questions regarding the code. Cody is available for Microsoft Visual Studio Code and most JetBrains IDEs.
= Sourcegraph Code Search =
{{Infobox software
| name = Code Search
| logo = Sourcegraph-logo-light.svg
| logo_size = 250px
| developer = Sourcegraph, Inc.
| released = 2013
| latest release version = 5.2.0
| latest release date = 3 October 2023
| repo = https://github.com/sourcegraph/sourcegraph
| programming language = Go, TypeScript
| operating system = Microsoft Windows, Linux, MacOS
| genre = Software intelligence
| license = Proprietary
| website = {{URL|https://about.sourcegraph.com}}
}}
Sourcegraph's "universal code search" tool is used to search, explore, and understand code.{{Cite web |last=Liu |first=Beyang |date=2020-01-15 |title=Sourcegraph: Universal code search and intelligence |url=https://www.infoworld.com/article/3514213/sourcegraph-universal-code-search-and-intelligence.html |website=InfoWorld |access-date=2022-12-05}} supports over 30 programming languages and integrates with GitHub and GitLab for code hosting, Codecov for code coverage, and Jira Software for project management.{{Cite web |date=2020-04-03 |title=Q&A: Sourcegraph's Universal Code Search Tool |url=https://spectrum.ieee.org/sourcegraph-universal-code-search-tool |access-date=2022-10-03 |website=IEEE Spectrum |language=en}} Code Search can be implemented across multiple repositories and code hosting platforms. Searches can be literal, regular expression, or structural. The structural search syntax is language-aware and handles nested expressions and multi-line statements better than regular expressions. Sourcegraph's Code Search uses a variant of Google's PageRank algorithm to rank results by relevance.{{Cite web |last=Yegge |first=Steve |date=2022-11-08 |title=Rethinking search results ranking on Sourcegraph.com |url=https://about.sourcegraph.com/blog/new-search-ranking |website=Sourcegraph |access-date=2022-12-06}} Code Search can be used to search and analyze all of an organization's code. During search indexing, the platform builds a global reference graph, that maps an entire codebase and enables functionality such as "go to definition".{{Cite web |last=Ramji |first=Sam |date=2022-12-22 |title=Google That Code: How Sourcegraph Simplifies Development |url=https://thenewstack.io/google-that-code-how-sourcegraph-simplifies-development/ |access-date=2023-02-20 |website=The New Stack}} Features include:
- Search: Code can be searched and navigated through the Sourcegraph web interface or through browser and IDE extensions and text editor plugins.
- Navigation: jumps to the definition of a variable or function, or find all references to it in a codebase.
- Batch Changes: Enables developers and companies to automate and track large-scale code refactoring, security fixes, and migrations across repositories and code hosts.{{Cite web |date=2021-03-24 |title=Sourcegraph Batch Changes Offers Automation for Large-Scale Code Changes |url=https://www.dbta.com/Editorial/News-Flashes/Sourcegraph-Batch-Changes-Offers-Automation-for-Large-Scale-Code-Changes-145930.aspx |access-date=2023-02-08 |website=Database Trends and Applications |language=en-US}}
- Code Insights: Extracts data from a codebase to provide detailed analytics and visualizations to track the health and progress of a code project.{{Cite web |last=Sawers |first=Paul |date=2022-03-10 |title=With Code Insights, Sourcegraph gives developers a better understanding of their codebase |url=https://venturebeat.com/dev/with-code-insights-sourcegraph-gives-developers-a-better-understanding-of-their-codebase/ |access-date=2022-10-03 |website=VentureBeat |language=en-US}}
Code search has received adoption by such various sectors as
- Research: Code search has been used to develop data mining methods for downstream dependencies{{Citation |last1=Thiede |first1=Christoph |title=17th International Conference on Evaluation of Novel Approaches to Software Engineering |year=2022 |contribution=Augmenting Library Development by Mining Usage Data from Downstream Dependencies |contribution-url=https://www.scitepress.org/PublicationsDetail.aspx?ID=cXF3UOIDuTY=&t=1 |publisher=ENASE |last2=Limberger |first2=Daniel |last3=Scheibel |first3=Willy |last4=Döllner |first4=Jürgen}} and to assist in refactoring and translating a program into its equivalent in another programming language.{{Cite thesis |type=MSc |last=Haavisto |first=Juuso |date=2020 |title=Leveraging APL and SPIR-V languages to write network functions to be deployed on Vulkan compatible GPUs |publisher=Université de Lorraine |url=https://hal.inria.fr/hal-03155647/document |access-date=2022-12-15}}
- Physics: Code search is used in the CERN Accelerator Control software community to index, quickly search, and generate statistics on code.{{cite journal |last1=Voirin |first1=R. |last2=Vanden Eynden |first2=M. |last3=Oulevey |first3=T. |date=2022 |title=The State of Containerization in CERN Accelerator Controls |url=https://cds.cern.ch/record/2809570 |journal=JACoW |volume=ICALEPCS |issue=2021 |pages=829–834 |doi=10.18429/JACoW-ICALEPCS2021-THBL03 |access-date=2022-12-15}}
- Cybersecurity: Code search has been used to gain better insight into source code during penetration testing.{{cite book |last=Rehberger |first=Johann |title=Cybersecurity Attacks – Red Team Strategies: A practical guide to building a penetration testing program having homefield advantage |publisher=Packt Publishing Ltd |year=2020 |isbn=9781838825508 |pages=216–224}}
As of July 2021, Code search customers include Adidas, Lyft, Uber, Yelp, Plaid, GE, Atlassian,{{Cite web |last=Miller |first=Ron |date=2021-07-13 |title=Sourcegraph raises $125M Series D on $2.6B valuation for universal code search tool |url=https://techcrunch.com/2021/07/13/sourcegraph-raises-125m-series-d-on-2-6b-valuation-for-universal-code-search-tool/ |access-date=2022-10-03 |website=TechCrunch |language=en-US}} Amazon, PayPal, Qualtrics, and Cloudflare.
See also
{{Div col|colwidth=30em|content=
}}