Talk:Graph database#rfc 258B6C5
{{talk header}}
{{WikiProject banner shell|class=C|
{{WikiProject Databases |importance=Mid}}
{{WikiProject Software |importance=Low |computing-importance=Low}}
}}
{{COI editnotice}}
{{Broken anchors|links=
Teradata The anchor (#Aster Platform) has been deleted.
Aster
}}
{{User:ClueBot III/ArchiveThis
| age=2160
| archiveprefix=Talk:Graph database/Archive
| numberstart=1
| maxarchsize=75000
| header={{Automatic archive navigator}}
| minkeepthreads=5
| minarchthreads=2
| format= %%i
}}
The [[OpenCog]] AtomSpace
{{edit COI|ans=yes}}
Please add the following entry to the list (see table below). The OpenCog [https://wiki.opencog.org/w/AtomSpace AtomSpace] is one of the early graph databases, dating back to 2005 or earlier. It is described in multiple books by Ben Goertzel, including "The Hidden Pattern", "The Structure of Intelligence", "Engineering General Intelligence", etc.
{| class="wikitable sortable"
! Name !! Version !! License !! Language !! Description
|-
| AtomSpace || 5.0.3 (Dec 2018) || AGPL || C++, Python, Scheme || Knowledge representation store, for logical inferencing, term rewriting, learning and other generic AI algorithms.{{cite | last = Goertzel | first = Ben | date = 1993 | title = The Structure of Intelligence| publisher=Springer-Verlag|isbn = 978-0-387-94004-5 |doi=10.1007/978-1-4612-4336-6 }}{{cite | last = Goertzel | first = Ben | date = 1993 | title = The Evolving Mind| publisher=Gordon and Breach |isbn = 978-2881245879| doi = 10.5860/choice.31-5976}}
|}
I think it's notable as one of the earliest graph databases to be heavily used in AI research; it has a pile of features and functions that none of the other listed databases have discovered yet (although it's clear that some of these, e.g. Gremlin, are starting to move in that direction, as are some of the newest Google and Amazon products. Thundering herd effect.) 67.198.37.16 (talk) 21:44, 23 December 2018 (UTC)
=Reply 25-DEC-2018=
{{border | Clarification needed |display=table |width=1px |style=double |style2=dotted |color=black |lh=1}}
- The required disclosure has not been made by the COI editor.
- The proposal is unreferenced.{{efn|References for the proposed claim have not been formatted according to the citation style used by the article.}}
When ready to proceed with this information, please change the {{tlx|request edit}} template's answer parameter from {{para|ans|yes}} to {{para|ans|no}}.
Regards, Spintendo 23:50, 25 December 2018 (UTC)
{{notelist-talk}}
I'm not sure I get it. For the COI, it says to fill in the following template: {{Connected contributor|User1=User talk:67.198.37.16 |U1-declared=yes| U1-otherlinks=To the best of my recollection, I have never actually edited this particular article. I have made about 35 thousand edits to Wikipedia over the last 20 years. These are primarily on Mathematics and Physics; see User talk:67.198.37.16 for a list of articles I have made major contributions to. I have published journal articles and papers on many/most of the topics for which I edit Wikipedia. Sometimes I give public talks on these topics. A partial publication list can be found at, for example, https://linas.org/math . The requested listing above is for the OpenCog AtomSpace, to which I am a primary contributor; see for example, https://github.com/opencog and also https://github.com/linas . I have also worked at several corporations that have used the OpenCog AtomSpace in their products, including Northrop Grumman, Hanson Robotics, MOZI and Poulin Holdings. I have been involved with Huawei during their product evaluation of the AtomSpace. I count the inventor of the AtomSpace, Ben Goertzel, as a personal friend.}}
{{reflist-talk}}
Properties section
The Properties section seems to be quite a jumble of disparate material. The "Storage" and "Index-free" subsections are about lower-level details, whereas the "Graph types" subsection appears instead to be about what kind of information may be found in the database. There is also a strong business slant (as opposed to, say, a scientific one) in the choice of categories, so presumably the cited categorisation had a different scope than this article. 37.2.117.154 (talk) 09:10, 30 March 2023 (UTC)
: Made "Graph types" subsection a section of its own, and renamed it. 130.243.94.123 (talk) 14:04, 13 April 2023 (UTC)
Comparison with relational databases
The example of searching for friends of Jack appears skewed to me. The graph database examples start by finding the person(s) named Jack and then find the friends of these, but the SQL example first builds a table of all pairs of friends, and only at the end filters out those where the first party isn't named Jack (although query optimisation may change the order of operations). I think something like
SELECT p2.person_name FROM
(SELECT * FROM people WHERE person_name = 'Jack') AS p1
JOIN friend ON (p1.person_id = friend.person_id)
JOIN people AS p2 ON (p2.person_id = friend.friend_id)
;
might make for a fairer comparison (but I'm not confident enough with my SQL that I trust I got the syntax right).
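One way to check a filter-first query of this shape is to run it against a throwaway database. The following is only a sketch: it assumes SQLite (through Python's standard sqlite3 module), reuses the table and column names from the query above, and uses made-up sample rows. The subquery alias is written after the closing parenthesis, which is where the outer query can see it. The ORDER BY is added only to make the result order predictable.

```python
import sqlite3


def make_sample_db():
    """Build an in-memory database with made-up people and friendships."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE people (person_id INTEGER PRIMARY KEY, person_name TEXT);
        CREATE TABLE friend (person_id INTEGER, friend_id INTEGER);
        INSERT INTO people VALUES (1, 'Jack'), (2, 'Jill'), (3, 'Bob'), (4, 'Ann');
        INSERT INTO friend VALUES (1, 2), (1, 3), (2, 4);
    """)
    return conn


def friends_of(conn, name):
    """Select the people called `name` first, then join to their friends."""
    query = """
        SELECT p2.person_name
        FROM (SELECT * FROM people WHERE person_name = ?) AS p1
        JOIN friend ON p1.person_id = friend.person_id
        JOIN people AS p2 ON p2.person_id = friend.friend_id
        ORDER BY p2.person_name
    """
    return [row[0] for row in conn.execute(query, (name,))]


conn = make_sample_db()
print(friends_of(conn, "Jack"))  # ['Bob', 'Jill']
```

Whether the database actually evaluates the filter before the joins is still up to its query planner, so this shows only that the query is valid and returns the expected rows, not how it is executed.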
Also, a discussion in a general article such as this one should probably spell out how one would even store graph-like data in a relational database, before getting into pros and cons. Assuming triples, I suppose two approaches would be:
- Make one big table with columns subject, predicate, object.
- Make a separate table for each predicate, with columns subject, object.
The second approach would require changing the schema whenever a new predicate is added, which could be considered a big deal. The first suggests that a naive cost estimate for any basic operation would depend on the total number of edges in the database (rather than, say, vertex degree), which is probably challenging.
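As a rough sketch of the two layouts, again assuming SQLite through Python's sqlite3 module: all table, index, and function names here are made up for illustration, and the sample triples are invented. The index on the single table is an extra assumption not mentioned above; without it, SQLite would answer the lookup by scanning every edge.

```python
import sqlite3


def create_single_table(conn):
    # Layout 1: every edge of the graph is one row in a single table.
    conn.execute("CREATE TABLE triples (subject TEXT, predicate TEXT, object TEXT)")
    # Added assumption: an index, so that a lookup need not scan every edge.
    conn.execute("CREATE INDEX triples_sp ON triples (subject, predicate)")


def add_single(conn, subject, predicate, obj):
    conn.execute("INSERT INTO triples VALUES (?, ?, ?)", (subject, predicate, obj))


def objects_single(conn, subject, predicate):
    rows = conn.execute(
        "SELECT object FROM triples WHERE subject = ? AND predicate = ? ORDER BY object",
        (subject, predicate))
    return [row[0] for row in rows]


def table_for(predicate):
    # Layout 2 puts the predicate into a table name, so it must be a safe identifier.
    if not predicate.isidentifier():
        raise ValueError("predicate is not usable as a table name: " + predicate)
    return "pred_" + predicate


def add_per_predicate(conn, subject, predicate, obj):
    table = table_for(predicate)
    # A predicate seen for the first time forces a schema change.
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} (subject TEXT, object TEXT)")
    conn.execute(f"INSERT INTO {table} VALUES (?, ?)", (subject, obj))


def objects_per_predicate(conn, subject, predicate):
    table = table_for(predicate)
    exists = conn.execute(
        "SELECT 1 FROM sqlite_master WHERE type = 'table' AND name = ?",
        (table,)).fetchone()
    if exists is None:
        return []  # no table yet means no edge with this predicate
    rows = conn.execute(
        f"SELECT object FROM {table} WHERE subject = ? ORDER BY object", (subject,))
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
create_single_table(conn)
for s, p, o in [("Jack", "friend", "Jill"), ("Jack", "friend", "Bob"), ("Jill", "likes", "Bob")]:
    add_single(conn, s, p, o)
    add_per_predicate(conn, s, p, o)

print(objects_single(conn, "Jack", "friend"))         # ['Bob', 'Jill']
print(objects_per_predicate(conn, "Jack", "friend"))  # ['Bob', 'Jill']
```

In the second layout the predicate cannot be passed as a bound parameter, because it is part of the table name; that is why the sketch validates it before building the statement.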
37.2.117.154 (talk) 09:46, 30 March 2023 (UTC)
:A long and detailed explanation of what you are asking about is given here: https://github.com/opencog/atomspace/raw/master/opencog/sheaf/docs/ram-cpu.pdf . It goes into all the hows and whys of how this stuff works. 67.198.37.16 (talk) 04:16, 5 May 2024 (UTC)
More list columns
The list of graph databases currently has a strong focus on these as software products — version, release date, licence, and implementation(?) languages all have separate columns — but there is no structured information about the capabilities of these systems! Two columns I'd like to suggest are:
- Graph model: The two main models appear to be property graph and RDF, but it is in many descriptions very difficult to tell which, if indeed any, of these two are supported. And then there is the possibility of minor variants, such as RDF versus RDF-star…
- Query API: A lot of the systems seem to promote their own solution here, but is that all they offer? It is probably useful if a system provides a SPARQL endpoint, even if its native model is not RDF. Some of the interfaces appear to be actual query languages (like SQL: not what you write your app in, but an actual separate language used for stating queries), whereas others are more APIs for building and using query objects.
130.243.94.123 (talk) 14:56, 13 April 2023 (UTC)
OpenCog AtomSpace?
The OpenCog AtomSpace is about 20-25 years old, and is one of the first, if not the first, graph databases to hit the market (copyright notices give 2002 as the date). Sort of disappointed to not see it mentioned in the listing here. Open source: https://github.com/opencog/atomspace . It transitioned from proprietary to open source in 2008. 67.198.37.16 (talk) 04:12, 5 May 2024 (UTC)