Many-to-many relationship RavenDb: Document structure and index

Question

How do you build a NoSQL model and index (preferably for RavenDB v4) for the following relational schema?

The document type is Contact, where each record can have multiple additional properties (the type of each property is defined in CustomField and its value in ContactCustomField).

I need to filter and sort on the highlighted fields in one query (all fields from Contact plus the custom fields).

Possible options as I see them:

Option #1

Naturally, I'd imagine the following persistent models:

public class Contact
{
    public string Id      { get; set; }
    public string Name    { get; set; }
    public string Address { get; set; }
    public string Phone   { get; set; }
    // Where the key is CustomField.Id and the value is ContactCustomField.Value
    public Dictionary<string, string> CustomValues { get; set; }
}

public class CustomField
{
    public string Id          { get; set; }
    public string Code        { get; set; }
    public string DataType    { get; set; }
    public string Description { get; set; }
}

However, building an index for a query like the one below (sorry for the mixed syntax) puzzles me:

SELECT Name, Address, Phone, CustomValues
FROM Contact
WHERE Name LIKE '*John*' AND CustomValues.Any(v => v.Key == "11" && v.Value == "student")

Option #2

Another approach would be to keep a normalised structure (as shown in the picture above). That would work: I'd just have to include ContactCustomField in the query for Contact.

The downside is that it would not use the benefits of NoSQL.

Alex Klaus · Accepted Answer

Updated answer (29 June 2018)

The key to success is an undervalued RavenDB feature: indexes with dynamic fields. It lets you keep the logical data structure and avoids creating a fanout index.

To use it, model the collections as described in option #1 above:

public class Contact
{
    public string Id      { get; set; }
    public string Name    { get; set; }
    public string Address { get; set; }
    public string Phone   { get; set; }
    public Dictionary<string, object> CustomFields { get; set; }
}

public class CustomField
{
    public string Id          { get; set; }
    public string Code        { get; set; }
    public string DataType    { get; set; }
    public string Description { get; set; }
}

where Contact.CustomFields.Key is a reference to CustomField.Id and Contact.CustomFields.Value stores the value for that custom field.

In order to filter/search on the custom fields, we need the following index:

public class MyIndex : AbstractIndexCreationTask<Contact>
{
    public MyIndex()
    {
        Map = contacts =>
            from e in contacts
            select new
            {
                // One dynamic index field per dictionary entry, named "CustomFields_<key>"
                _ = e.CustomFields.Select(x => CreateField($"{nameof(Contact.CustomFields)}_{x.Key}", x.Value))
            };
    }
}

That index covers all key-value pairs of the dictionary as if they were ordinary properties of Contact.
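To make the naming concrete, here is a minimal sketch in plain C# (no RavenDB dependency) of the field names that the CreateField call in the index above would produce for one contact. The DynamicFieldNames class and its Flatten method are hypothetical helpers written for illustration only; they are not part of the RavenDB API, and the Dictionary<string, object> value type is an assumption.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class DynamicFieldNames
{
    // Hypothetical helper: mirrors the "<dictionary name>_<key>" convention
    // used in the index's CreateField call. Not a RavenDB API.
    public static Dictionary<string, object> Flatten(
        string dictionaryName, IDictionary<string, object> values)
    {
        return values.ToDictionary(
            kv => $"{dictionaryName}_{kv.Key}",
            kv => kv.Value);
    }
}

public static class Program
{
    public static void Main()
    {
        // Sample custom fields of a single contact (keys are CustomField ids or codes).
        var customFields = new Dictionary<string, object>
        {
            ["Age"] = 4,
            ["11"] = "student"
        };

        // Prints one "field name = value" line per dictionary entry.
        foreach (var entry in DynamicFieldNames.Flatten("CustomFields", customFields))
            Console.WriteLine($"{entry.Key} = {entry.Value}");
    }
}
```

The resulting names, CustomFields_Age and CustomFields_11, are the ones a query against the index has to use.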

Gotcha

There is a big gotcha if you write queries in C# using the usual Query object (IRavenQueryable type) rather than RQL or DocumentQuery. It lies in how the dynamic fields are named: the name must be a compound in a specific format, dictionary_name + underscore + key_name. Only then can we build queries like

var q = s.Query<Contact, MyIndex>()
                .Where(p => p.CustomFields["Age"].Equals(4));

Which under the hood gets converted into RQL:

from index 'MyIndex' where CustomFields_Age = $p1

This behaviour is undocumented; my discussion with Oren Eini (aka Ayende Rahien) covers the subject in more detail.

P.S. My general recommendation would be to interact with Raven via DocumentQuery rather than the usual Query, as the LINQ integration is still quite weak and devs may keep stumbling upon bugs here and there.
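A minimal sketch of the DocumentQuery route, assuming the RavenDB v4 client package and the Contact model and MyIndex from this answer (with Dictionary<string, object> as the assumed value type). ContactQueries and FindByCustomField are hypothetical names introduced for illustration; running the query also requires a live server with the index deployed.

```csharp
using System.Collections.Generic;
using System.Linq;
using Raven.Client.Documents.Indexes;
using Raven.Client.Documents.Session;

public class Contact
{
    public string Id { get; set; }
    public string Name { get; set; }
    // Assumption: values are stored as object so numbers and strings can coexist.
    public Dictionary<string, object> CustomFields { get; set; }
}

public class MyIndex : AbstractIndexCreationTask<Contact>
{
    public MyIndex()
    {
        Map = contacts =>
            from e in contacts
            select new
            {
                // Same dynamic-field naming as in the answer: "CustomFields_<key>"
                _ = e.CustomFields.Select(x => CreateField($"{nameof(Contact.CustomFields)}_{x.Key}", x.Value))
            };
    }
}

public static class ContactQueries
{
    // Hypothetical helper: filters contacts on one custom field by building
    // the index field name explicitly instead of relying on LINQ translation.
    public static List<Contact> FindByCustomField(
        IDocumentSession session, string key, object value)
    {
        return session.Advanced
            .DocumentQuery<Contact, MyIndex>()
            .WhereEquals($"{nameof(Contact.CustomFields)}_{key}", value)
            .ToList();
    }
}
```

Calling FindByCustomField(session, "11", "student") should correspond to the custom-field part of the query in the question. Because the field name is spelled out as a string, the naming convention is visible in the code rather than hidden in the LINQ provider. Filtering on Name through the same index would presumably also require adding Name to the index map.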

Initial answer (9 June 2018)

As suggested by Oren Eini (aka Ayende Rahien), the way to go is option #2: including a separate ContactCustomField collection in the queries.

So, in spite of using a NoSQL database, a relational approach is the only way to go here.
