mongodb schema design for blogs

前端 未结 1 1173
独厮守ぢ
独厮守ぢ 2020-11-30 18:43

How would you design the schema for a blog-like site with document-based databases (mongodb). The site has the following objects: User, Article, Comment. User can add Commen

相关标签:
1条回答
  • 2020-11-30 18:58
    Article {
      "_id" : "A",
      "title" : "Hello World",
      "user_id" : 12345,
      "text" : 'My test article',
    
      "comments" : [
        { 'text' : 'blah', 'user_id' : 654321, 'votes' : [987654]},
        { 'text' : 'foo', 'user_id' : 987654, 'votes' : [12345, 654321] },
        ...
      ]
    }
    

    The basic premise here is that I've nested the Comments inside of the Article. The Votes only apply to a Comment, so they've been stored as an array with each Comment. In this case, I've just stored the user_id. If you want to store more information (time_created, etc.), then you can votes an array of objects:

    ... 'votes' : [ { user_id : 987654, ts : 78946513 } ] ...
    

    How to perform your queries efficiently:

    1. get Article A, comments on Article A and # of votes per comments
    db.articles.find( { _id : 'A' } )
    

    This gets everything with one query. You may have to do some client-side logic to count votes per comment, but this is pretty trivial.

    1. get all comments by User B across all articles
    db.articles.ensureIndex( { "comments.user_id" : 1 } )
    db.articles.find( { "comments.user_id" : 987654 } ) // returns all document fields
    

    The index will allow for efficiently searching the comments within a document.

    There's currently no way to extract only the matches from a sub-array. This query will in fact return all of the articles with comments by that user. If this is potentially way too much data, you can do some trimming.

    db.articles.find( { "comments.user_id" : 987654 }, { "title" : 1, "comments.user_id" : 1 })
    
    1. get all comments User B voted for
    db.articles.ensureIndex( { "comments.votes" : 1 } )
    db.articles.find( { "comments.votes" : 987654 } )
    

    Again, this will return all of the Articles, not just the comments.

    There's a trade-off to be made here. Returning the article may seem like we're bringing back too much data. But what are you planning to display to the user when you make query #3?

    Getting a list of "comments I've voted for" is not terribly useful without the comment itself. Of course the comment is not very useful without the article itself (or at least just the title).

    Most of the time, query #3 devolves into a join from Votes to Comments to Articles. If that's the case, then why not just bring back the Articles to start with?

    0 讨论(0)
提交回复
热议问题