Paging in a Rest Collection

前端未结

关注

 12  1790

I\'m interested in exposing a direct REST interface to collections of JSON documents (think CouchDB or Persevere). The problem I\'m running into is how to handle the G


                      
              相关标签:


      
      
        
          12条回答        

        
                         				            
            
           
            
                              
                
              
              
                
                  无人及你        
                
              
                            
                2020-12-02 04:36
              
            
            
                                                                       
With the publication of rfc723x, unregistered range units do go against an explicit recommendation in the spec. Consider rfc7233 (deprecating rfc2616):

"New range units ought to be registered with IANA"  (along with a reference to a HTTP Range Unit Registry).
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  离开以前        
                
              
                            
                2020-12-02 04:37
              
            
            
                                                                       
My gut feeling is that the HTTP range extensions aren't designed for your use case, and thus you shouldn't try. A partial response implies 206, and 206 must only be sent if the client asked for it.

You may want to consider a different approach, such as the one use in Atom (where the representation by design may be partial, and is returned with a status 200, and potentially paging links). See RFC 4287 and RFC 5005.
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  天命终不由人        
                
              
                            
                2020-12-02 04:37
              
            
            
                                                                       
If there is more than one page of responses, and you don't want to offer the whole collection at once, does that mean there are multiple choices?

On a request to /db/questions, return 300 Multiple Choices with Link headers that specify how to get to each page as well as a JSON object or HTML page with a list of URLs.

Link: <>; rel="http://paged.collection.example/relation/paged"
Link: <>; rel="http://paged.collection.example/relation/paged"
...


You'd have one Link header for each page of results (an empty string means the current URL, and the URL is the same for each page, just accessed with different ranges), and the relationship is defined as a custom one per the upcoming Link spec.  This relationship would explain your custom 266, or your violation of 206.  These headers are your machine-readable version, since all of your examples require an understanding client anyway.

(If you stick with the "range" route, I believe your own 2xx return code, as you described it, would be the best behavior here.  You're expected to do this for your applications and such ["HTTP status codes are extensible."], and you have good reasons.)

300 Multiple Choices says you SHOULD also provide a body with a way for the user agent to pick.  If your client is understanding, it should use the Link headers.  If it's a user manually browsing, perhaps an HTML page with links to a special "paged" root resource that can handle rendering that particular page based on the URL?  /humanpage/1/db/questions or something hideous like that?



The comments on Richard Levasseur's post remind me of an additional option: the Accept header (section 14.1).  Back when the oEmbed spec came out, I wondered why it hadn't been done entirely using HTTP, and wrote up an alternative using them.

Keep the 300 Multiple Choices,  the Link headers and the HTML page for an initial naive HTTP GET, but rather than use ranges, have your new paging relationship define the use of the Accept header.  Your subsequent HTTP request might look like this:

GET /db/questions HTTP/1.1
Host: paged.collection.example
Accept: application/json;PagingSpec=1.0;page=1


The Accept header allows you to define an acceptable content type (your JSON return), plus extensible parameters for that type (your page number).  Riffing on my notes from my oEmbed writeup (can't link to it here, I'll list it in my profile), you could be very explicit and provide a spec/relation version here in case you need to redefine what the page parameter means in the future.
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  太阳男子        
                
              
                            
                2020-12-02 04:40
              
            
            
                                                                       
Seems to me that the best way to do this is to include range as query parameters.  e.g., GET /db/questions/?date>mindate&date<maxdate.  Upon a GET to the /db/questions/ with no query parameters, return 303 with  Location: /db/questions/?query-parameters-to-retrieve-the-default-page.  Then provide a different URL by which whomever is consuming your API to get statistics about the collection (e.g., what query parameters to use if s/he wants the entire collection);
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  攒了一身酷        
                
              
                            
                2020-12-02 04:42
              
            
            
                                                                       
One of the big problems with range headers is that a lot of corporate proxies filter them out.  I'd advise to use a query parameter instead.  
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  无人及你        
                
              
                            
                2020-12-02 04:42
              
            
            
                                                                       
While its possible to use the Range header for this purpose, I don't think that was the intent.  It seems to have been designed for handling flaky connections as well as limiting the data (so the client can request part of the request if something was missing or the size was too large to process).   You are hacking pagination into something that is likely used for other purposes at the communication layer.
The "proper" way to handle pagination is with the types you return.   Rather than returning questions object, you should be returning a new type instead.  

So if questions is like this:

<questions>
  <question index=1></question>
  <question index=2></question>
  ...
</questions>

The new type could be something like this:

<questionPage>
  <startIndex>50</startIndex>
  <returnedCount>10</returnedCount>
  <totalCount>1203</totalCount>
  <questions>
    <question index=50></question>
    <question index=51></question>
    ..
  </questions>
<questionPage>

Of course you control your media types, so you can make your "pages" a format that suits your needs.  If you make is something generic, you can have a single parser on the client to handle paging the same for all types.  I think that is more in the spirit of the HTTP specification, rather than fudging the Range parameter for something else.
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
   
          
     上一页
1
2
           
           
        
                                  
        
        
          
            
            
              
              
            
    


                                 
              
            
                          
    

        
         
                验证码
                
                  
                
                
                   看不清?
                
              
                                  
                    
   
                 
             
              提交回复