XSD restriction that negates a matching string

前端 未结 2 1861
小鲜肉
小鲜肉 2021-01-23 22:33

I want my XSD to validate the contents of a string. To be specific, I want to validate that a certain string does not occur.

Consider this rule, which

相关标签:
2条回答
  • 2021-01-23 22:52

    You don't mention whether you are bound to XML Schema 1.0 and XPath 1.0, but if not it is possible to accomplish your goal with xs:assert's, along the lines of this (which may need some work - this is from memory...):

    <xs:element name="Link" type="xs:normalizedString" minOccurs="0">
      <xs:simpleType>
        <xs:restriction base="xs:token">
          <xs:assert test="not( fn:starts-with( $value , '/site/example.com' ) )" />
        </xs:restriction>
      </xs:simpleType>
    </xs:element>  
    

    Some links of possible interest:

    http://www.ibm.com/developerworks/library/x-xml11pt2/

    http://www.w3.org/TR/xpath-functions/#func-starts-with

    Cheers,

    0 讨论(0)
  • 2021-01-23 22:53

    This is simpler to do in XSD 1.1, where you can use assertions to ensure that the value does not begin with the string you specify. But conceptually, it's simple enough even in XSD 1.0 and simple regular expressions: you want to ensure that the string does not begin with "/site/example.com". If it did begin that way, you'd have a logical conjunction of a series of facts about the string:

    • substring(., 1, 1) = '/'
    • substring(., 2, 1) = 's'
    • substring(., 3, 1) = 'i'
    • ...
    • substring(. 17, 1) = 'm'

    You want to negate this conjunction of facts. Now, by De Morgan's Laws, ~(a and b and ... and z) is equivalent to (~a or ~b or ... or ~z). So you can do what you need by writing a disjunction of the following terms:

        [^/].*
        |.([^s].*)?
        |.{2}([^i].*)?
        |.{3}([^t].*)?
        |.{4}([^e].*)?
        |.{5}([^/].*)?
        |.{6}([^e].*)?
        |.{7}([^x].*)?
        |.{8}([^a].*)?
        |.{9}([^m].*)?
        |.{10}([^p].*)?
        |.{11}([^l].*)?
        |.{12}([^e].*)?
        |.{13}([^\.].*)?
        |.{14}([^c].*)?
        |.{15}([^o].*)?
        |.{16}([^m].*)?
    

    In each term above the subexpression of the form [^s].* has been wrapped in (...)? -- the term .{2}([^i].*)? means any string beginning with two characters is OK if the third character is not an i or if there is no third character at all. This ensures that strings shorter than 17 characters in length are not excluded, even if they happen to be prefixes of the forbidden string.

    Of course, to use this in an XSD schema document, you will need to remove all the whitespace, which makes the regex harder to read.

    [Addition, June 2016] See also this related and more general question.

    0 讨论(0)
提交回复
热议问题