ISBN の抽出の続き

もう飽きてきた。

紀伊国屋書店

>>> isbn10 = document.evaluate(
      '//input[@name="ISBN"]/@value',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"4873110971"
>>> isbn13 = document.evaluate(
      'substring(substring-after(//td[span[@class="BookTitle"]]/text()[4], "ISBN:"), 1, 13)',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"9784873110974"

PORTA

>>> isbn10 = document.evaluate(
      'translate(//input[@name="isbn" and string-length(@value)=13]/@value, "-", "")',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"4873110963"
>>> isbn13 = document.evaluate(
      'translate(//input[@name="isbn" and string-length(@value)=17]/@value, "-", "")',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"9784873113296"

楽天ブックス

>>> isbn13 = document.evaluate(
      'substring-after(//span[@class="item_number"]/text(), "ISBN:")',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"9784344013926"

ライブドアブックス

>>> isbn10 = document.evaluate(
      'substring-after(//p[@class="itemData"]/text()[4], "管理コード:")',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"4872578562"
>>> isbn13 = document.evaluate(
      'substring-after(//p[@class="itemData"]/text()[3], "ISBN:")',
      document, null, XPathResult.STRING_TYPE, null).stringValue
"9784872578560"