Automatic Wrapper Adaptation by Tree Edit Distance Matching

  • paiqiu
  • 20210803

Document pages: 14 pages

Abstract: The amount of information distributed through the Web keeps growing faster day by day, and for this reason several techniques for extracting Web data have been suggested in recent years. Extraction tasks are often performed by so-called wrappers, procedures that extract information from Web pages, e.g. by implementing logic-based techniques. Many fields of application today require highly robust wrappers, so as not to compromise information assets or the reliability of the extracted data. Unfortunately, a wrapper may fail to extract data from a Web page if the page's structure changes, sometimes even slightly, which calls for new techniques that are triggered automatically on failure to adapt the wrapper to the new structure of the page. In this work we present a novel approach to automatic wrapper adaptation based on measuring the similarity of trees through improved tree edit distance matching techniques.
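
To make the idea of measuring tree similarity concrete, the following is a minimal sketch of the classic Simple Tree Matching algorithm applied to two small DOM-like trees. It is an assumption chosen for illustration, not the improved technique the abstract refers to, which is not described here. The names `Node`, `simple_tree_matching`, and `similarity` are hypothetical, and the normalization used in `similarity` is only one plausible choice.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical DOM-like node: a tag name plus ordered children."""
    tag: str
    children: List["Node"] = field(default_factory=list)


def size(node: Node) -> int:
    """Number of nodes in the subtree rooted at `node`."""
    return 1 + sum(size(child) for child in node.children)


def simple_tree_matching(a: Node, b: Node) -> int:
    """Size of the maximum order-preserving matching between two trees.

    Classic Simple Tree Matching: roots must carry the same tag, and the
    children are aligned with a dynamic program similar to the one used
    for the longest common subsequence.
    """
    if a.tag != b.tag:
        return 0
    m, n = len(a.children), len(b.children)
    # table[i][j]: best matching between the first i children of `a`
    # and the first j children of `b`.
    table = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            paired = table[i - 1][j - 1] + simple_tree_matching(
                a.children[i - 1], b.children[j - 1]
            )
            table[i][j] = max(table[i][j - 1], table[i - 1][j], paired)
    return table[m][n] + 1  # +1 counts the matched roots


def similarity(a: Node, b: Node) -> float:
    """Matching size normalized to [0, 1] (assumed normalization)."""
    return 2 * simple_tree_matching(a, b) / (size(a) + size(b))


# Page structure the wrapper was built against.
old_page = Node("html", [Node("body", [
    Node("div", [Node("p")]),
    Node("div", [Node("span")]),
])])

# Same page after a small change: a new <ul> appears between the two <div>s.
new_page = Node("html", [Node("body", [
    Node("div", [Node("p")]),
    Node("ul"),
    Node("div", [Node("span")]),
])])

score = similarity(old_page, new_page)
```

In a wrapper adaptation setting, a score like this could be compared against a threshold to decide whether a subtree of the changed page still corresponds to the element the wrapper originally targeted. The threshold and the decision policy are left open here, since the abstract does not specify them.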
