KRISTAL Home (Korean) English   
KRISTAL Ȩ | °Ô½ÃÆÇ | K-Lab | ÀÚ·á½Ç | ¿¬¶ôó | KISTI ¼­ºñ½º Q&A | Semantic KRISTAL | »ç¿ëÀÚ ¸¸Á·µµ ¼³¹®Á¶»ç
KRISTAL IRMS
Ȱ¿ë»çÀÌÆ®
¶óÀ̼¾½º
°ü·Ã¹®¼­
´Ù¿î·Îµå
°Ô½ÃÆÇ
K-Lab
±â¼úÇù·Â¾÷ü
¿öÅ©¼ó
°ü·Ã»çÀÌÆ®
¼³¹®Á¶»ç
Àü¹®°¡Æò°¡±×·ì
GIIS ÀÎÆ®¶ó
KRISTAL À¯Áöº¸¼ö
¿ÀǼҽº Á¤º¸°Ë»ö°ü¸®½Ã½ºÅÛ KRISTAL-IRMS
Knowledge Retrieval In Science & Technology Affiliated Literatures - Information Retrieval Management System

KCrawler

KCrawler °³¿ä

    KCrawler ÀÏ¹Ý Web-Crawler µ¿ÀÏÇÑ ±â´ÉÀ» °¡Áö°í ÀÖÀ¸³ª ƯÁ¤ »çÀÌÆ® Crawlering¸¦ À§ÇÑ ¸ñÀûÀ¸·Î Á¦À۵Ǿú´Ù..
    KCrawler¸¦ ƯÁ¤ URL »çÀÌÆ®¸¦ ±âÁØÀ¸·Î CrawleringÇÏ¿© KRISTAL DB¿¡ ¹Ù·Î ÀûÀçÇÏ´Â ±â´ÉÀ» °¡Áö°í ÀÖ½À´Ï´Ù.

KCrawler ±¸Á¶

    KCrawler Structure

KCrawler ±â´É

  • KCrawler Robot Scheduler Task
      KCrawler Robot
    • Based KCrawlerConfig.xml
    • Multi-Thread
    • FIFO(URL) °ü¸®±â´É
    • ProcessedURL °ü¸® ±â´É
    • HTTP connection
    • URL Parser(Anchors)
    • HTML2TXT
    • SaveUTF(by option)
    • Run/Stop/Reset ±â´É
  • Update Robot Scheduler Task
      KCrawler update
    • Real-Time update to KRISTAL robot
    • Auto update-file to KRISTAL robot(not real-time)
  • Binary Robot Scheduler Task
    • Doc Filtering
    • Binary file downloading
  • KRISTAL Manager Task
    • KRISTAL Server auto-run/stop
    • KRIATAL Server Test-config/Checking alive
    • connection to KRISTAL Server
    • Delete ±â´É
  • Schema Task
    • Create table ±â´É
    • Section name mapping ±â´É
KCrawler crawlering KCrawler manager

½Ã½ºÅÛ È¯°æ

  • Windows/Linux + KRISTAL Version 3.x.x ÀÌ»ó

µ¥¸ð »çÀÌÆ®

KISTI © 2006. GIIS - Group for Intelligent Information Systems. Some rights may be reserved.
Powered By KRISTAL-IRMS
KISTI