1. Weiyi Meng and Hai He. 2009. Data Search Engine. In Encyclopedia of
Computer Science and Engineering (Benjamin Wah, ed.), John Wiley & Sons,
pp. 826-834, January 2009.
2. Thomas A. Phelps, Robert Wilensky. 2000.
Robust Hyperlinks Cost Just Five Words Each. Technical Report CSD-00-1091,
University of California at Berkeley.
3. Martin Klein, Michael L. Nelson. 2008. Revisiting Lexical
Signatures to (Re-)Discover Web Pages. Proceedings of the 12th European
Conference on Research and Advanced Technology for Digital Libraries,
pages 371–382.
4. Seung-Taek Park, David M. Pennock, C. Lee Giles,
Robert Krovetz. 2002. Analysis of Lexical Signatures for Finding Lost or
Related Documents. SIGIR '02, August 11-15, 2002, Tampere, Finland.
5. Seung-Taek Park, David M. Pennock, C. Lee Giles,
Robert Krovetz. 2004. Analysis of Lexical Signatures for Improving Information
Persistence on the World Wide Web. ACM Transactions on Information Systems,
Vol. 22, No. 4, October 2004, pages 540–572.
6. M. Cutler, Y. Shih, W. Meng. 1997. Using
the Structure of HTML Documents to Improve Retrieval. USENIX Symposium on
Internet Technologies and Systems.
7. M. Cutler, H. Deng, S. S. Maniccam, W. Meng. 1999.
A New Study on Using HTML Structures to Improve Retrieval.
Proceedings of the 11th IEEE International Conference on Tools with
Artificial Intelligence, pages 406–409.
8. J. Lu, Y. Shih, W. Meng and M. Cutler.
1996. Web-based Search Tool for Organization Retrieval. http://nexus.data.binghamton.edu/~yungming/webor.html
9. Rada Mihalcea and Paul Tarau. 2004. TextRank: Bringing
Order into Texts. Proceedings of EMNLP 2004, pages 404–411, Barcelona, Spain.
10. Rada Mihalcea. 2004. Graph-based Ranking
Algorithms for Sentence Extraction, Applied to Text Summarization. In
Proceedings of the 20th International Conference on Computational Linguistics
(COLING 2004), Geneva, Switzerland.
11. Xiaojun Wan, Jianwu Yang. 2006.
WordRank-Based Lexical Signatures for Finding Lost or Related Web Pages. APWeb
2006, LNCS 3841, pp. 843-849.
12. Larry Page. 1998. The PageRank Citation
Ranking: Bringing Order to the Web. Computer Networks and ISDN Systems.
13. Jon M. Kleinberg. 1999. Authoritative
Sources in a Hyperlinked Environment. Journal of the ACM, 46(5): 604-632.
14. WordNet, http://wordnet.princeton.edu/
15. C. Y. Lin and E. H. Hovy. 2003. Automatic
Evaluation of Summaries Using N-gram Co-occurrence Statistics. In Proceedings
of the Human Language Technology Conference (HLT-NAACL 2003), Edmonton, Canada,
May 2003.
16. Weiyi Meng, Clement Yu, and King-Lup Liu.
2002. Building Efficient and Effective Metasearch Engines. ACM Computing Surveys,
Vol. 34, No. 1, March 2002, pp. 48-89.
17. Weiyi Meng, Zonghuan Wu, Clement Yu, and
Zhuogang Li. 2001. A Highly-Scalable and Effective Method for Metasearch. ACM
Transactions on Information Systems 19(3), pp. 310-335, July 2001.
18. Michael K. Bergman. 2001. White Paper: The
Deep Web: Surfacing Hidden Value. BrightPlanet. Ann Arbor, MI: Scholarly
Publishing Office, University of Michigan, University Library, vol. 7, no. 1,
August 2001.
19. Martin Klein, Michael L. Nelson. 2008. A Comparison of
Techniques for Estimating IDF Values to Generate Lexical Signatures for the Web.
Proceedings of the 10th ACM Workshop on Web Information and Data Management,
Napa Valley, California, USA. Session: System Issues. Pages 39-46.
20. Crunch, http://www.psl.cs.columbia.edu/crunch/