(One intermediate revision by the same user not shown) | |||
Line 15: | Line 15: | ||
=== D === | === D === | ||
* Datar, Mayur, et al. "Locality-sensitive hashing scheme based on p-stable distributions." 2004. [http://www.cs.princeton.edu/courses/archive/spring05/cos598E/bib/p253-datar.pdf] [[Locality Sensitive Hashing]], [[Euclidean LSH]] | * Datar, Mayur, et al. "Locality-sensitive hashing scheme based on p-stable distributions." 2004. [http://www.cs.princeton.edu/courses/archive/spring05/cos598E/bib/p253-datar.pdf] [[Locality Sensitive Hashing]], [[Euclidean LSH]] | ||
+ | * De Kok D., Brouwer H. "Natural language processing for the working programmer", 2011. [[Collocation Extraction]] | ||
* De Smet, Yves. "An introduction to multicriteria decision aid: The PROMETHEE and GAIA methods." [[PROMETHEE]] | * De Smet, Yves. "An introduction to multicriteria decision aid: The PROMETHEE and GAIA methods." [[PROMETHEE]] | ||
* Deerwester, Scott C., et al. "Indexing by latent semantic analysis." 1990. [http://www.cob.unt.edu/itds/faculty/evangelopoulos/dsci5910/LSA_Deerwester1990.pdf] [[Latent Semantic Analysis]] | * Deerwester, Scott C., et al. "Indexing by latent semantic analysis." 1990. [http://www.cob.unt.edu/itds/faculty/evangelopoulos/dsci5910/LSA_Deerwester1990.pdf] [[Latent Semantic Analysis]] | ||
Line 46: | Line 47: | ||
* Landauer, T. et al. "An introduction to latent semantic analysis." 1998. [http://tottdp.googlecode.com/files/LandauerFoltz-Laham1998.pdf] [[Latent Semantic Analysis]] | * Landauer, T. et al. "An introduction to latent semantic analysis." 1998. [http://tottdp.googlecode.com/files/LandauerFoltz-Laham1998.pdf] [[Latent Semantic Analysis]] | ||
* Larsen, Bjornar et al. "Fast and effective text mining using linear-time document clustering." 1999. [http://comminfo.rutgers.edu/~muresan/IR/Docs/Articles/sigkddLarsen1999.pdf] [[Document Clustering]] | * Larsen, Bjornar et al. "Fast and effective text mining using linear-time document clustering." 1999. [http://comminfo.rutgers.edu/~muresan/IR/Docs/Articles/sigkddLarsen1999.pdf] [[Document Clustering]] | ||
+ | * Lee, K., Lee, Y. et al "Parallel data processing with MapReduce: a survey" 2012. [http://www.cs.arizona.edu/~bkmoon/papers/sigmodrec11.pdf] [[Hadoop]], [[MapReduce]], [[Hadoop MapReduce]] | ||
+ | * Lee K., Jalali A., Dasdan A. "Real time bid optimization with smooth budget delivery in online advertising", 2013. [https://arxiv.org/abs/1305.3011] [[Budget Pacing]] | ||
* Li, Yong H., et al. "Classification of text documents." 1998. [http://julio.staff.ipb.ac.id/files/2014/09/LiJ98.pdf] [[Term Clustering]] | * Li, Yong H., et al. "Classification of text documents." 1998. [http://julio.staff.ipb.ac.id/files/2014/09/LiJ98.pdf] [[Term Clustering]] | ||
* Liu, Tao, et al. "An evaluation on feature selection for text clustering." 2003. [http://www.aaai.org/Papers/ICML/2003/ICML03-065.pdf] [[Term Contribution]] | * Liu, Tao, et al. "An evaluation on feature selection for text clustering." 2003. [http://www.aaai.org/Papers/ICML/2003/ICML03-065.pdf] [[Term Contribution]] | ||
Line 51: | Line 54: | ||
=== M === | === M === | ||
+ | Manning C., Schütze H. "Foundations of statistical natural language processing", 1999. [[Collocation Extraction]] | ||
+ | |||
=== N === | === N === | ||
=== O === | === O === | ||
− | * Oikonomakou, | + | * Oikonomakou, N, Vazirgiannis, M. "A review of web document clustering approaches." Data mining and knowledge discovery handbook. 2010. [https://scholar.google.com/scholar?cluster=1261203777431390097&hl=ru&as_sdt=0,5] [[Cluster Analysis]] [[Agglomerative Clustering]] [[K-Means]] |
− | * Osinski, | + | * Osinski, S. "Improving quality of search results clustering with approximate matrix factorisations." 2006. [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.74&rep=rep1&type=pdf] [[Non-Negative Matrix Factorization]] |
+ | * Ordonez, C, et al, "Relational versus non-relational database systems for data warehousing." 2010. [http://www2.cs.uh.edu/~ordonez/w-2010-DOLAP-relnonrel.pdf] [[Hadoop]], [[Hadoop MapReduce]] | ||
=== P === | === P === | ||
− | * Pagael, | + | * Pagael R, Schubotz M. "Mathematical Language Processing Project." 2014. [http://arxiv.org/abs/1407.0167] [[Mathematical Definition Extraction]] [[Math-Aware POS Tagging]] |
− | * Paulevé, | + | * Paulevé, L. et al. "Locality sensitive hashing: A comparison of hash function types and querying mechanisms." 2010. [https://hal.inria.fr/inria-00567191/document] [[Locality Sensitive Hashing]], [[K-Means LSH]] |
+ | * Petrović S. et al. "Comparison of collocation extraction measures for document indexing", 2006. [http://bib.irb.hr/datoteka/251298.110-4-157-203.pdf] | ||
Only papers I read and used as sources (or small books that don't deserve a separate wiki page)
Manning C., Schütze H. "Foundations of statistical natural language processing", 1999. Collocation Extraction