Line 15: Line 15:
 
=== D ===
 
=== D ===
 
* Datar, Mayur, et al. "Locality-sensitive hashing scheme based on p-stable distributions." 2004. [http://www.cs.princeton.edu/courses/archive/spring05/cos598E/bib/p253-datar.pdf] [[Locality Sensitive Hashing]], [[Euclidean LSH]]
 
* Datar, Mayur, et al. "Locality-sensitive hashing scheme based on p-stable distributions." 2004. [http://www.cs.princeton.edu/courses/archive/spring05/cos598E/bib/p253-datar.pdf] [[Locality Sensitive Hashing]], [[Euclidean LSH]]
 +
* De Kok D., Brouwer H. "Natural language processing for the working programmer", 2011. [[Collocation Extraction]]
 
* De Smet, Yves. "An introduction to multicriteria decision aid: The PROMETHEE and GAIA methods." [[PROMETHEE]]
 
* De Smet, Yves. "An introduction to multicriteria decision aid: The PROMETHEE and GAIA methods." [[PROMETHEE]]
 
* Deerwester, Scott C., et al. "Indexing by latent semantic analysis." 1990. [http://www.cob.unt.edu/itds/faculty/evangelopoulos/dsci5910/LSA_Deerwester1990.pdf] [[Latent Semantic Analysis]]
 
* Deerwester, Scott C., et al. "Indexing by latent semantic analysis." 1990. [http://www.cob.unt.edu/itds/faculty/evangelopoulos/dsci5910/LSA_Deerwester1990.pdf] [[Latent Semantic Analysis]]
Line 46: Line 47:
 
* Landauer, T. et al. "An introduction to latent semantic analysis." 1998. [http://tottdp.googlecode.com/files/LandauerFoltz-Laham1998.pdf] [[Latent Semantic Analysis]]
 
* Landauer, T. et al. "An introduction to latent semantic analysis." 1998. [http://tottdp.googlecode.com/files/LandauerFoltz-Laham1998.pdf] [[Latent Semantic Analysis]]
 
* Larsen, Bjornar et al. "Fast and effective text mining using linear-time document clustering." 1999. [http://comminfo.rutgers.edu/~muresan/IR/Docs/Articles/sigkddLarsen1999.pdf] [[Document Clustering]]
 
* Larsen, Bjornar et al. "Fast and effective text mining using linear-time document clustering." 1999. [http://comminfo.rutgers.edu/~muresan/IR/Docs/Articles/sigkddLarsen1999.pdf] [[Document Clustering]]
 +
* Lee, K., Lee, Y. et al "Parallel data processing with MapReduce: a survey" 2012. [http://www.cs.arizona.edu/~bkmoon/papers/sigmodrec11.pdf] [[Hadoop]], [[MapReduce]], [[Hadoop MapReduce]]
 
* Li, Yong H., et al. "Classification of text documents." 1998. [http://julio.staff.ipb.ac.id/files/2014/09/LiJ98.pdf] [[Term Clustering]]
 
* Li, Yong H., et al. "Classification of text documents." 1998. [http://julio.staff.ipb.ac.id/files/2014/09/LiJ98.pdf] [[Term Clustering]]
 
* Liu, Tao, et al. "An evaluation on feature selection for text clustering." 2003. [http://www.aaai.org/Papers/ICML/2003/ICML03-065.pdf] [[Term Contribution]]
 
* Liu, Tao, et al. "An evaluation on feature selection for text clustering." 2003. [http://www.aaai.org/Papers/ICML/2003/ICML03-065.pdf] [[Term Contribution]]
Line 51: Line 53:
  
 
=== M ===
 
=== M ===
 +
Manning C., Schütze H. "Foundations of statistical natural language processing", 1999. [[Collocation Extraction]]
 +
 
=== N ===
 
=== N ===
 
=== O ===
 
=== O ===
* Oikonomakou, Nora, and Michalis Vazirgiannis. "A review of web document clustering approaches." Data mining and knowledge discovery handbook. 2010. [https://scholar.google.com/scholar?cluster=1261203777431390097&hl=ru&as_sdt=0,5] [[Cluster Analysis]] [[Agglomerative Clustering]] [[K-Means]]
+
* Oikonomakou, N, Vazirgiannis, M. "A review of web document clustering approaches." Data mining and knowledge discovery handbook. 2010. [https://scholar.google.com/scholar?cluster=1261203777431390097&hl=ru&as_sdt=0,5] [[Cluster Analysis]] [[Agglomerative Clustering]] [[K-Means]]
* Osinski, Stanislaw. "Improving quality of search results clustering with approximate matrix factorisations." 2006. [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.74&rep=rep1&type=pdf] [[Non-Negative Matrix Factorization]]
+
* Osinski, S. "Improving quality of search results clustering with approximate matrix factorisations." 2006. [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.74&rep=rep1&type=pdf] [[Non-Negative Matrix Factorization]]
 +
* Ordonez, C, et al, "Relational versus non-relational database systems for data warehousing." 2010. [http://www2.cs.uh.edu/~ordonez/w-2010-DOLAP-relnonrel.pdf] [[Hadoop]], [[Hadoop MapReduce]]
  
 
=== P ===
 
=== P ===
* Pagael, Rober, and Moritz Schubotz. "Mathematical Language Processing Project." 2014. [http://arxiv.org/abs/1407.0167] [[Mathematical Definition Extraction]] [[Math-Aware POS Tagging]]
+
* Pagael R, Schubotz M. "Mathematical Language Processing Project." 2014. [http://arxiv.org/abs/1407.0167] [[Mathematical Definition Extraction]] [[Math-Aware POS Tagging]]
* Paulevé, Loïc, et al. "Locality sensitive hashing: A comparison of hash function types and querying mechanisms." 2010. [https://hal.inria.fr/inria-00567191/document] [[Locality Sensitive Hashing]], [[K-Means LSH]]
+
* Paulevé, L. et al. "Locality sensitive hashing: A comparison of hash function types and querying mechanisms." 2010. [https://hal.inria.fr/inria-00567191/document] [[Locality Sensitive Hashing]], [[K-Means LSH]]
 +
* Petrović S. et al. "Comparison of collocation extraction measures for document indexing", 2006. [http://bib.irb.hr/datoteka/251298.110-4-157-203.pdf]
  
  

Revision as of 23:50, 25 April 2017

Sources Index

Only papers I read and used as sources (or small books that don't deserve a separate wiki page)

  • ordered by first author
  • ABCDEFGHIJKLMNOPQRSTUVWXYZ


A

B

C

D

E

F

G

H

  • Hopcroft, John, and Ravindran Kannan. "Foundations of Data Science1." 2014. Power Iteration

I

J


K

L


M

Manning C., Schütze H. "Foundations of statistical natural language processing", 1999. Collocation Extraction

N

O

P


Q

R

S


T

U

V

W

X

Y

Z