Skip to main content
Login
Catalog
Repository
Education
Projects
Tools
Services
About
Partners
Mission Statement
CLARIN
DARIAH
Service integrations
Project partnerships
Home
Search
2 results
Back to results
Limit your search
Author
2
Steger, Johannes M.
2
Stemle, Egon W.
Subject
web page cleaning
2
boiler plate removal
2
manual annotation
2
training data
2
WaC
2
Web as Corpus
Show more
Search subject
Submit
Rights
2
PUB
Language (ISO)
2
English
Type
2
corpus
2
text
Contain Files
2
No
Community
2
CMC & WaC
Reset filters
Settings
Sort By
Most Relevant
Title Asc
Title Desc
Date Issued Asc
Date Issued Desc
Results per page
1
5
10
20
40
60
80
100
All of DSpace
Search
Subject: web page cleaning
×
Show as list
Search Tools
Search Results
Showing
1 - 2 out of 2 results
corpus
CMC & WaC
KrdWrd CANOLA Corpus 1.1
Publisher:
(
Institute for Applied Linguistics, Eurac Research
/
2010-11-25)
Author(s):
Stemle, Egon W.
and
Steger, Johannes M.
Publicly Available
corpus
CMC & WaC
KrdWrd CANOLA Corpus 1.0
Publisher:
(
Institute for Applied Linguistics, Eurac Research
/
2010-09-10)
Author(s):
Stemle, Egon W.
and
Steger, Johannes M.
Publicly Available