cikm 2009 论文列表
Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009.
|
CloudDB workshop summary.
Privacy and anonymization for very large datasets.
TSA'09 workshop summary: topic-sentiment analysis.
Bridging the gap: complex networks meet information and knowledge management.
ASIC: algebra-based structural index comparison.
XQGen: an algebra-based XPath query generator for micro-benchmarking.
A graphical browser for XML schema documents.
Efficient and reliable merging of XML documents.
MRM: an adaptive framework for XML searching.
SOIRE: a service-oriented IR evaluation architecture.
RefMed: relevance feedback retrieval system fo PubMed.
RSS watchdog: an instant event monitor on real online news streams.
Helping people to choose for whom to vote. a web information system for the 2009 European elections.
A flexible simulation environment for flash-aware algorithms.
OSSOBOOK: database and knowledgemanagement techniques for archaeozoology.
AnchorWoman: top-k structured mobile web search engine.
SPIDER: a system for scalable, parallel / distributed evaluation of large-scale RDF data.
Constructing evolutionary taxonomy of collaborative tagging systems.
LuposDate: a semantic web database system.
VRIFA: a nonlinear SVM visualization tool using nomogram and localized radial basis function (LRBF) kernels.
YAM: a schema matcher factory.
OfCourse: web content discovery, classification and information extraction for online course materials.
HDDBrs middleware for implementing highly available distributed databases.
OLAP with UDFs in digital libraries.
Demonstration of an RFID middleware: LIT ALE manager.
A novel distributed P2P simulator architecture: D-P2P-sim.
DS-Cuber: an integrated OLAP environment for data streams.
M-COPE: a multiple continuous query processing engine.
Stochastic gradient boosted distributed decision trees.
Easiest-first search: towards comprehension-based web search.
Interactive relevance feedback with graded relevance and sentence extraction: simulated user experiments.
Automatic generation of topic pages using query-based aspect models.
Multidimensional political spectrum identification and analysis.
Boosting KNN text classification accuracy by using supervised term weighting schemes.
Automatic query generation for patent search.
Measuring system performance and topic discernment using generalized adaptive-weight mean.
Evaluation of methods for relative comparison of retrieval systems based on clickthroughs.
Feature selection for ranking using boosted trees.
Improving binary classification on text problems using differential word features.
Ensembles in adversarial classification for spam.
Finding good feedback documents.
Incorporating robustness into web ranking evaluation.
Generating synopses for document-element search.
A study of selective collection enrichment for enterprise search.
Location cache for web queries.
An analysis framework for search sequences.
URL normalization for de-duplication of web pages.
The influence of the document ranking in expert search.
A scalable and effective full-text search in P2P networks.
Retrieval constraints and word frequency distributions: a log-logistic model for IR.
An improved feedback approach using relevant local posts for blog feed retrieval.
Graph-based seed selection for web-scale crawlers.
Comparative document summarization via discriminative sentence selection.
Exploring path query results through relevance feedback.
Answer typing for information retrieval.
Exploiting query views for static index pruning in web search engines.
Aging effects on query flow graphs for query suggestion.
Smoothing document language model with local word graph.
Data extraction from the web using wild card queries.
A proactive personalised retrieval system.
Pseudo relevance feedback using semantic clustering in relevance language model.
A collaborative filtering approach to ad recommendation using the query-ad click graph.
Smoothing DCG for learning to rank: a novel approach using smoothed hinge functions.
Collaborative resource discovery in social tagging systems.
Pure spreading activation is pointless.
A word clustering approach for language model-based sentence retrieval in question answering systems.
Online community search using thread structure.
A query model based on normalized log-likelihood.
Translating relevance scores to probabilities for contextual advertising.
A comparative study of methods for estimating query language models with pseudo feedback.
What makes categories difficult to classify?: a study on predicting classification performance for categories.
Maximal metric margin partitioning for similarity search indexes.
To divide and conquer search ranking by learning query difficulty.
Learning to rank using evolutionary computation: immune programming or genetic programming?
Matching person names through name transformation.
Learning to rank graphs for online similar graph search.
Learning from past queries for resource selection.
Improving retrievability of patents with cluster-based pseudo-relevance feedback documents selection.
Relying on topic subsets for system ranking estimation.
HyperSum: hypergraph based semi-supervised sentence ranking for query-oriented summarization.
An efficient clustering algorithm for large-scale topical web pages.
Exploring relevance for clicks.
What's behind topic formation and development: a perspective of community core groups.
Exploiting bidirectional links: making spamming detection easier.
A general markov framework for page importance computation.
Dynamic hyperparameter optimization for bayesian topical trend analysis.
The effect of negation on sentiment analysis and retrieval effectiveness.
User interests in social media sites: an exploration with micro-blogs.
To obtain orthogonal feature extraction using training data selection.
Incremental query evaluation for support vector machines.
A machine learning approach for improved BM25 retrieval.
A co-classification framework for detecting web spam and spammers in social media web sites.
Blogger-centric contextual advertising.
Multi-aspect opinion polling from textual reviews.
Fragment-based clustering ensembles.
Opinion classification with tree kernel SVM using linguistic modality analysis.
Identifying interesting assertions from the web.
Active learning in partially supervised classification.
CAOFES: an ontological framework for web service retrieval.
Interpretable and reconfigurable clustering of document datasets by deriving word-based rules.
ComprehEnRank: estimating comprehension in classroom by absorbing random walks on a cognitive graph.
Predicting the volume of comments on online news stories.
Spatio-temporal association rule mining framework for real-time sensor network applications.
Topic and keyword re-ranking for LDA-based topic modeling.
Agglomerating local patterns hierarchically with ALPHA.
Building domain-oriented sentiment lexicon by improved information bottleneck.
Vetting the links of the web.
Finding the topical anchors of a context using lexical cooccurrence data.
Combining labeled and unlabeled data with word-class distribution learning.
Enhancing expertise retrieval using community-aware strategies.
XCFS: an XML documents clustering approach using both the structure and the content.
The impact of document structure on keyphrase extraction.
Kernel latent semantic analysis using an information retrieval based kernel.
Cross-domain sentiment classification using a two-stage method.
Mining tourist information from user-supplied collections.
Mining and ranking streams of news stories using cross-stream sequential patterns.
MagicCube: choosing the best snippet for each aspect of an entity.
Automatic link detection: a sequence labeling approach.
Constrained multi-aspect expertise matching for committee review assignment.
Acronym extraction and disambiguation in large-scale organizational web pages.
Real-word spelling correction using Google web 1Tn-gram data set.
A fast and simple method for extracting relevant content from news webpages.
Using negative voting to diversify answers in non-factoid question answering.
Using domain ontology for semantic web usage mining and next page prediction.
iPoG: fast interactive proximity querying on graphs.
Modeling context-dependent information.
Efficient multi-class unlabeled constrained semi-supervised SVM.
Identifying comparable entities on the web.
Experiments on pattern-based relation learning.
MING: mining informative entity relationship subgraphs.
LoOP: local outlier probabilities.
Automatic web data extraction using tree alignment.
Consistent on-line classification of dbs workload events.
Exploiting term relationship to boost text classification.
Clustering object moving patterns for prediction-based object tracking sensor networks.
Feature engineering on event-centric surrogate documents to improve search results.
A hybrid index structure for geo-textual searches.
Exploring multimedia databases via optimization-based relevance feedback and the earth mover's distance.
Using opinion-based features to boost sentence retrieval.
MatchSim: a novel neighbor-based similarity measure with maximum neighborhood matching.
Topic analysis for topic-focused multi-document summarization.
An effective model of using negative relevance feedback for information filtering.
On domain similarity and effectiveness of adapting-to-rank.
Utilizing inter-passage and inter-document similarities for re-ranking search results.
Instance- and bag-level manifold regularization for aggregate outputs classification.
Text summarization model based on the budgeted median problem.
Context sensitive synonym discovery for web search queries.
Web search result summarization: title selection algorithms and user satisfaction.
Who tags the tags?: a framework for bookmark weighting.
Effective and efficient structured retrieval.
Clustering queries for better document ranking.
Similarity-aware indexing for real-time entity resolution.
Adaptive web mining of bilingual lexicons for cross language information retrieval.
iRANK: an interactive ranking framework and its application in query-focused summarization.
Text segmentation via topic modeling: an analytical study.
Multi-task learning for learning to rank in web search.
Exploit the tripartite network of social tagging for web clustering.
Injecting purpose and trust into data anonymisation.
(Not) yet another matcher.
Context-sensitive document ranking.
Extraction of a latent blog community based on subject.
The gardener's problem for web information monitoring.
Mining frequent itemsets in time-varying data streams.
Privacy without noise.
Scalable indexing of RDF graphs for efficient join processing.
Inverted indexes vs. bitmap indexes in decision support systems.
Semantic queries in databases: problems and challenges.
Towards non-directional Xpath evaluation in a RDBMS.
Online anonymity for personalized web services.
Cluster based rank query over multidimensional data streams.
Multidimensional routing indices for efficient distributed query processing.
Dynamic in-page logging for flash-aware B-tree index.
Efficient processing of group-oriented connection queries in a large graph.
Matching stream patterns of various lengths and tolerances.
Diverging patterns: discovering significant frequency change dissimilarities in large databases.
A framework for safely publishing communication traces.
Effective anonymization of query logs.
Label correspondence learning for part-of-speech annotation transformation.
RS-Wrapper: random write optimization for solid state drive.
Structure-aware indexing for keyword search in databases.
Incremental similarity joins with edit distance constraints.
Progressive skyline query evaluation and maintenance in wireless sensor networks.
Walking in the crowd: anonymizing trajectory data for pattern analysis.
Supporting context-based query in personal DataSpace.
Group-by skyline query processing in relational engines.
Rank-aware clustering of structured datasets.
Workload-aware trie indices for XML.
Discovering matching dependencies.
Suffix trees for very large genomic sequences.
Probabilistic moving range query over RFID spatio-temporal data streams.
Minimal common container of tree patterns.
3se: a semi-structured search engine for heterogeneous data in graph model.
ROSE: retail outlet site evaluation by learning with both sample and feature preference.
Towards real-time measurement of customer satisfaction using automatically generated call transcripts.
Predicting the conversion probability for items on C2C ecommerce sites.
iLoc: a framework for incremental location-state acquisition and prediction based on mobile sensors.
ExSearch: a novel vertical search engine for online barter business.
A risk minimization framework for domain adaptation.
Subspace maximum margin clustering.
Large margin transductive transfer learning.
Detection of orthogonal concepts in subspaces of high dimensional data.
Incident threading for news passages.
Retrieval experiments using pseudo-desktop collections.
Probabilistic models of ranking novel documents for faceted topic retrieval.
Classification-based resource selection.
A term dependency-based approach for query terms ranking.
Enhancing recommender systems under volatile userinterest drifts.
A social recommendation framework based on multi-scale continuous conditional random fields.
Beyond hyperlinks: organizing information footprints in search logs to support effective browsing.
Personalized social search based on the user's social network.
PQC: personalized query classification.
Characteristics of document similarity measures for compliance analysis.
A system for detecting xml similarity in content and structure using relational database.
Generating SQL/XML query and update statements.
Characterizing, constructing and managing resource usage profiles of system S applications: challenges and experience.
Fast and effective histogram construction.
Efficient feature weighting methods for ranking.
Compressing tags to find interesting media groups.
Time sequence summarization to scale up chronology-dependent applications.
Socializing or knowledge sharing?: characterizing social intent in community question answering.
Blog cascade affinity: analysis and prediction.
Scalable learning of collective behavior based on sparse social dimensions.
Completing wikipedia's hyperlink structure through dimensionality reduction.
Product feature categorization with multilevel latent semantic association.
Improving web page classification by label-propagation over click graphs.
Framework for timely and accurate ads on mobile devices.
Domain driven data mining to improve promotional campaign ROI and select marketing channels.
Practical lessons of data mining at Yahoo!
POkA: identifying pareto-optimal k-anonymous nodes in a domain hierarchy lattice.
A framework for semantic link discovery over relational data.
Efficient joins with compressed bitmap indexes.
Fuzzy semantic web ontology learning from fuzzy UML model.
Supporting ranking pattern-based aggregate queries in sequence data cubes.
Heterogeneous cross domain ranking in latent space.
Language-model-based ranking for queries on RDF-graphs.
Efficient information retrieval in mobile peer-to-peer networks.
Detecting topic evolution in scientific literature: how can citations help?
A unified relevance model for opinion retrieval.
Graph-based transfer learning.
SELC: a self-supervised model for sentiment classification.
Exploiting internal and external semantics for the clustering of short texts using world knowledge.
Evidence of quality of textual features on the web 2.0.
Clustering web queries.
Information extraction meets relation databases.
Mining data streams with periodically changing distributions.
Evaluating top-k queries over incomplete data streams.
Fast shortest path distance estimation in large networks.
Efficient join processing on uncertain data streams.
A code generation approach to optimizing high-performance distributed data stream processing.
Reducing the risk of query expansion via robust constrained optimization.
Learning to rank from Bayesian decision inference.
A general magnitude-preserving boosting algorithm for search ranking.
Nonlinear static-rank computation.
A signal-to-noise approach to score normalization.
User-induced links in collaborative tagging systems.
Voting in social networks.
Semi-nonnegative matrix factorization with global statistical consistency for collaborative filtering.
Probabilistic latent preference analysis for collaborative filtering.
Learning to recommend questions based on user ratings.
Product query classification.
iMecho: an associative memory based desktop search system.
A study of information retrieval on accumulative social descriptions using the generation features.
Mashup-based information retrieval for domain experts.
Automatic retrieval of similar content using search engine query interface.
Navigational path privacy protection: navigational path privacy protection.
Provenance query evaluation: what's so special about it?
Scalable continuous range monitoring of moving objects in symbolic indoor space.
Density-based clustering using graphics processors.
Probabilistic skyline queries.
Post-rank reordering: resolving preference misalignments between search engines and end users.
Usage based effectiveness measures: monitoring application performance in information retrieval.
Expected reciprocal rank for graded relevance.
Empirical justification of the gain and discount function for nDCG.
Improvements that don't add up: ad-hoc retrieval results since 1998.
L2 norm regularized feature kernel regression for graph data.
Frequent subgraph pattern mining on uncertain graph data.
Graph classification based on pattern co-occurrence.
Independent informative subgraph mining for graph information retrieval.
P-Rank: a comprehensive structural similarity measure over information networks.
Interactive, topic-based visual text summarization and analysis.
Msuggest: a semantic recommender framework for traditional chinese medicine book search engine.
Event detection from flickr data through wavelet-based spatial analysis.
Towards a universal wordnet by learning from combined evidence.
Learning to rank with a novel kernel perceptron method.
Probabilistic models for topic learning from images and captions in online biomedical literatures.
A query language for analyzing networks.
Answering XML queries using materialized views revisited.
Bitmap indexes for relational XML twig query processing.
Low-cost management of inverted files for online full-text search.
Adaptive geospatially focused crawling.
On-line index maintenance using horizontal partitioning.
On the feasibility of multi-site web search engines.
Compact full-text indexing of versioned document collections.
Terminology mining in social media.
sDoc: exploring social wisdom for document enhancement in web mining.
Generating comparative summaries of contradictory opinions in text.
Joint sentiment/topic model for sentiment analysis.
Learning document aboutness from implicit user feedback and document structure.
Efficient itemset generator discovery over a stream sliding window.
Message family propagation for ising mean field based on iteration tree.
Mining linguistic cues for query expansion: applications to drug interaction search.
An integrated discriminative probabilistic approach to information extraction.
Efficient algorithms for approximate member extraction using signature-based inverted lists.
Robust record linkage blocking using suffix arrays.
AS-index: a structure for string search using n-grams and algebraic signatures.
Space-economical partial gram indices for exact substring matching.
Improving search engines using human computation games.
The use of categorization information in language models for question retrieval.
Adaptive relevance feedback in information retrieval.
Computational community interest for ranking.
Using multiple ontologies in information extraction.
Helping editors choose better seed sets for entity set expansion.
Named entity disambiguation by leveraging wikipedia semantic knowledge.
Data-driven compound splitting method for english compounds in domain names.
Ranking model adaptation for domain-specific search.
Supervised semantic indexing.
Learning better transliterations.
A translation model for matching reviews to objects.
Intention-focused active reranking for image object retrieval.
Effective XML content and structure retrieval with relevance ranking.
Linear inclusion for XML regular expression types.
Dissemination of heterogeneous XML data in publish/subscibe systems.
Efficient processing of twig pattern matching in fuzzy XML.
Effective, design-independent XML keyword search.
Clustering and exploring search results using timeline constructions.
Characterizing and predicting search engine switching behavior.
Analyzing and evaluating query reformulation strategies in web search logs.
Characterizing commercial intent.
What happens after an ad click?: quantifying the impact of landing pages in web advertising.
Efficient record-level wrapper induction.
Semi-supervised learning of semantic classes for query understanding: from the web and for the web.
Query by analogical example: relational search using web search engine indices.
An empirical study on using hidden markov model for search interface segmentation.
StereoTrust: a group based personalized trust model.
Advanced metasearch engines.
Confucius and "its" intelligent disciples.
DB-IR integration and its application to a massively-parallel search engine.