icde43

ICDE 2021 Paper List

37th IEEE International Conference on Data Engineering, ICDE 2021, Chania, Greece, April 19-22, 2021.

BERT-based Dynamic Clustering of Subway Stations Using Flow Information.
Tensor Topic Models with Graphs and Applications on Individualized Travel Patterns.
Combining Anatomical Constraints and Deep learning for 3-D CBCT Dental Image Multi-label Segmentation.
Graph Based Approach to Real-Time Metro Passenger Flow Anomaly Detection.
MoniLog: An Automated Log-Based Anomaly Detection System for Cloud Computing Infrastructures.
Edge Sparsification for Graphs via Meta-Learning.
REACT: Real-Time Contact Tracing and Risk Monitoring via Privacy-Enhanced Mobile Tracking.
Clouseau: Blockchain-based Data Integrity for HDFS Clusters.
FloraVision: A Spatial Crowd-based Learning System for California Native Plants.
The F4U System for Understanding the Effects of Data Quality.
PITA: Privacy Through Provenance Abstraction.
Odlaw: A Tool for Retroactive GDPR Compliance.
A System for Efficiently Hunting for Cyber Threats in Computer Systems Using Threat Intelligence.
ConCaT: Construction of Category Trees from Search Queries in E-Commerce.
DeBinelle: Semantic Patches for Coupled Database-Application Evolution.
Josch: Managing Schemas for NoSQL Document Stores.
Automated Data Science for Relational Data.
A Cockpit for the Development and Evaluation of Autonomous Database Systems.
UniKG: A Unified Interoperable Knowledge Graph Database System.
CREATe: Clinical Report Extraction and Annotation Technology.
QeNoBi: A System for QuErying and mining BehavIoral Patterns.
SpeakNav: A Voice-based Navigation System via Route Description Language Understanding.
CoWiz: Interactive Covid-19 Visualization Based On Multilayer Network Analysis.
VADETIS: An Explainable Evaluator for Anomaly Detection Techniques.
SOUP: A Fleet Management System for Passenger Demand Prediction and Competitive Taxi Supply.
SubDEx: Exploring Ratings in Subjective Databases.
SPARQLIt: Interactive SPARQL Query Refinement.
Distributed Company Control in Company Shareholding Graphs.
ReLink: Complete-Link Industrial Record Linkage Over Hybrid Feature Spaces.
Efficient and Scalable Structure Learning for Bayesian Networks: Algorithms and Applications.
Improving Conversational Recommender System by Pretraining Billion-scale Knowledge Graph.
Large-scale Fake Click Detection for E-commerce Recommendation Systems.
Turbo: Fraud Detection in Deposit-free Leasing Service via Real-Time Behavior Network Mining.
IPS: Unified Profile Management for Ubiquitous Online Recommendations.
IntelliTag: An Intelligent Cloud Customer Service System Based on Tag Recommendation.
Implementing Rigid Temporal Geometries in Moving Object Databases.
GeoDart: A System for Discovering Maps Discrepancies.
The IoT Meta-Control Firewall.
Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems.
ATNN: Adversarial Two-Tower Neural Network for New Item's Popularity Prediction in E-commerce.
Purchase Intent Forecasting with Convolutional Hierarchical Transformer Networks.
Billion-scale Pre-trained E-commerce Product Knowledge Graph Model.
Explore User Neighborhood for Real-time E-commerce Recommendation.
Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint.
Learnings from a Retail Recommendation System on Billions of Interactions at bol.com.
Query Rewriting via Cycle-Consistent Translation for E-Commerce Search.
Microlearner: A fine-grained Learning Optimizer for Big Data Workloads at Microsoft.
Prefix-Graph: A Versatile Log Parsing Approach Merging Prefix Tree with Probabilistic Graph.
DBSpinner: Making a Case for Iterative Processing in Databases.
Swift: Reliable and Low-Latency Data Processing at Cloud Scale.
Exploratory Data Analysis in SAP IQ Using Query-Time Sampling.
Nullius in Verba: Reproducibility for Database Systems Research, Revisited.
Evaluation of Duplicate Detection Algorithms: From Quality Measures to Test Data Generation.
High-Dimensional Similarity Search for Scalable Data Science.
Workload-Aware Performance Tuning for Autonomous DBMSs.
Countering Bias in Personalized Rankings: From Data Engineering to Algorithm Development.
Fairness in Rankings and Recommenders: Models, Methods and Research Directions.
Constrained Truth Discovery (Extended Abstract).
Discovering Relaxed Functional Dependencies based on Multi-attribute Dominance [Extended Abstract].
Distributed Density Peaks Clustering Revisited (Extended Abstract).
Effective Keyword Search in Weighted Graphs (Extended Abstract).
Towards Query Pricing on Incomplete Data (Extended Abstract).
Truss-based Structural Diversity Search in Large Graphs (Extended Abstract).
A Hybrid Data Cleaning Framework Using Markov Logic Networks (Extended Abstract).
Index-based Solutions for Efficient Density Peak Clustering (Extended Abstract).
LShape Partitioning: Parallel Skyline Query Processing using MapReduce (Extended Abstract).
A Generic Ontology Framework for Indexing Keyword Search on Massive Graphs (Extended Abstract).
Efficient Shapelet Discovery for Time Series Classification (Extended Abstract).
MaxiZone: Maximizing Influence Zone over Geo-Textual Data (Extended Abstract).
Reliability Maximization in Uncertain Graphs (Extended Abstract).
CuWide: Towards Efficient Flow-based Training for Sparse Wide Models on GPUs (Extended Abstract).
ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering (Extended Abstract).
FastDTW is approximate and Generally Slower than the Algorithm it Approximates (Extended Abstract).
Compressed Indexes for Fast Search of Semantic Data (Extended Abstract).
Entity Alignment for Knowledge Graphs with Multi-order Convolutional Networks (Extended Abstract).
LogStore: A Workload-aware, Adaptable Key-Value Store on Hybrid Storage Systems (Extended abstract).
Analyzing In-Memory NoSQL Landscape (Extended Abstract).
A Collective Approach to Scholar Name Disambiguation (Extended Abstract).
Leveraging Currency for Repairing Inconsistent and Incomplete Data (Extended Abstract).
Semantic Search Pipeline: From Query Expansion to Concept Forging.
User Profiling based on Nonlinguistic Audio Data.
Towards Efficient MaxBRNN Computation for Streaming Updates.
Top-k Publish/Subscribe for Ride Hitching.
The LSM RUM-Tree: A Log Structured Merge R-Tree for Update-intensive Spatial Workloads.
SPEAR: Dynamic Spatio-Temporal Query Processing over High Velocity Data Streams.
Near-Optimal Fixed-Route Scheduling for Crowdsourced Transit System.
GRAB: Finding Time Series Natural Structures via A Novel Graph-based Scheme.
Crowdrebate: An Effective Platform to Get more Rebate for Customers.
An Actor-Critic Ensemble Aggregation Model for Time-Series Forecasting.
Predicting the Impact of Disruptions to Urban Rail Transit Systems.
Experimental Study of Big Raster and Vector Database Systems.
Collecting Geospatial Data with Local Differential Privacy for Personalized Services.
Knowledge-Based Dynamic Systems Modeling: A Case Study on Modeling River Water Quality.
DAEMON: Unsupervised Anomaly Detection and Interpretation for Multivariate Time Series.
CrowdAtlas: Estimating Crowd Distribution within the Urban Rail Transit System.
Description Generation for Points of Interest.
TIRA in Baidu Image Advertising.
A Learning to Tune Framework for LSH.
Concurrency Control Based on Transaction Clustering.
TrajForesee: How limited detailed trajectories enhance large-scale sparse information to predict vehicle trajectories?
T3S: Effective Representation Learning for Trajectory Similarity Computation.
ValueNet: A Natural Language-to-SQL System that Learns from Database Information.
Self-Supervised Deep Metric Learning for Pointsets.
Revisiting Data Prefetching for Database Systems with Machine Learning Techniques.
An Autonomous Materialized View Management System with Deep Reinforcement Learning.
Querying for Interactions.
Palette: Towards Multi-source Model Selection and Ensemble for Reuse.
Package Pick-up Route Prediction via Modeling Couriers' Spatial-Temporal Behaviors.
Heterogeneous Information Assisted Bandit Learning: Theory and Application.
Gallat: A Spatiotemporal Graph Attention Network for Passenger Demand Prediction.
Catching them red-handed: Real-time Aggression Detection on Social Media.
AutoOD: Neural Architecture Search for Outlier Detection.
Towards the smart use of embedding and instance features for property matching.
Sequential Recommendation on Dynamic Heterogeneous Information Network.
Batching and Matching for Food Delivery in Dynamic Road Networks.
Top-k Community Similarity Search Over Large Road-Network Graphs.
Fast Distributed Complex Join Processing.
Taking Heuristic Based Graph Edge Partitioning One Step Ahead via OffStream Partitioning Approach.
Structure-Aware Parameter-Free Group Query via Heterogeneous Information Network Transformer.
Stealthy Targeted Data Poisoning Attack on Knowledge Graphs.
Social Visibility Optimization in OSNs with Anonymity Guarantees: Modeling, Algorithms and Applications.
Selective Edge Shedding in Large Graphs Under Resource Constraints.
Hypercore Maintenance in Dynamic Hypergraphs.
HuGE: An Entropy-driven Approach to Efficient and Scalable Graph Embeddings.
EnsemFDet: An Ensemble Approach to Fraud Detection based on Bipartite Graph.
DDHH: A Decentralized Deep Learning Framework for Large-scale Heterogeneous Networks.
Cluster-and-Conquer: When Randomness Meets Graph Locality.
Privacy-Preserving Sequential Publishing of Knowledge Graphs.
Node2LV: Squared Lorentzian Representations for Node Proximity.
CaSIE: Canonicalize and Informative Selection of the OpenIE system.
Substring Similarity Search with Synonyms.
Ranking Papers by their Short-Term Scientific Impact.
Updatable Materialization of Approximate Constraints.
Optimizing Multiple Multi-Way Stream Joins.
CIAO: An Optimization Framework for Client-Assisted Data Loading.
Ranking Desired Tuples by Database Exploration.
PROTEUS: Predictive Explanation of Anomalies.
Patterns Count-Based Labels for Datasets.
Summarizing Provenance of Aggregate Query Results in Relational Databases.
Managing Consent for Data Access in Shared Databases.
From Minimum Change to Maximum Density: On S-Repair under Integrity Constraints.
Ranking Data Slices for ML Model Validation: A Shapley Value Approach.
Multi-Behavior Enhanced Recommendation with Cross-Interaction Collaborative Relation Modeling.
Hierarchical Tree-based Sequential Event Prediction with Application in the Aviation Accident Report.
Decoupled Instance-label Extreme Multi-label Classification with Skew Coordinate Feature Space.
Estimating the extent of the effects of Data Quality through Observations.
High-Performance Smart Contracts Concurrent Execution for Permissioned Blockchain Using SGX.
Joint Index, Sorting, and Compression Optimization for Memory-Efficient Spatio-Temporal Data Management.
Utilizing Delta Trees for Efficient, Iterative Exploration and Transformation of Semi-Structured Contents.
TLBtree: A Read/Write-Optimized Tree Index for Non-Volatile Memory.
SING: Sequence Indexing Using GPUs.
Rethink the Linearizability Constraints of Raft for Distributed Key-Value Stores.
Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems.
DS2: Handling Data Skew Using Data Stealings over High-Speed Networks.
Accelerating Similarity-based Mining Tasks on High-dimensional Data by Processing-in-memory.
SciChain: Blockchain-enabled Lightweight and Efficient Data Provenance for Reproducible Scientific Computing.
Meepo: Sharded Consortium Blockchain.
SLIMSTORE: A Cloud-based Deduplication System for Multi-version Backups.
Accelerating the Yinyang K-Means Algorithm Using the GPU.
Performance Characterization of HTAP Workloads.
Evaluating List Intersection on SSDs for Parallel I/O Skipping.
An Empirical Experiment on Deep Learning Models for Predicting Traffic Data.
Memory-Efficient Database Fragment Allocation for Robust Load Balancing when Nodes Fail.
Spangle: A Distributed In-Memory Processing System for Large-Scale Arrays.
A Two-layer Partitioning for Non-point Spatial Data.
TASM: A Tile-Based Storage Manager for Video Analytics.
Efficient Constrained Shortest Path Query Answering with Forest Hop Labeling.
Forecasting Ambulance Demand with Profiled Human Mobility via Heterogeneous Multi-Graph Neural Networks.
EnhanceNet: Plugin Neural Networks for Enhancing Correlated Time Series Forecasting.
Automatic Webpage Briefing.
Optimally Summarizing Data by Small Fact Sets for Concise Answers to Voice Queries.
Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees.
Fast Similarity Computation for t-SNE.
G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression.
Improving Constrained Search Results By Data Melioration.
MLCask: Efficient Management of Component Evolution in Collaborative Data Analytics Pipelines.
Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation.
A Fully Dynamic Algorithm for k-Regret Minimizing Sets.
ProMIPS: Efficient High-Dimensional c-Approximate Maximum Inner Product Search with a Lightweight Index.
LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams.
Approximating Multidimensional Range Counts with Maximum Error Guarantees.
Attacking Black-box Recommendations via Copying Cross-domain User Profiles.
Knowledge-Aware Group Representation Learning for Group Recommendation.
Variational Self-attention Network for Sequential Recommendation.
Reliable Recommendation with Review-level Explanations.
Group-Buying Recommendation for Social E-Commerce.
Multi-Facet Recommender Networks with Spherical Optimization.
Trillion-scale Graph Processing Simulation based on Top-Down Graph Upscaling.
Privacy Preserving Strong Simulation Queries on Large Graphs.
Influence Maximization Based on Dynamic Personal Perception in Knowledge Graph.
Explaining Missing Data in Graphs: A Constraint-based Approach.
A+ Indexes: Tunable and Space-Efficient Adjacency Lists in Graph Database Management Systems.
FAST: FPGA-based Subgraph Matching on Massive Graphs.
Samya: A Geo-Distributed Data System for High Contention Aggregate Data.
Efficient Control Flow in Dataflow Systems: When Ease-of-Use Meets High Performance.
Lock Violation for Fault-tolerant Distributed Database System.
WipDB: A Write-in-place Key-value Store that Mimics Bucket Sort.
RCC: Resilient Concurrent Consensus for High-Throughput Secure Transaction Processing.
Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+.
Scalable Graph Isomorphism: Combining Pairwise Color Refinement and Backtracking via Compressed Candidate Space.
An Efficient Algorithm for the Anchored k-Core Budget Minimization Problem.
Finding a Summary for All Maximal Cliques.
DPTL+: Efficient Parallel Triangle Listing on Batch-Dynamic Graphs.
PEFP: Efficient k-hop Constrained s-t Simple Path Enumeration on FPGA.
A Framework to Quantify Approximate Simulation on Graph Data.
Automating Entity Matching Model Development.
Structured Object Matching across Web Page Revisions.
Cost-effective Variational Active Entity Resolution.
KDDLog: Performance and Scalability in Knowledge Discovery by Declarative Queries with Aggregates.
End-to-end Task Based Parallelization for Entity Resolution on Dynamic Data.
Learning to Characterize Matching Experts.
Spatial-Temporal Similarity for Trajectories with Location Noise and Sporadic Sampling.
On Efficient and Scalable Time-Continuous Spatial Crowdsourcing.
Data-Driven Fairness-Aware Vehicle Displacement for Large-Scale Electric Taxi Fleets.
LHist: Towards Learning Multi-dimensional Histogram for Massive Spatial Data.
A Distance-Based Scheme for Reducing Bandwidth in Distributed Geometric Monitoring.
Workload-aware Materialization for Efficient Variable Elimination on Bayesian Networks.
Efficient Construction of Nonlinear Models over Normalized Data.
An Efficient Approach for Cross-Silo Federated Learning to Rank.
INFOSHIELD: Generalizable Information-Theoretic Human-Trafficking Detection.
Efficient Relation-aware Scoring Function Search for Knowledge Graph Embedding.
EDGE: Entity-Diffusion Gaussian Ensemble for Interpretable Tweet Geolocation Prediction.
DisMASTD: An Efficient Distributed Multi-Aspect Streaming Tensor Decomposition.
Concept Drift Detection from Multi-Class Imbalanced Data Streams.
Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information.
FPGA for Aggregate Processing: The Good, The Bad, and The Ugly.
CruiseDB: An LSM-Tree Key-Value Store with Both Better Tail Throughput and Tail Latency.
Aria: Tolerating Skewed Workloads in Secure In-memory Key-value Stores.
NestGPU: Nested Query Processing on GPU.
Authenticated Keyword Search in Scalable Hybrid-Storage Blockchains.
Memory-Efficient Key/Foreign-Key Join Size Estimation via Multiplicity and Intersection Size.
Eclipse: Generalizing kNN and Skyline.
Continuously Bulk Loading over Range Partitioned Tables for Large Scale Historical Data.
The Logarithmic Dynamic Cuckoo Filter.
Fast Core-based Top-k Frequent Pattern Discovery in Knowledge Graphs.
Property Graph Schema Optimization for Domain-Specific Knowledge Graphs.
Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks.
A Bootstrapping Approach to Optimize Random Walk Based Statistical Estimation over Graphs.
On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner.
NewsLink: Empowering Intuitive News Search with Knowledge Graphs.
SALSA: Self-Adjusting Lean Streaming Analytics.
Single Point Incremental Fourier Transform on 2D Data Streams.
Robust Factorization of Real-world Tensor Streams with Patterns, Missing Values, and Outliers.
DISC: Density-Based Incremental Clustering by Striding over Streaming Data.
SliceNStitch: Continuous CP Decomposition of Sparse Tensor Streams.
LogLog Filter: Filtering Cold Items within a Large Range over High Speed Data Streams.
Efficiently Reclaiming Space in a Log Structured Store.
Discriminative Admission Control for Shared-everything Database under Mixed OLTP Workloads.
Predict and Write: Using K-Means Clustering to Extend the Lifetime of NVM Storage.
Programming an SSD Controller to Support Batched Writes for Variable-Size Pages.
DyCuckoo: Dynamic Hash Tables on GPUs.
The Case for In-Memory OLAP on "Wimpy" Nodes.
Durable Top-K Instant-Stamped Temporal Records with User-Specified Scoring Functions.
REPOSE: Distributed Top-k Trajectory Similarity Search with Local Reference Point Tries.
E2DTC: An End to End Deep Trajectory Clustering Framework via Self-Training.
Trajectory Simplification with Reinforcement Learning.
Leveraging Temporal and Topological Selectivities in Temporal-clique Subgraph Query Processing.
Flow Computation in Temporal Interaction Networks.
HST+: An Efficient Index for Embedding Arbitrary Metric Spaces.
Hash Adaptive Bloom Filter.
Multidimensional Adaptive & Progressive Indexes.
Less is More: De-amplifying I/Os for Key-value Stores with a Log-assisted LSM-tree.
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees.
TS-Benchmark: A Benchmark for Time Series Databases.
Noah: Neural-optimized A* Search Algorithm for Graph Edit Distance Computation.
FastSGG: Efficient Social Graph Generation Using a Degree Distribution Generation Model.
Search to aggregate neighborhood for graph neural network.
LineageBA: A Fast, Exact and Scalable Graph Generation for the Barabási-Albert Model.
Towards Efficient Motif-based Graph Partitioning: An Adaptive Sampling Approach.
UniNet: Scalable Network Representation Learning with Metropolis-Hastings Sampling.
Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech Diffusion on Twitter.
Latent Low-rank Graph Learning for Multimodal Clustering.
Odess: Speeding up Resemblance Detection for Redundancy Elimination by Fast Content-Defined Sampling.
Valentine: Evaluating Matching Techniques for Dataset Discovery.
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach.
Relational Header Discovery using Similarity Search in a Table Corpus.
Interactive Analytic DBMSs: Breaching the Scalability Wall.
CooLSM: Distributed and Cooperative Indexing Across Edge and Cloud Machines.
WedgeChain: A Trusted Edge-Cloud Store With Asynchronous (Lazy) Trust.
Spark-based Cloud Data Analytics using Multi-Objective Optimization.
Communication-efficient Decentralized Machine Learning over Heterogeneous Networks.
Efficient Federated-Learning Model Debugging.
A Learning-based Method for Computing Shortest Path Distances on Road Networks.
An Effective Joint Prediction Model for Travel Demands and Traffic Flows.
Dynamic Hub Labeling for Road Networks.
Online Route Planning over Time-Dependent Road Networks.
Constrained Route Planning over Large Multi-Modal Time-Dependent Networks.
Rebuilding City-Wide Traffic Origin Destination from Road Speed Data.
CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling.
A Human-in-the-loop Approach to Social Behavioral Targeting.
Fairness-aware Task Assignment in Spatial Crowdsourcing: Game-Theoretic Approaches.
Crowdsensing Data Trading based on Combinatorial Multi-Armed Bandit and Stackelberg Game.
Coalition-based Task Assignment in Spatial Crowdsourcing.
A Privacy-Enhanced and Personalized Safe Route Planner with Crowdsourced Data and Computation.
Modeling Citywide Crowd Flows using Attentive Convolutional LSTM.
Twine: An Embedded Trusted Runtime for WebAssembly.
Enabling Efficient Cyber Threat Hunting With Cyber Threat Intelligence.
Feature Inference Attack on Model Predictions in Vertical Federated Learning.
P3GM: Private High-Dimensional Data Release via Privacy Preserving Phased Generative Model.
Secure Dynamic Skyline Queries Using Result Materialization.
Differentially Private Publication of Multi-Party Sequential Data.
Efficient 2-Hop Labeling Maintenance in Dynamic Small-World Networks.
Peer Learning Through Targeted Dynamic Groups Formation.
Multi-attributed Community Search in Road-social Networks.
Efficient Community Search with Size Constraint.
Efficient and Effective Community Search on Large-scale Bipartite Graphs.
Manipulating Black-Box Networks for Centrality Promotion.
Capturing Semantics for Imputation with Pre-trained Language Models.
Bootstrapping Information Extraction via Conceptualization.
DBSCOUT: A Density-based Method for Scalable Outlier Detection in Very Large Datasets.
Approximate Order Dependency Discovery.
CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks.
Profiles of Schema Evolution in Free Open Source Software Projects.