Publications

Publications

2020

Bingzhe Li, Nae Young Song, Ou Li, David Du. "Can We Store the Whole World's Data in DNA Storage?", HotStorage'2020.

Fenggang Wu, Ming-Hong Yang, Baoquan Zhang, David Du. "AC-Key: Adaptive Caching for LSM-based Key-Value Stores", ATC'2020

Fenggang Wu, Bingzhe Li, Baoquan Zhang, Zhichao Cao, Jim Diehl, Hao Wen, and David H.C. Du. "TrackLace: Data Management for Interlaced Magnetic Recording", Transactions on Computers

Baoquan Zhang, Ming-Hong Yang, Xuchao Xie. "Idler : I/O Workload Controlling for Better Responsiveness on Host-Aware Shingled Magnetic Recording Drives", Transactions on Computers

Zhichao Cao, Siying Dong, Sagar Vemuri, David H.C. Du. "Characterizing, Modeling, and Benchmarking RocksDB Key-Value Workloads at Facebook", FAST’2020

2019

Bingzhe Li, Chunhua Deng, Jinfeng Yang, David Lilja, Bo Yuan, and David Du. "HAML-SSD: A hardware-accelerated hotness-aware machine learning-based SSD management." ICCAD 2019

Hao Wen et al. "Distributing Deep Neural Networks with Containerized Partitions at the Edge", HotEdge'19.

Fenggang Wu et al. "ZoneAlloy: Elastic Data and Space Management for Hybrid SMR Drives", HotStorage'19.

Zhichao Cao et al. "TDDFS: A Tier-Aware Data Deduplication-Based File System", Transactions on Storage.

Hao Wen et al. "Guaranteed Bang for the Buck: Modeling VDI Applications to Identify Storage Requirements", Transactions on Cloud Computing.

Zhichao Cao, Shiyong Liu, Fenggang Wu, Guohua Wang, Bingzhe Li, and David HC Du. "Sliding Look-Back Window Assisted Data Chunk Rewriting for Improving Deduplication Restore Performance." In 17th {USENIX} Conference on File and Storage Technologies ({FAST} 19), pp. 129-142. 2019.

 

2018

Yaobin Qin and David J. Lilja, "Enhancing the Ensemble of Exemplar-SVMs for Binary Classification Using Concurrent Selection and EnsembleLearning", IEEE Ubiquitous Computing, Electronics, and Mobile Communications Conference (UEMCON), November, 2018.

Yaobin Qin, Brandon Hoffmann, and David J. Lilja, "HyperProtect:Enhancing the Performance of a Dynamic Backup System Using Intelligent Scheduling", International Performance, Computing, and Communications Conference (IPCCC), November, 2018.

Bingzhe Li, Meng Yang, Soheil Mohajer, Weikang Qian, and David J.Lilja, "Tier-code: An XOR-based RAID-6 Code with Improved Write and Degraded-mode Read Performance", IEEE International Conference on Networking, Architecture, and Storage (NAS), October, 2018.

Jinfeng Yang and David J. Lilja, "Reducing Relational Database Performance Bottlenecks Using 3D Crosspoint Storage Technology", IEEE International Conference On Big Data Science And Engineering (BigDataSE), July, 2018.

Xiongzi Ge, Xuchao Xie, David H.C. Du, Pradeep Ganesan and Dennis Hahn, "ChewAnalyzer: Workload-Aware Data Management Across Differentiated Storage Pools", In Proceedings of the 26th IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems MASCOTS'18.

Hao Wen, Zhizhao Cao, Yang Zhang, Ziqi Fan, Doug Voigt, David H.C. Du, "JoiNS: Meeting latency SLO with Integrated Control for Networked Storage", In Proceedings of the 26th IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems MASCOTS'18.

Fenggang Wu, Baoquan Zhang, Zhichao Cao, Hao Wen, Bingzhe Li, Jim Diehl, Guohua Wang, David H.C. Du, "Data Management Design for Interlaced Magnetic Recording", HotStorage'18.

Baoquan Zhang, Raghunath Raja Chandrasekar, Alireza Haghdoost, Lance Evans, David H.C. Du, "Improving Data Integrity in Linux Software RAID with Protection Information (T10-PI)", In Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'18 Workshop), pp.609 - 615, 2018.

Ziqi Fan, Fenggang Wu, Jim Diehl, David HC Du, and Doug Voigt. "CDBB: an NVRAM-based burst buffer coordination system for parallel file systems." In Proceedings of the High Performance Computing Symposium, p. 1. Society for Computer Simulation International, 2018.

Hebatalla Eldakiky, David Du, "Key-Value Pairs Allocation Strategy for Kinetic Drives", BigData Service 2018.

Zhichao Cao, Hao Wen, Fenggang Wu, and David HC Du. "ALACC: Accelerating Restore Performance of Data Deduplication Systems Using Adaptive Look-Ahead Window Assisted Chunk Caching.", FAST'18

Baoquan Zhang, Fenggang Wu, Zhichao Cao and David H.C. Du, "NV-LSM: Log-Structured Merging on Hybrid Volatile/Non-Volatile Memory Systems", FAST'18 WiP Demo

2017

Alireza Haghdoost, Weiping He, Jerry Fredin, and David H. C. Du. 2017. "hfplayer: Scalable Replay for Intensive Block I/O Workloads", ACM Trans. Storage 13, 4, Article 39 (December 2017), 25 pages.

F. Wu, Z. Fan, M.-C. Yang, B. Zhang, X. Ge, D.H.C. Du, "Performance Evaluation of Host Aware Shingled Magnetic Recording (HA-SMR) Drives", IEEE Transactions on Computers, 2017. [PDF]

Manas Minglani, Jim Diehl, Xiang Cao, Bingzhe Li, Dongchul Park,David J. Lilja, and David H.C. Du, "Kinetic Action: Performance Analysis of Integrated Key-Value Storage Devices vs. LevelDB Servers", International Conference on Parallel Architectures and Distributed Systems (ICPADS), December, 2017.

Bingzhe Li, David J. Lilja, Farnaz Toussi, Clark Anderson, and David Du, "TraceRAR: An I/O Performance Evaluation Tool for Replaying, Analyzing, and Regenerating Traces", IEEE International Conference on Networking, Architecture, and Storage (NAS), August, 2017.

Park, D., Fan, Z., Nam, Y. J., & Du, D. H. (2017). A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage. Journal of Computer Science and Technology, 32(1), 26-40.

A. Haghdoost, W. He, J. Fredin, D. Du, "On the Accuracy and Scalability of Intensive I/O Workload Replay", FAST 2017

W. He, D. Du, "SMaRT: An Approach to Shingled Magnetic Recording Translation", FAST 2017

Fan, Ziqi, Fenggang Wu, Dongchul Park, Jim Diehl, Doug Voigt, and David HC Du. "Hibachi: A cooperative hybrid cache with nvram and dram for storage arrays." In Proc. of IEEE Conference on Mass Storage Systems and Technologies (MSST). 2017.

X. Cao, M. Minglani, D. Du, "Data Allocation of Large-scale Key-Value Store System using Kinetic Drives" IEEE Big Data Service 2017

 

2016

Dongchul, P. A. R. K., Biplob Debnath, and HC DU David. "A Dynamic Switching Flash Translation Layer Based on Page-Level Mapping." IEICE TRANSACTIONS on Information and Systems 99.6 (2016): 1502-1511. [PDF]

Liu, Y., Ge, X., Du, D. H. C., & Huang, X. (2016). Par-BF: a parallel partitioned Bloom filter for dynamic data sets. The International Journal of High Performance Computing Applications, 30(3), 259-275. [PDF]

Wu, F., Yang, M. C., Fan, Z., Zhang, B., Ge, X., & Du, D. H. (2016, June). Evaluating host aware SMR drives. In 8th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 16). USENIX Association. [PDF]

Wen, H., Du, D. H., Shetti, M., Voigt, D., & Li, S. (2016, August). Guaranteed Bang for the Buck: Modeling VDI Applications with Guaranteed Quality of Service. In Parallel Processing (ICPP), 2016 45th International Conference on (pp. 426-431). IEEE. [PDF]

Ge, X., Liu, Y., Lu, C., Diehl, J., Du, D. H., Zhang, L., & Chen, J. (2016, May). VNRE: Flexible and Efficient Acceleration for Network Redundancy Elimination. In Parallel and Distributed Processing Symposium, 2016 IEEE International (pp. 83-92). IEEE. [PDF]

Haghdoost, A., & Du, D. H. (2016, May). OWBP: Flash-Aware Offline Write Buffer Policy. In Parallel and Distributed Processing Symposium Workshops, 2016 IEEE International (pp. 1752-1758). IEEE. [PDF]

Cao, X., Panchputre, K. K., & Du, D. H. C. (2016, April). Accelerating data shuffling in MapReduce framework with a scale-up NUMA computing architecture. In Proceedings of the 24th High Performance Computing Symposium (p. 17). Society for Computer Simulation International. [PDF]

Li, B., Minglani, M., & Lilja, D. (2016, August). Ps-Code: A New Code for Improved Degraded Mode Read and Write Performance of RAID Systems. In Networking, Architecture and Storage (NAS), 2016 IEEE International Conference on (pp. 1-10). IEEE. [PDF]

2015

Liu, Y., Ge, X., Huang, X., & Du, D. H. (2015). MOLAR: A Cost-Efficient, High-Performance SSD-Based Hybrid Storage Cache. The Computer Journal, bxu156.

He, W., Du, D. H., & Narasimhamurthy, S. B. (2015, May). PIONEER: A solution to parallel I/O workload characterization and generation. In Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on (pp. 111-120). IEEE.

Cao, X., Dong, Y., & Du, D. H. C. (2015, August). Synchronized Multi-Hop Scheduling for Real-Time Traffic on SDNs. In Computer Communication and Networks (ICCCN), 2015 24th International Conference on (pp. 1-8). IEEE.

Mandagere, N., Routray, R., Song, Y., & Du, D. (2015, August). Cloud object storage based Continuous Data Protection (cCDP). In Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on (pp. 244-254). IEEE.

Liu, Y., Ge, X., Du, D. H., & Huang, X. (2016). SSD as a Cloud Cache? Carefully Design about It. ëŠÄXŒW¿¯, 27(1), 26-37.

Minglani, M., Nagarajan, A., Deshapande, S., Everson, L., & Lilja, D. J. (2015, August). Design space exploration for efficient computing in Solid State drives with the Storage Processing Unit. In Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on (pp. 87-94). IEEE.

Z. Fan, Alireza Haghdoost, David H.C. Du and Doug Voigt, "I/O-Cache: A Non-Volatile Memory Based Buffer Cache Policy to Improve Storage Performance," In IEEE 23nd International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS'15), full paper, October 2015. [PDF]

2014

Ziqi Fan, David H.C. Du and Doug Voigt, "H-ARC: A Non-Volatile Memory Based Cache Policy for Solid State Drives," 30th IEEE Conference on Massive Storage Systems and Technologies (MSST'14), full paper, June 2014. [PDF]

M. Murugan, Krishna Kant, A. Raghavan and David Du (2014). FlexStore: A Software Defined Energy Adaptive Distributed Storage Framework. IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems. Paris, France. [PDF]

M. Murugan, K. Kant, A. Raghavan and D. Du (2014). Group Based Energy Adaption for Storage Systems. 9th International Workshop on Feedback Computing. Philadelphiia, PA. [PDF]

Weiping He and David H.C. Du, "Novel Address Mappings for Shingled Write Disks," 6th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage'14), June 2014. [PDF]

Yi Liu, Xiongzi Ge, Xiaoxia Huang, David Du, MOLAR: A High Performance, Cost-Efficient SSD-based Hybrid Storage Cache, In the Computer Journal 2014

Xiongzi Ge, Zhichao Cao, Pradeep Ganesan, Dennis Hahn, David Du, OneStore: Integrating Local and Cloud Storage with Access Hints, In the Poster Session of 2014 ACM Symposium on Cloud Computing. November 3-5, 2014, Seattle, WA 

Yi Liu, Xiongzi Ge, Xiaoxia Huang, David Du, Par-BF: a Parallel Partitioned Bloom Filter for Dynamic Data Sets, Proceedings of workshop on Data-Intensive Scalable Computing Systems (DISCS), in conjunction with SC'14. November 16th, 2014, New Orleans LA, USA 

Xiongzi Ge, Yi Liu, Liang Zhang, Hongguang Guan, Jian Chen, Yuping Zhao,Xinyu Hu, David Du OpenANFV: Accelerating Network Function Virtualization with a Consolidated Framework in OpenStack, A demo in ACM SIGCOMM'14, August 2014, Chicago, Illinois 

2013

Y.H. Chang, P.C. Huang, P.H. Hsu, L.J. Lee, T.W. Kuo, David Du, Reliability Enhancement of Flash-Memory Storage Systems: An Efficient Version-Based Design, IEEE Transactions on Computers, Vol. 62, No. 12, pp. 2503-2515, December 2013

Peng Li, Kevin Gomez, and David J. Lilja, Exploiting Free Silicon for Energy-Efficient Computing Directly in NAND Flash-based Solid-State Storage Systems, IEEE High Performance Extreme Computing Conference (HPEC), September, 2013.(Best Paper session)

Peng Li, Kevin Gomez, and David J. Lilja, Using Solid-State Drives as Storage Processing Units, Non-Volatile Memories Workshop (NVMW), poster presentation, March, 2013.

Peng Li, James Hughes, John Plocher, and David J. Lilja, SATA Port Multipliers Considered Harmful, USENIX Conference on File and Storage Technologies (FAST), poster presentation, February, 2013.

2012

Zhe Zhang, Weijun Xiao, Nohhyun Park and David J. Lilja, "Memory Module-level Testing and Error Behaviors for Phase Change Memory," 30th IEEE International Conference on Computer Design(ICCD), October 2012. [PDF]

Nohhyun Park, Irfan Ahmed and David J. Lilja. "Romano: Right Workload for Right Storage," Proceedings of the Third ACM Symposium on Cloud Computing (SoCC '12), October 2012. [PDF]

Young Jin Nam, Dongchul Park and David H.C. Du,"Assuring Demanded Read Performance of Data Deduplication Storage with Backup Datasets," 20th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), August 2012.

Chung-I Lin, Dongchul Park, Weiping He and David H.C. Du,"H-SWD: Incorporating Hot Data Identification into Shingled Write Disks," 20th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), August 2012. [PDF]

Muthukumar Murugan and David H.C. Du,"Hybrot: Towards Improved performance in hybrid SLC-MLC devices," 20th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS) (Short Paper), August 2012. [PDF]

Guanlin Lu, Young Jin Nam, and David H.C. Du, "BloomStore: Bloom-Filter based Memory-Efficient Key-Value Store for Indexing of Data Deduplication on Flash," 28th IEEE Conference on Massive Data Storage (MSST/SNAPI), April 2012.

Wejun Xiao, Xiaoqiang Lei, Ruixuan Li, Nohhyun Park, and David J. Lilja, "PASS: A Hybrid Storage System for Performance-Synchronization Tradeoffs Using SSDs," 10th IEEE International Symposium on Parallel and Distributed Processing with Application (ISPA-12), July 2012.

Weijun Xiao, Peng Li, and David J. Lilja, "Comparing the Performance of Stochastic Simulation on GPUs and OpenMP," International Journal of Computational Science and Engineering (IJCSE), 2012.

Peng Li, Weijun Xiao, and David J. Lilja, "GPU-based Simulation for Stochastic Computing," The 2nd International Workshop on GPUs and Scientific Applications (GPUScA 2011), October 2011.

Dongchul Park, Young Jin Nam and David H.C. Du. "A Look-ahead Read Cache: Improving Read Performance of Deduplication Storage for Backup Applications" (submitted for review).

K. Kant, Muthukumar Murugan, David H.C. Du "Enhancing Datacenter Sustainability through Energy Adaptive computing," ACM journal on Emerging Technologies in Computing, April 2012.

Muthukumar Murugan, David H.C. Du, K. Kant "Energy Adaptation for Multi-tiered Datacenter Applications," Intel Technology Journal on Energy and Sustainability, 2012. [PDF]

Muthukumar Murugan, David H.C. Du, K. Kant "On the Interconnect Energy Efficiency of High End Computing Systems," Special issue of Sustainable Computing: Informatics and Systems (SUSCOM) on Green High Performance Computing, Elsevier Press, 2012. [PDF]

Youngjin Nam, Dongchul Park and David H.C. Du, "Virtual USB Drive - A Key Component for Smart Home Storage Architecture," IEEE International Conference on Consumer Electronics (ICCE), January 2012.

Dongchul Park, Biplob Debnath, Youngjin Nam, David H.C. Du, Youngkyun Kim and Youngchul Kim, "HotDataTrap: A Sampling-based Hot Data Identification Scheme for Flash Memory," ACM Symposium on Applied Computing (SAC), March 2012.

2011

Youngjin Nam, Guanlin Lu, Nohhyun Park, Weijun Xiao and David H.C. Du, "Chunk Fragmentation Level: An Effective Indicator for Read Performance Degradation in Deduplication Storage," IEEE International Symposium of Advances on High Performance Computing and Networking (HPCC/AHPCN), September 2011.

Joseph L. Naps, Mohamed F. Mokbel, and David H. C. Du. "Pantheon: Exascale File System Search for Scientific Computing" (Short Paper). In Proceedings of the International Conference on Scientific and Statistical Database Management (SSDBM), Portland, OR, July 2011.

Pei Han Hsu, Yuan-Hao Chang, Po-Chung Huang, Tei-Wei Kuo and David H.C. Du, "A Version-based Strategy for Reliability Enhancement of Flash File Systems", IEEE Design Automation Conference, 2011

Xiongzi Ge, Feng Dan and David Du, "DiscPOP: Power-Aware Buffer Management for Disk Accesses", (short paper) Proceedings of the Second Green Computing Conference, Orlando, Florida, July 2011

Youngjin Nam, Guanlin Lu and David H.C. Du, "Reliability-Aware Deduplication Storage: Assuring Chunk Reliability and Chunk Loss Severity," The First International Workshop on Energy Consumption and Reliability of Storage Systems (IGCC/ERSS), July 2011

Krishna Kant, Muthukumar Murugan and David H.C. Du "Willow: A Control System for Energy and Thermal Adaptive Computing", in Proceedings of IEEE International Parallel and Distributed Processing Symposium(IPDPS), May 2011

Dongchul Park, Biplob Debnath, and David H.C. Du, "A Workload-Aware Adaptive Hybrid Flash Translation Layer with an Efficient Caching Strategy," In Proceedings of the 19th IEEE International Symposium on Modeling, Analysis and Simulations of Computer and Telecommunication Systems (MASCOTS 2011), Singapore, July 2011.

Dongchul Park and David H.C. Du, "Hot Data Identification for Flash-based Storage Systems Using Multiple Bloom Filters," 27th IEEE Conference on Massive Storage Systems and Technologies (MSST), May 2011.

Muthukumar Murugan and David H.C. Du, "Rejuvenator: A Static Wear Leveling Algorithm for NAND Flash Memory, " 27th IEEE Conference on Massive Storage Systems and Technologies (MSST), May 2011.

Biplob Debnath, Srinivasan Krishnan, Weijun Xiao, David J.Lilja and David H.C.Du, "Sampling-based Garbage Collection Metadata Management for Flash-based Storage (Short Paper)," 27th IEEE Conference on Massive Storage Systems and Technologies (MSST), May 2011.

Nohhyun Park, Weijun Xiao, Kyubaik Choi and David J. Lilja? "A Statistical Evaluation of the Impact of Parameter Selection on Storage System Benchmarks?quot; - 7th IEEE International Workshop on Storage Network Architecture and Parallel I/O (SNAPI ) May 2011. 

Guanlin Lu, Biplob Debnath and David H.C. Du, "A Forest-structured Bloom Filter with Flash Memory (Short Paper)," 27th IEEE Conference on Massive Storage Systems and Technologies (MSST), May 2011.

Past Publications before 2011

Dongchul Park, Biplob Debnath and David H.C. Du, "CFTL: A Convertible Flash Translation Layer with Consideration of Data Access Patterns (Short Paper)," SIGMETRICS 2010

Biplob Debnath, Mohamed F. Mokbel, David J. Lilja and David H.C. Du, "Deferred Updates for Flash Based Storage (Short Paper)," 26th IEEE Conference on Massive Storage Systems and Technologies (MSST), May 2010.

Guanlin Lu, Yu Jin and David H.C. Du, "Frequency Based Chunking Algorithm for Data Deduplication," in the 18th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), August 2010.

Aaram Yun, Chunhui Shi, and Yongdae Kim, "On Protecting Integrity and Confidentiality of Cryptographic File System for Outsourced Storage," The ACM Cloud Computing Security Workshop (CCSW), 2009.

Biplob Debnath, Sunil Subramanya, David H.C. Du, and David J. Lilja, "Large Block CLOCK (LB-CLOCK): A Write Caching Algorithm for Solid State Disks," accepted for publication in the MASCOTS 2009.

Chuanyi Liu, Yingping Lu, Chunhui Shi, Guanlin Lu, David H.C. Du, and Dong-Sheng Wang, "ADMAD: Application-Driven Metadata Aware De-duplication Archival Storage System," SNAPI '08, 5th IEEE International Workshop on Storage Network Architecture and Parallel I/Os.

Biplob K. Debnath, Mohamed F. Mokbel, and David J. Lilja, "Exploiting the Impact of Database System Configuration Parameters: A Design of Experiments Approach," Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, Special Issue on Testing and Tuning of Database Systems, Vol. 31, No. 1, March, 2008, pp. 3–10. 

Biplob K. Debnath, David J. Lilja, and Mohamed Mokbel, "SARD: A Statistical Approach for Ranking Database Tuning Parameters," International Workshop on Self-Managing Database Systems (SMDB), IEEE International Conference on Data Engineering (ICDE), April, 2008.