Non-volatile memory (NVM) technology holds promise to replace SRAM and DRAM at various levels of the memory hierarchy. The interest in NVM is motivated by the difficulty faced in scaling DRAM beyond 22 nm and, long-term, lower cost per bit. While offering higher density and negligible static power (leakage and refresh), NVM suffers increased latency and energy per memory access. This paper dev…
In out-of-order (OoO) processors, speculative execution with high branch prediction accuracy is employed to achieve good single thread performance. In these processors the branch prediction unit tables (BPU) are accessed in parallel with the instruction cache before it is known whether a fetch group contains branch instructions. For integer applications, we find 85 percent of BPU lookups are d…
This paper proposes persistent transactional memory (PTM), a new design that adds durability to transactional memory (TM) by incorporating with the emerging non-volatile memory (NVM). PTM dynamically tracks transactional updates to cache lines to ensure the ACI (atomicity, consistency and isolation) properties during cache flushes and leverages an undo log in NVM to ensure PTM can always consis…
Memory bottleneck has always been a major cause for limiting the performance of computer systems. While in the past latency was the major concern, today, lack of bandwidth becomes a limiting factor as well, as a result of exploiting more parallelism with the growing number of cores per die, which intensifies the pressure on the memory bus. In such an environment, any additional traffic to memor…
Integrated CPU-GPU architectures with a fully addressable shared memory completely eliminate any CPU-GPU data transfer overhead. Since such architectures are relatively new, it is unclear what level of interaction between the CPU and GPU attains the best energy efficiency. Too coarse grained or larger kernels with fairly low CPU - GPU interaction could cause poor utilization of the shared resou…
Abstract—In this letter, a flexible memory simulator - NVMain 2.0, is introduced to help the community for modeling not only commodity DRAMs but also emerging memory technologies, such as die-stacked DRAM caches, non-volatile memories (e.g., STT-RAM, PCRAM, and ReRAM) including multi-level cells (MLC), and hybrid non-volatile plus DRAM memory systems. Compared to existing memory simulators, N…
Many-Accelerator (MA) systems have been introduced as a promising architectural paradigm that can boost performance and improve power of general purpose computing platforms. In this paper, we focus on the problem of resource under-utilization, i.e. Dark Silicon, in FPGA-based MA platforms. We show that except the typically expected peak power budget, on-chip memory resources form a severe under…
Switch on Event Multithreading (SoE MT, also known as coarse-grained MT and block MT) processors run multiple threads on a pipeline machine, while the pipeline switches threads on stall events (e.g., cache miss). The thread switch penalty is determined by the number of stages in the pipeline that are flushed of in-flight instructions. In this paper, Continuous Flow Multithreading (CFMT), a new …
To address the Dark Silicon problem, architects have increasingly turned to special-purpose hardware accelerators to improve the performance and energy efficiency of common computational kernels, such as encryption and compression. Unfortunately, the latency and overhead required to off-load a computation to an accelerator sometimes outweighs the potential benefits, resulting in a net decrease …
A novel method to protect a system against errors resulting from soft errors occurring in the virtual address (VA) storing structures such as translation lookaside buffers (TLB), physical register file (PRF) and the program counter (PC) is proposed in this paper. The work is otivated by showing how soft errors impact the structures that store virtual page numbers (VPN). A solution is proposed b…
Power mismatching between supply and demand has emerged as a top issue in modern datacenters that are under-provisioned or powered by intermittent power supplies. Recent proposals are primarily limited to leveraging uninterruptible power supplies (UPS) to handle power mismatching, and there fore lack the capability of efficiently handling the irregular peak power mismatches. In this paper we p…
Web browsing on mobile devices is undoubtedly the future. However, with the increasing complexity of webpages, the mobile device’s computation capability and energy consumption become major pitfalls for a satisfactory user experience. In this paper, we propose a mechanism to effectively leverage processor frequency scaling in order to balance the performance and energy consumption of mobile w…
JavaScript is a sequential programming language, and Thread-Level Speculation has been proposed to dynamically extract parallelism in order to take advantage of parallel hardware. In previous work, we have showed significant speed-ups with a simple on/off speculation heuristic. In this paper, we propose and evaluate three heuristics for dynamically adapt the speculation: a 2-bit heuristic, an e…
Bitwise operations are an important component of modern day programming, and are used in a variety of applications such as databases. In this work, we propose a new and simple mechanism to implement bulk bitwise AND and OR operations in DRAM, which is faster and more efficient than existing mechanisms. Our mechanism exploits existing DRAM operation to perform a bitwise AND/OR of two DRAM rows c…
We study the tradeoffs between Many-Core machines like Intel’s Larrabee and Many-Thread machines like Nvidia and AMD GPGPUs. We define a unified model describing a superposition of the two architectures, and use it to identify operation zones for which each machine is more suitable. Moreover, we identify an intermediate zone in which both machines deliver inferior performance. We study the sh…
gem5-gpu is a new simulator that models tightly integrated CPU-GPU systems. It builds on gem5, a modular full-system CPU simulator, and GPGPUSim, a detailed GPGPU simulator. gem5-gpu routes most memory accesses through Ruby, which is a highly configurable memory system in gem5. By doing this, it is able to simulate many system configurations, ranging from a system with coherent caches and a si…
Over the past few years, there has been vast growth in the area of the web browser as an applications platform. One example of this trend is Google’s Native Client (NaCl) platform, which is a software-fault isolation mechanism that allows the running of native x86 or ARM code on the browser. One of the security mechanisms employed by NaCl is that all branches must jump to the start of a valid…
Consider a workload comprising a consecutive sequence of program execution segments, where each segment can either be executed on general purpose processor or offloaded to a hardware accelerator. An analytical optimization framework based on MultiAmdhal framework and Lagrange multipliers, for selecting the optimal set of accelerators and for allocating resources among them under constrained are…
Memory access times are the primary bottleneck for many applications today. This “memory wall” is due to the performance disparity between processor cores and main memory. To address the performance gap, we propose the use of custom memory subsystems tailored to the application rather than attempting to optimize the application for a fixed memory subsystem. Custom subsystems can take advant…
With the trend towards increasing number of cores in a multicore processors, the on-chip network that connects the cores needs to scale efciently. In this work, we propose the use of high-radix networks in on-chip networks and describe how the attened buttery topology can be mapped to on-chip networks. By using high-radix routers to reduce the diameter of the network, the attened buttery o…
The Roofline model graphically represents the attainable upper bound performance of a computer architecture. This paper analyzes the original Roofline model and proposes a novel approach to provide a more insightful performance modeling of modern architectures by introducing cache awareness, thus significantly improving the guidelines for application optimization. The proposed model was experim…
DRAM scaling has been the prime driver of increasing capacity of main memory systems. Unfortunately, lower technology nodes worsen the cell reliability as it increases the coupling between adjacent DRAM cells, thereby exacerbating different failure modes. This paper investigates the reliability problem due to Row Hammering, whereby frequent activations of a given row can cause data loss for its…
We present a method for accelerating server applications using a hybrid CPU+FPGA architecture and demonstrate its advantages by accelerating Memcached, a distributed key-value system. The accelerator, implemented on the FPGA fabric, processes request packets directly from the network, avoiding the CPU in most cases. The accelerator is created by profiling the application to determine the most c…
Accelerators integrated on-die with General-Purpose CPUs (GP-CPUs) can yield significant performance and power improvements. Their extensive use, however, is ultimately limited by their area overhead; due to their high degree of specialization, the opportunity cost of investing die real estate on accelerators can become prohibitive, especially for general-purpose architectures. In this paper we…
Network-on-Chip (NoC) paradigm is rapidly evolving into an efficient interconnection network to handle the strict communication requirements between the increasing number of cores on a single chip. Diminishing transistor size is making the NoC increasingly vulnerable to both hard faults and soft errors. This paper concentrates on soft errors in NoCs. A soft error in an NoC router results in sig…
In-wheel direct drive motors are placed inside drive wheels of electric vehicles and have to deliver large torque without any mechanical gear. Most often these machines are of synchronous type with permanent magnets and large pole-pair number. As a consequence of high peak torque, both cogging torque and torque ripple are increased if no measures are taken to decrease them. The vibrations due t…
A developed control strategy for autonomous wind generation unit is presented. Optimization techniques based on bacteria foraging algorithm and genetic algorithms are used to tune the proposed control parameters. The proposed system mainly consists of induction generator driven by wind turbine, synchronous machine, consumer load, discrete dump load, and gate turn-off thyristor-based power elec…
Permanent magnet (PM) synchronous motors with starting cage, already developed in the 1960s for variable-speed textile drives, become increasingly important as energy-efficient PM line-start motors. Differing from the standard design with integrated magnets, a rotor design with surface-mounted magnets is presented. In the planning and design phase of drives with PM line-start motors, the simul…
1. Growth rates of European silver fir (Abies alba Mill.) rapidly increased in the last century. At the same time, ring widths declined at the species southern distribution limits in the Mediterranean. Such diverse growth trends and responses have largely been attributed to regional climate conditions, but this was prior to considering the species’ post-glacial phylogeny. 2. A dendrochr…
Colon cancer is one of the most commonly diagnosedcancers in the United States, and an ncreasing number of survivors has underscored the need for improved colon cancer surveillance care uality. Post-treatment surveillance includes follow-up care visits and tests as well as psychosocial support and lifestyle counseling. This formative study explored the individual, interpersonal, and organizati…
Lung cancer chemotherapy decisions in patients ≥70 years old are complex. To assess the modes of communication with older lung cancer patients, we prospectively collected data.We assessed patients’ level of knowledge about diagnosis and prognosis. Eighty-three patients diagnosed with lung cancer from January 2006 to February 2008 were recruited from a single center. Logistic regression and…
Community-based participatory (CBP) strategies are considered important to efforts to eliminate disparities.This paper outlines how the Program for the Elimination of Cancer Disparities PECaD) uses CBP strategies as a part of a long-term cancer education, prevention, and control strategy in an urban community. Community partnerships have proved to be vital resources to inform ECaD’s agenda a…
Traditionally, postdoctoral training programs largely have focused efforts within a single Discipline or closely related fields. Yet, addressing the complex questions around cancer prevention and control increasingly requires the ability to work and communicate across disciplines in order to gain a perspective that encompasses the multilevel and multifaceted issues involved with this public he…
Despite the increasing number of medical articles being published from the Middle East, clinical research is still lagging behind compared to other regions. Enrolling participants into clinical trials presents an important challenge. We wanted to explore the perception, knowledge, and willingness of cancer patients to participate in oncology clinical trials and to recommend strategies to overco…
The objective of this study is to identify levels of risk awareness for breast, lung and ervical cancer, in a UK student population. A sample of male (N=62) and female (N=58) university tudents, mean age 21.62 years completed a questionnaire identifying which risk factors they knew for each cancer. Analysis of variance was used to compare differences in risk awareness across gender and cancer t…
The American Society of Hematology developed the Clinical Research Training Institute (CRTI) to address the lack of training in patient-oriented research among hematologists. As the program continues, we need to consider metrics for measuring the benefits of such a training program. This article addresses the benefits of clinical research training programs. The fundamental and key components ar…
Comprehension of randomization is a vital, but understudied, component of informed consent to participate in cancer randomized clinical trials (RCTs). This study examines patient comprehension of the randomization process as well as sources of ongoing uncertainty that may inhibit a patient’s ability to provide informed consent to participate in RCTs. Cancer patients living in rural Appalachia…
As of January, 2012, an estimated 13.7 million persons are living as cancer survivors. This population is expected to grow to nearly 18 million by 2022. While their treatment may be considered successful, many cancer survivors experience long-term physical, emotional, and psychosocial late effects of treatment. Our focus was on community based cancer care—both rural and urban—as almost 90 %…
E-cigarette use has increased rapidly over the past decade. There is growing concern about e-cigarette use and advertising given limited regulation of these products. This cross-sectional study reports on data collected at baseline from hospitalized cigarette smokers (N=944) cruited in monthly cohorts between December 2012 and September 2013. Participants were queried regarding e-cigarette aw…
Patient navigation (PN) may reduce cancer health disparities. Few studies have investigated the effects of PN on patient-reported satisfaction with care or assessed patients’satisfaction with naviGators. The objectives of this study are to test the effects of PN on patient satisfaction with cancer care,assess patients’ satisfaction with navigators, and examine the impact of barriers to care…
Skin cancer is the most common cancer in the USA and is increasing in children and young adults. Adolescents are an important target population for sun-safety interventions with ultraviolet radiation as the strongest risk factor for developing skin cancer. Schools are an ideal setting to intervene with adolescents. A novel Arizona skin cancer prevention inclass education-activity program, Proje…
The barbershop is a promising setting where African-American men might receive information and education about prostate cancer. In this study, we assessed the feasibility of engaging rural barbershops as venues for barbers to deliver a prostate cancer education intervention to increase informed decision-making for prostate cancer screening among customers. Twelve barbershops were recruited fro…
African Americans (AA) are more likely to develop and die from cancer than any other racial or ethnic group. The aims of this research were to (1) evaluate current education materials being implemented in a community-based prostate cancer education program for AA communities, (2) refine materials based on findings from aim 1, (3) share updated materials with participants from aim 1 for additio…
Health issues are a concern in Alabama’s Black Belt region, which runs across the southwestern part of the state and includes some of the poorest counties in the USA. As part of a Center for Disease Control (CDC)-sponsored study, we collected data covering several cancer (e.g., prostate, breast, skin) and other health-related indicators (e.g., stress,insurance, stroke, heart isease) from 647…
To better prepare medical students to care for patients in today’s changing health-care environment as they transition to continuing their education as residents, many US medical schools have been reviewing and modifying their curricula and are considering integration of newer adult learning techniques, including team-based learning, flipped classrooms, and other active learning approaches (A…
Although the availability of direct-to-consumer (DTC) genetic testing has increased in recent years, the general public’s awareness of this testing is not well understood. This study examined levels of public awareness of DTC genetic testing, sources of information about testing, and psy-chosocial factors associated with awareness of testing in the USA. Data were obtained from the nationally …
In 1996, the Toronto Sunnybrook Regional Cancer Centre developed the Rapid Response Radiotherapy Program (RRRP). The objective of this clinic is to consult, simulate, plan, and treat patients with palliative radiotherapy on the same day. In 2004, the RRRP initiated a program to provide clinical and research experience to undergraduate students interested in health sciences. The purpose of this…
Mobile phone-based interventions can play a significant hole in decreasing health disparities by enhancing population and individual health. The purpose of this study was to explore health ministry leaders (HMLs) and congregation members’ communication technology usage and to assess the acceptability of mobile technology for delivery of cancer information. Six focus groups were conducted in …