• 【胎儿和母体QRS复合体重合对胎儿心率可靠性的影响。】 复制标题 收藏 收藏
    DOI:10.1007/s11517-006-0054-0 复制DOI
    作者列表:Matonia A,Jezewski J,Kupka T,Horoba K,Wrobel J,Gacek A
    BACKGROUND & AIMS: :Bioelectrical fetal heart activity being recorded from maternal abdominal surface contains more information than mechanical heart activity measurement based on the Doppler ultrasound signals. However, it requires extraction of fetal electrocardiogram from abdominal signals where the maternal electrocardiogram is dominant. The simplest technique for maternal component suppression is a blanking procedure, which relies upon the replacement of maternal QRS complexes by isoline values. Although, in case of coincidence of fetal and maternal QRS complexes, it causes a loss of information on fetal heart activity. Its influence on determination of fetal heart rate and the variability analysis depends on the sensitivity of the heart-beat detector used. The sensitivity is defined as an ability to detect the incomplete fetal QRS complex. The aim of this work was to evaluate the influence of the maternal electrocardiogram suppression method used on the reliability of FHR signal being calculated.
    背景与目标: : 从母体腹部表面记录的生物电胎儿心脏活动比基于多普勒超声信号的机械心脏活动测量包含更多信息。但是,它需要从母体心电图占主导地位的腹部信号中提取胎儿心电图。抑制母体成分的最简单技术是消隐程序,该程序依赖于用等值线值代替母体QRS复合物。尽管在胎儿和母体QRS复合物重合的情况下,它会导致有关胎儿心脏活动的信息丢失。它对确定胎儿心率和变异性分析的影响取决于所使用的心跳检测器的灵敏度。敏感性定义为检测不完整胎儿QRS复合物的能力。这项工作的目的是评估所使用的母体心电图抑制方法对所计算的FHR信号可靠性的影响。
  • 【PedsQL脑肿瘤模块: 初始信度和效度。】 复制标题 收藏 收藏
    DOI:10.1002/pbc.21026 复制DOI
    作者列表:Palmer SN,Meeske KA,Katz ER,Burwinkle TM,Varni JW
    BACKGROUND & AIMS: BACKGROUND:Brain tumors (BT) are second only to acute lymphoblastic leukemia as the most prevalent form of pediatric cancer, with BT 5-year survival rates approaching 70%. With increased survival, quality of life has emerged as an essential health outcome. This investigation examines the internal consistency reliability and construct validity of the Pediatric Quality of Life Inventory (PedsQL) Brain Tumor Module. METHODS:The PedsQL 4.0 Generic Core Scales, PedsQL Multidimensional Fatigue Scale, and PedsQL Brain Tumor Module were administered to 99 families. The average age of the 56 boys and 43 girls was 9.76 years (range=2-18 years). The sample included children with tumors located in the posterior fossa/brainstem (N=62, 62.6%), supratentorial (N=15, 15.2%), and midline (N=22, 22.2%). Children were on treatment (N=46, 46.5%), off treatment<12 months (N=19, 19.2%), or off treatment>12 months/long-term survivor (N=34, 34.3%). Treatment included radiation (N=61, 61.6%), surgery (N=83, 83.8%), chemotherapy (N=87, 87.9%), and bone marrow transplant (N=5, 5.1%). RESULTS:Internal consistency reliability was demonstrated for the 24-item PedsQL Brain Tumor Module (average alpha=0.78-0.92, parent proxy-report, n=99; average alpha=0.76-0.87, child self-report, n=51). Construct validity for the PedsQL Brain Tumor Module was supported through an analysis of the intercorrelations with the Generic Core Scales and Fatigue Scale. CONCLUSIONS:The findings provide support for the measurement properties of the PedsQL Brain Tumor Module.
    背景与目标:
  • 【在被治疗的性犯罪者样本中,对犯罪情绪量表的有效性和可靠性进行了调查。】 复制标题 收藏 收藏
    DOI:10.1177/107906320601800303 复制DOI
    作者列表:Witte TD,Di Placido C,Gu D,Wong SC
    BACKGROUND & AIMS: :The Criminal Sentiments Scale (CSS) purports to measure attitudes and values pertaining to and in support of criminal conduct (Andrews & Bonta, 2003). A sample of 72 sex offenders treated in a high intensity sex offender program completed the CSS pre- and post-treatment and was followed up for approximately 3 years. The CSS demonstrated high internal consistency. Correlational and ROC analyses indicated that the CSS is a robust predictor of non-sexual violent and non-violent recidivism. The CSS also correlated significantly with the General Statistical Information on Recidivism Scale which assesses non-sexual recidivism risk. Similar analyses showed that the CSS did not predict sexual recidivism nor did it correlate with a tool that assesses sexual recidivism risk-the Static 99. The CSS scores changed significantly in the predicted direction following treatment. The present results suggest that the CSS is an internally consistent tool for the assessment of criminal sentiments among sex offenders and that it predicts non-sexual but not sexual recidivism.
    背景与目标: : 犯罪情绪量表 (CSS) 旨在衡量与犯罪行为有关并支持犯罪行为的态度和价值观 (Andrews & Bonta,2003)。在高强度性犯罪者计划中接受治疗的72名性犯罪者的样本完成了CSS的治疗前和治疗后,并进行了大约3年的随访。CSS显示出很高的内部一致性。相关分析和ROC分析表明,CSS是非性暴力和非暴力累犯的可靠预测指标。CSS还与累犯量表上的一般统计信息显着相关,该量表评估了非性累犯风险。类似的分析表明,CSS不能预测性累犯,也不能与评估性累犯风险的工具-静态99相关。治疗后,CSS得分在预测方向上发生了显着变化。目前的结果表明,CSS是一种内部一致的工具,用于评估性犯罪者的犯罪情绪,并且可以预测非性行为,而不是性累犯。
  • 【用于 “客观” 测量颞下颌疾病疼痛的视觉模拟和言语描述符量表的可靠性。】 复制标题 收藏 收藏
    DOI:10.1177/00220345880670010601 复制DOI
    作者列表:Le Resche L,Burgess J,Dworkin SF
    BACKGROUND & AIMS: :Eight dentists viewed standardized videotapes showing palpations of the temporomandibular joint and muscles of mastication and recorded their judgments concerning the amount of pain the patient was experiencing. Judgments were recorded using a four-point verbal descriptor scale (VDS) ("none", "mild", "moderate", "severe" pain) or a 100-mm visual analog scale (VAS) anchored with the terms "no pain" and "worst pain possible". Test/re-test reliability over a one-week period and interjudge reliabilities were calculated for each scale; reliabilities of the two scales were directly compared based on the statistical equivalence of weighted kappa and the Intraclass Correlation Coefficient. Neither scale showed satisfactory reliability. Median test/re-test reliabilities were k = 0.590 for the VDS and r = 0.822 for the VAS. Interjudge reliabilities averaged k = 0.394 for the VDS and r = 0.735 for the VAS. Direct comparison of reliabilities for the two scales showed no clear advantage for either scale. The marginal reliabilities of these scales, when used by dentists to quantify the patient's pain, suggest that neither scale should be regarded as an "objective" pain measure.
    背景与目标: : 八名牙医观看了显示颞下颌关节触诊和咀嚼肌肉的标准化录像带,并记录了他们对患者疼痛程度的判断。使用四点言语描述符量表 (VDS) (“无”,“轻度”,“中度”,“严重” 疼痛) 或以术语 “无” 固定的100毫米视觉模拟量表 (VAS) 记录判断。疼痛 ”和“ 可能的最严重疼痛 ”。在一周内进行测试/重新测试的可靠性,并为每个量表计算相互判断的可靠性; 根据加权kappa的统计等效性和组内相关系数,直接比较了两个量表的可靠性。两个量表都没有显示出令人满意的可靠性。VDS的中位测试/再测试可靠性为k = 0.590,VAS的r = 0.822。对于VDS,判断间可靠性平均为k = 0.394,对于VAS,判断间可靠性平均为r = 0.735。两个量表的可靠性的直接比较显示出两个量表都没有明显的优势。当牙医使用这些量表来量化患者的疼痛时,这些量表的边际可靠性表明,这两个量表都不应被视为 “客观” 的疼痛度量。
  • 【浴室体重秤和血压计在量化数字后-前脊柱压力大小方面的可靠性和并发有效性。】 复制标题 收藏 收藏
    DOI:10.3233/THC-130741 复制DOI
    作者列表:Teslim OA,Olayinka A,Michael E,Adesoji AR,Oluwole AT
    BACKGROUND & AIMS: PURPOSE:The main objective of this study was to determine the reliability and concurrent validity of using Modified Sphygmomanometer (sphyg) and body weighing scale to determine the magnitude of pressure and body weight change during digital spinal examination. SUBJECTS:One hundred and twenty apparently healthy subjects were recruited. METHODS:Weighing scale was used to measure the body weight of the investigator while Mercury sphygmomanometer was used to measure the pressure exerted. There was test re-test interval of 2 weeks. Data were analysed by descriptive statistic and Pearson product moment correlation. RESULTS:The result showed that there was significant correlation between the weight change (WC) values obtained on the weighing scale on day 1 and 2 (r=0.91). Also, there was significant correlation between Pressure obtained in the sphygmomanometer on day 1 and 2 (r=0.94). Similarly, there was significant correlation between the WC and Pressure Change (PC), (r=-0.49). The participant's age, height, weight and BMI contributed significantly as predictors for both BWC and PC change respectively. CONCLUSION:This study concluded that weighing scale and Mercury sphygmomanometer are reliable and valid to determine the magnitude of pressure applied during DPAP.
    背景与目标:
  • 【脑瘫儿童下肢手持测力计和功能强度测试的可靠性。】 复制标题 收藏 收藏
    DOI:10.1080/09638280701639873 复制DOI
    作者列表:Verschuren O,Ketelaar M,Takken T,Van Brussel M,Helders PJ,Gorter JW
    BACKGROUND & AIMS: PURPOSE:To evaluate the intertester reliability of two methods for measuring lower-limb strength in children with cerebral palsy (CP). METHOD:Twenty-five subjects with CP (7-17 years of age) participated in this study. Lower-limb muscle strength was measured on 2 occasions using a Hand-held Dynamometer (HHD; break-method and make-method) and a 30-sec Repetition Maximum (RM) during three functional strength tests for the lower extremities. Reliability was measured using the intraclass correlation coefficients (ICCs), the standard error of measurement (SEM) and the coefficient of variation (CV). RESULTS:The intertester reliability of strength measurement using a HHD was questionable with ICC values ranging from 0.42-0.73 for the break-method, and from 0.49-0.82 for the make-method. The SEM and CV (%) values ranged from 27.9-58.9 and 22.2-35.3% for the break-method, and from 30.6-52.7 and 16.2-56.2% for make-method. The intertester reliability of strength measurement using the 30-sec RM was acceptable with ICC values ranging from 0.91 -0.96, and SEM and CV (%) values ranging from 1.1-2.6 and 10.9-39.9% for the functional exercises. CONCLUSION:The intertester reliability of measuring muscle strength of the lower extremities using a hand-held dynamometer is questionable. The intertester reliability of the 30-sec RM for the lower extremity is acceptable.
    背景与目标:
  • 【稳定型精神分裂症患者的信度,效度和检测个人和社会绩效量表变化的能力。】 复制标题 收藏 收藏
    DOI:10.1016/j.psychres.2007.11.012 复制DOI
    作者列表:Nasrallah H,Morosini P,Gagnon DD
    BACKGROUND & AIMS: :This report describes the measurement properties of the Personal and Social Performance scale (PSP), a clinician-reported measure of severity of personal and social dysfunction, in an outpatient population with stabilized schizophrenia. Pooled data from two similar antipsychotic clinical studies were analyzed (n=411). The PSP showed good test-retest reliability (intraclass correlation coefficient=0.79). The PSP was more highly correlated with the Strauss-Carpenter Level of Function, an instrument measuring a similar construct, than the Positive and Negative Syndrome Scale, an instrument measuring a different construct. There was a statistically significant difference between mean PSP scores in subjects grouped by their severity rating on the Clinical Global Impression-Severity (CGI-S) (mild or less versus at least moderate), indicating the ability to discriminate between known groups. Effect sizes for mean change in the PSP based on 1-category improvement (0.72) or worsening (-0.88) versus no change in the CGI-S were moderate to large, demonstrating the ability to detect change. Estimates of between-group minimum important difference suggest that a 7-point improvement in the PSP may be clinically meaningful in a clinical trial setting. Initial reliability and validity assessments suggest the PSP may be a useful measure of social functioning in patients with stable schizophrenia.
    背景与目标: : 本报告描述了个人和社会绩效量表 (PSP) 的测量特性,PSP是临床医生报告的对稳定精神分裂症门诊患者个人和社会功能障碍严重程度的测量。分析了来自两个相似的抗精神病药物临床研究的合并数据 (n = 411)。PSP显示出良好的重测信度 (类内相关系数 = 0.79)。PSP与测量类似结构的仪器Strauss-Carpenter功能水平 (一种测量类似结构的仪器) 的相关性更高,而与测量不同结构的正负综合症量表 (一种测量不同结构的仪器) 的相关性更高。在按临床总体印象严重程度 (cgi-s) 的严重程度分组的受试者中,平均PSP得分之间存在统计学上的显着差异 (轻度或以下与至少中度),表明可以区分已知组。基于1-类别改善 (0.72) 或恶化 (-0.88) 与cgi-s无变化的PSP的平均变化的效应大小是中等到大的,表明检测变化的能力。组间最小重要差异的估计表明,在临床试验环境中,PSP的7点改善可能具有临床意义。最初的信度和效度评估表明,PSP可能是稳定精神分裂症患者社交功能的有用指标。
  • 【胜任力和理智意见的现场可靠性: 系统评价和荟萃分析。】 复制标题 收藏 收藏
    DOI:10.1037/pas0000388 复制DOI
    作者列表:Guarnera LA,Murrie DC
    BACKGROUND & AIMS: :We know surprisingly little about the interrater reliability of forensic psychological opinions, even though courts and other authorities have long called for known error rates for scientific procedures admitted as courtroom testimony. This is particularly true for opinions produced during routine practice in the field, even for some of the most common types of forensic evaluations-evaluations of adjudicative competency and legal sanity. To address this gap, we used meta-analytic procedures and study space methodology to systematically review studies that examined the interrater reliability-particularly the field reliability-of competency and sanity opinions. Of 59 identified studies, 9 addressed the field reliability of competency opinions and 8 addressed the field reliability of sanity opinions. These studies presented a wide range of reliability estimates; pairwise percentage agreements ranged from 57% to 100% and kappas ranged from .28 to 1.0. Meta-analytic combinations of reliability estimates obtained by independent evaluators returned estimates of κ = .49 (95% CI: .40-.58) for competency opinions and κ = .41 (95% CI: .29-.53) for sanity opinions. This wide range of reliability estimates underscores the extent to which different evaluation contexts tend to produce different reliability rates. Unfortunately, our study space analysis illustrates that available field reliability studies typically provide little information about contextual variables crucial to understanding their findings. Given these concerns, we offer suggestions for improving research on the field reliability of competency and sanity opinions, as well as suggestions for improving reliability rates themselves. (PsycINFO Database Record
    背景与目标: : 尽管法院和其他当局长期以来一直呼吁为被视为法庭证词的科学程序提供已知的错误率,但我们对法医心理学观点的评估间可靠性知之甚少。对于在该领域的常规实践中产生的意见尤其如此,甚至对于某些最常见的法医评估-对审判能力和法律健全性的评估。为了解决这一差距,我们使用了元分析程序和研究空间方法来系统地审查研究,这些研究检查了能力和理智意见的评分者间可靠性 (尤其是现场可靠性)。在59项已确定的研究中,9项涉及能力意见的现场可靠性,8项涉及理智意见的现场可靠性。这些研究提出了广泛的可靠性估计; 成对的百分比协议范围从57% 到100%,kappas范围从0.28到1.0。独立评估者获得的可靠性估计的元分析组合返回能力意见的 κ = .49 (95% CI: .40-.58) 和理智意见的 κ = .41 (95% CI: .29-.53) 的估计。这种广泛的可靠性估计强调了不同评估环境倾向于产生不同可靠性率的程度。不幸的是,我们的研究空间分析表明,可用的现场可靠性研究通常很少提供有关对理解其发现至关重要的上下文变量的信息。鉴于这些问题,我们提供了有关改进能力和健全性意见的现场可靠性研究的建议,以及提高可靠性率本身的建议。(PsycINFO数据库记录
  • 【限制拉丁裔学龄前儿童观看电视的目标导向行为模型: 有效性和可靠性。】 复制标题 收藏 收藏
    DOI:10.1186/s12889-020-8268-x 复制DOI
    作者列表:Ogren M,Baranowski T,Lowry SJ,Mendoza JA
    BACKGROUND & AIMS: BACKGROUND:Accurately measuring parents' attitudes and beliefs regarding limiting their children's TV viewing is important to inform the design and evaluation of effective interventions. This manuscript assesses the internal consistency reliability, test-retest reliability, convergent validity, and construct validity of the Model of Goal Directed Behavior (MGDB) scales among parents of Latino preschoolers to characterize Latino parents' attitudes and beliefs toward limiting their preschoolers' TV viewing. METHOD:Participants included parents of Latino preschoolers in the United States, 3-5 years old (n = 186). Parents completed a socio-demographic survey and the 105-item MGDB questionnaire (Attitudes, Perceived Positive/Negative Behavioral Control, Subjective Norms, Positive and Negative Anticipated Emotions, Habits, Self-Efficacy, Desires, and Intentions surrounding their child's TV viewing) which was used to measure internal consistency reliability and construct validity. A subsample of participants completed the questionnaire twice to measure test-retest reliability. Further, parents completed a 7-day TV viewing diary for their preschooler, and a TV parenting practices questionnaire as measures of convergent validity. RESULTS:Internal consistency reliability was generally acceptable for the MGDB scales (Cronbach's alphas> 0.7), except for the Desires scale, which was revealed to have two factors and the Attitudes and Perceived Behavioral Control scales. Test-retest reliability over 2 months had negligible to moderate correlations (r's = 0.28 to 0.61). Two structural equation models were conducted. One yielded acceptable model fit (x2 (97) = 113.65, p = .119) and the other had questionable model fit (x2 (97) = 125.39; p = .028). Testing convergent validity, only two MGDB scales (Habits and Self-Efficacy) were positively correlated with the TV parenting practices questionnaire (r's = 0.33 to 0.51), and none were meaningfully correlated with preschoolers' mean daily TV viewing. CONCLUSIONS:Initial reliability and validity for some of the MGDB scales appear acceptable among parents of Latino preschoolers. Refinement of the instrument and testing among larger samples is necessary to fully evaluate psychometric properties. This instrument may be useful for characterizing Latino parents' attitudes and beliefs toward limiting their preschoolers' TV viewing and informing future TV reduction interventions. TRIAL REGISTRATION:Clinical Trials NCT01216306 Registered October 6, 2010.
    背景与目标:
  • 【使用Cybex II等速测力计测试健康成年女性膝盖伸肌和屈肌的可靠性。】 复制标题 收藏 收藏
    DOI:10.2519/jospt.1991.14.1.37 复制DOI
    作者列表:Molczyk L,Thigpen LK,Eickhoff J,Goldgar D,Gallagher JC
    BACKGROUND & AIMS: :This work was supported by NIH grant P50-AR39221. The purpose of this study was to determine intraobserver and interobserver reliability using a Cybex II isokinetic dynamometer for the measurement of isometric and isokinetic strength (0, 60, 180, 300 degrees /sec) and endurance (240 degrees /sec) in 20 healthy untrained females. Subjects were randomly assigned to one of two testers. Intraobserver reliability coefficients for tester one and tester two ranged from 0.89 to 0.98 and 0.72 to 0.97, respectively. Interobserver reliability coefficients ranged from 0.69 to 0.95 across speeds and movements. Although the testers differed significantly in reliability (p < 0.01), no systematic differences in measured levels of performance were noted between testers one and two for the majority of the measurements. The results indicate that comparable muscular strength and endurance values can be obtained from knee extension/flexion tests at various speeds using a Cybex II isokinetic dynamometer. Comparable values were obtained whether subjects were tested by a single tester on two separate occasions or whether subjects were tested by a different tester on two separate occasions. J Orthop Sports Phys Ther 1991;14(1):37-41.
    背景与目标: : 这项工作得到了NIH赠款P50-AR39221的支持。这项研究的目的是使用Cybex II等速测力计确定观察者内和观察者间的可靠性,以测量20名未受过训练的健康女性的等速和等速强度 (0、60、180、300度/秒) 和耐力 (240度/秒)。受试者被随机分配到两个测试者之一。测试仪1和测试仪2的观察者内可靠性系数分别为0.89至0.98和0.72至0.97。观察者间可靠性系数在速度和运动之间的范围从0.69到0.95。尽管测试人员在可靠性方面存在显着差异 (p <0.01),但对于大多数测量,测试人员1和2之间的性能测量水平没有系统差异。结果表明,使用Cybex II等速测力计,可以通过各种速度的膝盖伸展/屈曲测试获得可比的肌肉力量和耐力值。无论受试者是在两个不同的场合由单个测试者测试,还是在两个不同的场合由不同的测试者测试,都获得了可比的值。J Orthop体育物理1991;14(1):37-41。
  • 【神经心理学测量的稳健可靠性: 重测相关性的荟萃分析。】 复制标题 收藏 收藏
    DOI:10.1080/13854046.2013.809795 复制DOI
    作者列表:Calamia M,Markon K,Tranel D
    BACKGROUND & AIMS: :Test-retest reliability is an important psychometric property relevant to assessment instruments typically used in neuropsychological assessment. This review presents a quantitative summary of test-retest reliability coefficients for a variety of widely used neuropsychological measures. In general, the meta-analytic test-retest reliabilities of the test scores ranged from adequate to high (i.e., r=.7 and higher). Furthermore, the reliability values were largely robust across factors such as age, clinical diagnosis, and the use of alternate forms. The values for some of the memory and executive functioning scores were lower (i.e., less than r=.7). Some of the possible reasons for these lower values include ceiling effects, practice effects, and across time variability in cognitive abilities measured by those tests. In general, neuropsychologists who use these measures in their assessments can be encouraged by the magnitude of the majority of the meta-analytic test-retest correlations obtained.
    背景与目标: : 重测信度是与神经心理学评估中通常使用的评估工具相关的重要心理测量属性。这篇综述对各种广泛使用的神经心理学测量方法的重测信度系数进行了定量总结。通常,测试分数的荟萃分析重测可靠性范围从足够到高 (即r =.7及更高)。此外,可靠性值在年龄,临床诊断和替代形式的使用等因素中均具有很大的鲁棒性。一些记忆和执行功能得分的值较低 (即,小于r =.7)。这些较低值的一些可能原因包括天花板效应,实践效应以及通过这些测试测量的认知能力的跨时间变化。通常,在评估中使用这些措施的神经心理学家可以通过获得的大多数荟萃分析测试-重测相关性的程度来鼓励他们。
  • 【登革热的四种快速免疫层析测试的诊断参数和可靠性4。】 复制标题 收藏 收藏
    DOI:10.1016/j.bjid.2019.12.004 复制DOI
    作者列表:Mata VE,Passos SRL,Santos MABD,Buonora SN,de Andrade CAF,Lima MDRQ,Costa BM,Hökerberg YHM
    BACKGROUND & AIMS: BACKGROUND:Although performance of rapid immunochromatographic tests (RITs) for dengue virus (DENV) serotypes 1, 2 and 3 is relatively settled, evidence on accuracy of RITs for DENV-4 are based on studies with small sample sizes and with discrepant results. OBJECTIVES:To assess accuracy and inter-observer agreement of RITs targeting dengue nonstructural protein-1 (NS1) antigen - Dengue NS1-Bioeasy™, Dengue NS1 Ag Strip-Bio-Rad™, IVB Dengue Ag NS1-Orangelife™ and Dengue NS1-K130-Bioclin™ in DENV-4 samples. METHODS:Study sample (n = 324) included adults presenting at an emergency unit in Rio de Janeiro, Brazil, with fever of ≤72 h and two or more dengue symptoms. A serum sample from each patient was tested by each RIT. A positive reverse-transcription polymerase chain reaction was considered as the reference standard for dengue diagnosis. The diagnostic parameters analyzed for each RIT were sensitivity, specificity, positive and negative predictive values, and likelihood ratios. Each RIT was read by homogeneous (two junior nurses) or heterogeneous (one junior nurse and one senior biologist) pairs. Agreement was estimated by simple kappa with 95% confidence interval, positive (Ppos) and negative (Pneg) proportion concordance and prevalence and bias adjusted kappa, rated from poor (k < 0.0) to almost perfect (0.8 < k < 1.0), and perfect (k = 1). RESULTS:NS1 RITs for DENV-4 diagnosis showed high specificity (95.9%-99.4%), but low sensitivity (14.7%-45.4%). Bioeasy™ had the best performance, with a positive likelihood ratio of 26.0 (95% CI: 8.4;81.0). Inter-observer agreement was almost perfect for all evaluated RITs. Mismatches in confirmed dengue were more common for the Bioclin™ (Ppos 88.3-90.0 %) and Orangelife™ (Ppos 91.7-94.1 %) tests. CONCLUSIONS:For DENV-4, the tested RITs had high specificity, but lower sensitivity compared to published results for other serotypes. They should not be used for screening purposes. Different brands may have very different performances. This should be considered upon deciding of using RITs in DENV-4 outbreaks.
    背景与目标:
  • 【用于测量绝经后妇女血液雄激素和雌激素的市售直接放射免疫测定法的可靠性和有效性。】 复制标题 收藏 收藏
    DOI: 复制DOI
    作者列表:Rinaldi S,Déchaud H,Biessy C,Morin-Raverot V,Toniolo P,Zeleniuch-Jacquotte A,Akhmedkhanov A,Shore RE,Secreto G,Ciampi A,Riboli E,Kaaks R
    BACKGROUND & AIMS: :In large-scale epidemiological studies on endogenous sex steroids and cancer risk, direct immunoassays of circulating hormone levels have the advantage of being fast and comparatively inexpensive while requiring only small sample volumes. On the other hand, indirect assays after organic extraction and chromatographic prepurification have the advantage of reducing specific interferences and matrix effects and hence are thought to have better validity. We compared direct assays of testosterone (T, six different assays), Delta4-androstenedione (A, four assays), estrone (E(1), one assay), and 17beta-estradiol (E(2), five assays) with measurements obtained by an indirect assay in a representative subset of 20 postmenopausal women who were part of a large prospective cohort study. Within-batch reproducibilities of the subject rankings by relative hormone levels were good (intraclass correlations >0.89) for all direct assays tested. Between batches, reproducibilities generally were also acceptable (r > 0.80) to good (r > 0.90) in terms of Pearson's correlations. The between-batch reproducibility in terms of intraclass correlations was systematically lower in terms of Pearson's correlations, however, because of between-batch variations in the absolute scale of measurements. The relative validity of direct versus indirect assays in terms of the subjects' ranking by relative hormone levels was also high for most of the kits tested for T, A, and E(1) (Pearson's correlations between 0.70 and 0.89) but was high for only two kits of five tested for E(2) (correlations of 0.86 and 0.84). On an absolute scale, mean measurement values were generally higher for direct assays than for the indirect assay and, for each hormone, varied substantially, depending on the kit used. Overall, the results of this study show that, with careful selection, commercial kits for direct radioimmunoassays of steroid hormones in postmenopausal serum can be found that may allow a reliable estimation of relative risks in epidemiological studies. However, standardization of the absolute scale of assays remains problematic.
    背景与目标: : 在有关内源性类固醇和癌症风险的大规模流行病学研究中,循环激素水平的直接免疫测定具有快速且相对便宜的优势,而仅需要少量样本量。另一方面,有机萃取和色谱预纯化后的间接测定具有减少特定干扰和基质效应的优势,因此被认为具有更好的有效性。我们比较了睾丸激素的直接测定 (T,六种不同的测定),Delta4-androstenedione (A,四种测定),雌酮 (E(1),一种测定) 和17β-雌二醇 (E(2),五个试验),通过间接试验在20名绝经后妇女的代表性子集中获得测量结果,这些妇女是一项大型前瞻性队列研究的一部分。对于所有测试的直接测定,受试者按相对激素水平排名的批内再现性良好 (类内相关性> 0.89)。在批次之间,就皮尔逊相关性而言,再现性通常也是可接受的 (r > 0.80) 至良好 (r> 0.90)。就类内相关性而言,批次间的可重复性在皮尔逊相关性方面系统地较低,但是,由于测量绝对规模的批次间变化。对于大多数测试的T,A,和E(1) (0.70和0.89之间的皮尔逊相关性),但对于仅针对E(2) 测试的五个试剂盒中的两个试剂盒 (0.86和0.84的相关性) 来说是很高的。在绝对规模上,直接测定的平均测量值通常高于间接测定的平均测量值,并且对于每种激素,根据所使用的试剂盒,其差异很大。总体而言,这项研究的结果表明,通过仔细选择,可以发现用于绝经后血清中类固醇激素直接放射免疫测定的商业试剂盒,可以在流行病学研究中可靠地估计相对风险。然而,测定绝对规模的标准化仍然存在问题。
  • 【动机评估量表的可靠性分析: 未能复制。】 复制标题 收藏 收藏
    DOI:10.1016/0891-4222(91)90031-m 复制DOI
    作者列表:Zarcone JR,Rodgers TA,Iwata BA,Rourke DA,Dorsey MF
    BACKGROUND & AIMS: :The Motivation Assessment Scale (MAS) has been proposed as an efficient questionnaire for identifying the source of reinforcement for an individual's self-injurious behavior (SIB). A previous reliability analysis of the MAS (Durand & Crimmins, 1988) reported interrater correlation coefficients ranging from .66 to .92, based on a comparison of responses provided by classroom teachers. In this study, the reliability of the MAS was reexamined with two independent groups of developmentally disabled individuals who exhibited SIB (N = 55). For the institutional sample (n = 39), the MAS was given to two staff members (a supervisor and therapy aide) who work with the individual daily. For the school sample (n = 16), the MAS was given to the teacher and teacher's aide who taught the student. The correlational analyses completed by Durand and Crimmins (1988) were repeated; in addition, a more precise analysis of interrater reliability was calculated based on the actual number of scoring agreements between the two raters. Results showed that only 16 of the 55 raters agreed on the category of reinforcement maintaining their client's or student's SIB, that only 15% of the correlation coefficients obtained were above .80, and that none of the reliability scores based on percent agreement between raters was above 80%.
    背景与目标: : 动机评估量表 (MAS) 已被提议作为一种有效的问卷,用于确定个人自残行为 (SIB) 的强化来源。先前对MAS的可靠性分析 (Durand & Crimmins,1988) 报告的评分者之间的相关系数范围为0.66至0.92,这是基于对课堂老师提供的响应的比较。在这项研究中,对两组独立的发育障碍者表现出SIB (N = 55),重新检查了MAS的可靠性。对于机构样本 (n = 39),将MAS分配给每天与个人一起工作的两名工作人员 (一名主管和治疗助手)。对于学校样本 (n = 16),MAS被授予教学生的老师和老师的助手。重复了由Durand和Crimmins (1988) 完成的相关分析; 此外,根据两个评分者之间的评分协议的实际数量,计算了评分者间可靠性的更精确分析。结果表明,在55个评分者中,只有16个同意维持其客户或学生的SIB的强化类别,获得的相关系数中只有15% 高于.80,并且基于评分者之间的百分比一致性的可靠性得分均不高于80%。
  • 【管腔乳腺癌核心针活检中Ki67-Labelling指数的可靠性不受活检体积的影响。】 复制标题 收藏 收藏
    DOI:10.1245/s10434-016-5730-1 复制DOI
    作者列表:Focke CM,Decker T,van Diest PJ
    BACKGROUND & AIMS: BACKGROUND:Assessing prognostic and predictive factors like the Ki67 labelling index (Ki67-LI) in breast cancer core needle biopsies (CNB) may be hampered by undersampling. Our aim was to arrive at a representative assessment of Ki67-LI in CNB of luminal breast cancers by defining optimal cutoffs and establishing the minimum CNB volume needed for highest concordance of Ki67-LI between CNB and subsequent surgical excision biopsy (SEB). METHODS:We assessed the Ki67-LI in CNB and subsequent SEB of 170 luminal breast cancers according to two counting methods recommended by the International Ki67 in Breast Cancer Working Group and applied the cutoffs to distinguish low and high proliferation given by the St Gallen 2013 and 2015 consensus, respectively. We then compared CNB volume characteristics for cases with concordant and discordant Ki67-LI between CNB versus SEB. RESULTS:Highest concordance (75%, κ = 0.44) between CNB and SEB was achieved using the method that assesses the average tumor Ki67-LI and a cutoff of 20%. No significant differences were found between cases with concordant and discordant Ki67-LI in CNB versus SEB for number of biopsy cores, total core length, tumor tissue length, or total CNB or tumor tissue area size in the CNB for two various cutoffs. CONCLUSIONS:A concordance of 75% between CNB and SEB can be achieved for the Ki67-LI using a method assessing average Ki67-LI at the threshold of 20%. Increasing CNB volume did not result in improved agreement rates, indicating that reliability of Ki67 levels in CNB of luminal breast cancers is unaffected by CNB volume.
    背景与目标:

+1
+2
100研值 100研值 ¥99课程
检索文献一次
下载文献一次

去下载>

成功解锁2个技能,为你点赞

《SCI写作十大必备语法》
解决你的SCI语法难题!

技能熟练度+1

视频课《玩转文献检索》
让你成为检索达人!

恭喜完成新手挑战

手机微信扫一扫,添加好友领取

免费领《Endnote文献管理工具+教程》

微信扫码, 免费领取

手机登录

获取验证码
登录