writing方面的記錄

2019-12-29 關(guān)于投一個(gè)benchmark dataset的文章:

(ICST 2019: BugsJS: a Benchmark of JavaScript Bugs)

JavaScript is a popular programming language that is also error-prone due to its asynchronous, dynamic, and loosely-typed nature. In recent years, numerous techniques have been proposed for analyzing and testing JavaScript applications. However, our survey of the literature in this area revealed that the proposed techniques are often evaluated on different datasets of programs and bugs【這個(gè)感覺(jué)站不住腳】. The lack of a commonly used benchmark limits the ability to perform fair and unbiased comparisons for assessing the efficacy of new techniques【好像也站不住腳妻熊,都在同一個(gè)dataset上比不就好了么,只能說(shuō)沒(méi)有一個(gè)a strong benchmark】. To fill this gap, we propose BugsJS, a benchmark of 453【也不大大呀】 real, manually validated JavaScript bugs from 10 popular JavaScript server-side programs, comprising 444k LOC in total. Each bug is accompanied by its bug report, the test cases that detect it, as well as the patch that fixes it. BugsJS features a rich interface for accessing the faulty and fixed versions of the programs and executing the corresponding test cases, which facilitates conducting highly-reproducible empirical studies and comparisons of JavaScript analysis and testing tools.

2019-12-13 國(guó)家公祭日:

In essence, our study empirically confirms and complements previous research findings (and common sense): Developers (and users) prefer documentation that is correct, complete, up to date, usable, maintainable, readable and useful.



Findings: Table VII shows the impact of using each of the

17 languages on the number of bug fixing commits in a

single-language (denoted as hlanguageiS) and multi-language

(denoted as hlanguageiM) setting. From the table, we can note

that the coefficients of the languages are not always statistically

significant. The statistically significant ones are marked with

one or multiple asterisks. There are 20 of them. For those that

are not statistically significant (i.e., 14 of them), unfortunately

not much conclusion can be drawn.

For some languages, the coefficient for the single-language

setting is significant, while the one for the multi-language

setting is not (four languages: CoffeeScript, Ruby, Erlang,

Haskell). For some other languages, it is the other way around

— the coefficient for the multi-language setting is significant,

while the one for the single-language setting is not (four languages:

C, Go, PHP, Python). For yet other languages, their coefficients

for both settings are not significant (three languages:

C#, JavaScript, Perl). Unfortunately, for such languages (11

languages), we cannot compare the two settings (i.e., singlelanguage

and multi-language), because the coefficient of at

least one of the settings is inconclusive.

Thus, we focus on languages with statistically significant

coefficients for both single and multi-language settings. We

find six languages with statistically significant coefficients:

C++, Objective-C, Java, TypeScript, Clojure, and Scala. For all

of them, we consistently find that their coefficients are larger

when they are used in a multi-language setting. This means

that there is a statistically significant support that using these

languages in a multi-language setting (rather than a singlelanguage

setting) increases bug proneness. The findings for the

other eleven languages do not refute the six languages, because

we can not conclude when coefficients are not statistically

significant.

Six languages including C++, Objective-C, Java, TypeScript,

Clojure, and Scala are more defect prone when they are used

with other languages. The results are inconclusive for the

other eleven languages.

1. in objective terms

2. 【可以作為將來(lái)的精讀文章】Patters of knowledge in API reference Documentation. TSE'13 by Martin. P. Robillard. 這篇文章主要對(duì)API reference Documentation(如jdk和.net的以api name為index的每一個(gè)webpage介紹該api的使用內(nèi)容啥的documentation)進(jìn)行content的分析。

對(duì)內(nèi)容進(jìn)行分析羡棵,主要是想知道一般的api documentation中包含了哪些內(nèi)容贺奠,其實(shí)如何組織的己单。具體的就是作者們花了大工夫先定義好了12中knowledge type(如api的功能是什么赛不,該如何使用等)睬辐,隨后分析了這些knowledge type在documentation中的分布按照type vs. method, classes vs. interface and member vs. variable的形式來(lái)進(jìn)行統(tǒng)計(jì)疗垛。同時(shí)輔助于frequent itemsets mining(使用R中的arules進(jìn)行統(tǒng)計(jì)的)症汹。

全文在寫(xiě)作或者實(shí)驗(yàn)方面:對(duì)最重要的第一步定義knowledge type的整個(gè)過(guò)程完全可以好好學(xué)習(xí)。對(duì)后面幾個(gè)較直觀(guān)的RQ的統(tǒng)計(jì)分析也是較常規(guī)的方法贷腕。對(duì)自己工作的意義方面寫(xiě)得較合理有說(shuō)服力背镇。 值得學(xué)習(xí)!

如果做類(lèi)似這樣的工作泽裳,里面的方法值得借鑒瞒斩!

------en...support evidence:

All newly developed applications have?bugs—some of them are quite?difficult to locate?because they exist within the coding logic, some are simply a matter of not

最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
  • 序言:七十年代末,一起剝皮案震驚了整個(gè)濱河市涮总,隨后出現(xiàn)的幾起案子胸囱,更是在濱河造成了極大的恐慌,老刑警劉巖瀑梗,帶你破解...
    沈念sama閱讀 217,277評(píng)論 6 503
  • 序言:濱河連續(xù)發(fā)生了三起死亡事件烹笔,死亡現(xiàn)場(chǎng)離奇詭異,居然都是意外死亡夺克,警方通過(guò)查閱死者的電腦和手機(jī)箕宙,發(fā)現(xiàn)死者居然都...
    沈念sama閱讀 92,689評(píng)論 3 393
  • 文/潘曉璐 我一進(jìn)店門(mén),熙熙樓的掌柜王于貴愁眉苦臉地迎上來(lái)铺纽,“玉大人搀暑,你說(shuō)我怎么就攤上這事电湘∞挚” “怎么了搀捷?”我有些...
    開(kāi)封第一講書(shū)人閱讀 163,624評(píng)論 0 353
  • 文/不壞的土叔 我叫張陵,是天一觀(guān)的道長(zhǎng)其馏。 經(jīng)常有香客問(wèn)我凤跑,道長(zhǎng),這世上最難降的妖魔是什么叛复? 我笑而不...
    開(kāi)封第一講書(shū)人閱讀 58,356評(píng)論 1 293
  • 正文 為了忘掉前任仔引,我火速辦了婚禮扔仓,結(jié)果婚禮上,老公的妹妹穿的比我還像新娘咖耘。我一直安慰自己翘簇,他們只是感情好,可當(dāng)我...
    茶點(diǎn)故事閱讀 67,402評(píng)論 6 392
  • 文/花漫 我一把揭開(kāi)白布儿倒。 她就那樣靜靜地躺著版保,像睡著了一般。 火紅的嫁衣襯著肌膚如雪夫否。 梳的紋絲不亂的頭發(fā)上彻犁,一...
    開(kāi)封第一講書(shū)人閱讀 51,292評(píng)論 1 301
  • 那天,我揣著相機(jī)與錄音凰慈,去河邊找鬼汞幢。 笑死,一個(gè)胖子當(dāng)著我的面吹牛溉瓶,可吹牛的內(nèi)容都是我干的急鳄。 我是一名探鬼主播,決...
    沈念sama閱讀 40,135評(píng)論 3 418
  • 文/蒼蘭香墨 我猛地睜開(kāi)眼堰酿,長(zhǎng)吁一口氣:“原來(lái)是場(chǎng)噩夢(mèng)啊……” “哼!你這毒婦竟也來(lái)了张足?” 一聲冷哼從身側(cè)響起触创,我...
    開(kāi)封第一講書(shū)人閱讀 38,992評(píng)論 0 275
  • 序言:老撾萬(wàn)榮一對(duì)情侶失蹤,失蹤者是張志新(化名)和其女友劉穎为牍,沒(méi)想到半個(gè)月后哼绑,有當(dāng)?shù)厝嗽跇?shù)林里發(fā)現(xiàn)了一具尸體,經(jīng)...
    沈念sama閱讀 45,429評(píng)論 1 314
  • 正文 獨(dú)居荒郊野嶺守林人離奇死亡碉咆,尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛 以下內(nèi)容為張勛視角 年9月15日...
    茶點(diǎn)故事閱讀 37,636評(píng)論 3 334
  • 正文 我和宋清朗相戀三年抖韩,在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了。 大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片疫铜。...
    茶點(diǎn)故事閱讀 39,785評(píng)論 1 348
  • 序言:一個(gè)原本活蹦亂跳的男人離奇死亡茂浮,死狀恐怖,靈堂內(nèi)的尸體忽然破棺而出壳咕,到底是詐尸還是另有隱情席揽,我是刑警寧澤,帶...
    沈念sama閱讀 35,492評(píng)論 5 345
  • 正文 年R本政府宣布谓厘,位于F島的核電站幌羞,受9級(jí)特大地震影響,放射性物質(zhì)發(fā)生泄漏竟稳。R本人自食惡果不足惜属桦,卻給世界環(huán)境...
    茶點(diǎn)故事閱讀 41,092評(píng)論 3 328
  • 文/蒙蒙 一熊痴、第九天 我趴在偏房一處隱蔽的房頂上張望。 院中可真熱鬧聂宾,春花似錦愁拭、人聲如沸。這莊子的主人今日做“春日...
    開(kāi)封第一講書(shū)人閱讀 31,723評(píng)論 0 22
  • 文/蒼蘭香墨 我抬頭看了看天上的太陽(yáng)。三九已至蔚鸥,卻和暖如春惜论,著一層夾襖步出監(jiān)牢的瞬間,已是汗流浹背止喷。 一陣腳步聲響...
    開(kāi)封第一講書(shū)人閱讀 32,858評(píng)論 1 269
  • 我被黑心中介騙來(lái)泰國(guó)打工馆类, 沒(méi)想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留,地道東北人弹谁。 一個(gè)月前我還...
    沈念sama閱讀 47,891評(píng)論 2 370
  • 正文 我出身青樓乾巧,卻偏偏與公主長(zhǎng)得像,于是被迫代替她去往敵國(guó)和親预愤。 傳聞我的和親對(duì)象是個(gè)殘疾皇子沟于,可洞房花燭夜當(dāng)晚...
    茶點(diǎn)故事閱讀 44,713評(píng)論 2 354

推薦閱讀更多精彩內(nèi)容