Intra-Firm Information Flow: A Content-Structure Perspective
被引:0
|
作者:
Berchenko, Yakir
论文数: 0引用数: 0
h-index: 0
机构:
Bar Ilan Univ, Leslie & Susan Gonda Multidisciplinary Brain Res, IL-52900 Ramat Gan, Israel
Univ Cambridge, Dept Clin Vet Med, Madingley Rd, Cambridge CB3 0ES, EnglandBar Ilan Univ, Leslie & Susan Gonda Multidisciplinary Brain Res, IL-52900 Ramat Gan, Israel
Berchenko, Yakir
[1
,2
]
Daliot, Or
论文数: 0引用数: 0
h-index: 0
机构:
Hebrew Univ Jerusalem, Fac Law, IL-91905 Jerusalem, IsraelBar Ilan Univ, Leslie & Susan Gonda Multidisciplinary Brain Res, IL-52900 Ramat Gan, Israel
Daliot, Or
[3
]
Brueller, Nir N.
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, Fac Management, IL-69978 Tel Aviv, IsraelBar Ilan Univ, Leslie & Susan Gonda Multidisciplinary Brain Res, IL-52900 Ramat Gan, Israel
Brueller, Nir N.
[4
]
机构:
[1] Bar Ilan Univ, Leslie & Susan Gonda Multidisciplinary Brain Res, IL-52900 Ramat Gan, Israel
[2] Univ Cambridge, Dept Clin Vet Med, Madingley Rd, Cambridge CB3 0ES, England
[3] Hebrew Univ Jerusalem, Fac Law, IL-91905 Jerusalem, Israel
[4] Tel Aviv Univ, Fac Management, IL-69978 Tel Aviv, Israel
Natural language processing;
social network analysis;
information flow;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
This paper endeavors to bring together two largely disparate areas of research. On one hand, text mining methods treat each document as an independent instance despite the fact that in many text domains, documents are linked and their topics are correlated. For example, web pages of related topics are often connected by hyperlinks and scientific papers from related fields are typically linked by citations. On the other hand, Social Network Analysis (SNA) typically treats edges between nodes according to "flat" attributes in binary form alone. This paper proposes a simple approach that addresses both these issues in data mining scenarios involving corpora of linked documents. According to this approach, after assigning weights to the edges between documents, based on the content of the documents associated with each edge, we apply standard SNA and network theory tools to the network. The method is tested on the Enron email corpus and successfully discovers the central people in the organization and the relevant communications between them. Furthermore, Our findings suggest that due to the non-conservative nature of information, conservative centrality measures (such as PageRank) are less adequate here than non-conservative centrality measures (such as eigenvector centrality).
机构:
GREDEG, 250 Rue Albert Einstein, F-06560 Valbonne, France
Univ Nice Sophia Antipolis, F-06560 Valbonne, FranceGREDEG, 250 Rue Albert Einstein, F-06560 Valbonne, France
Ben Youssef, Adel
Hadhri, Walid
论文数: 0引用数: 0
h-index: 0
机构:
ADIS Univ Paris Sud, Sceaux, France
UAQUAP Univ Tunis, Sceaux, FranceGREDEG, 250 Rue Albert Einstein, F-06560 Valbonne, France
Hadhri, Walid
M'Henni, Hatem
论文数: 0引用数: 0
h-index: 0
机构:
ESC Univ La Manouba Campus, Univ Manouba, Tunis 2010, TunisiaGREDEG, 250 Rue Albert Einstein, F-06560 Valbonne, France