conflibertR
An R package that brings ConfliBERT, a transformer model purpose-built for conflict and political violence text, into R. Run pretrained inference, benchmark models, fine-tune custom classifiers, and compare architectures, all from your R console.
Installation
# Install from GitHub
devtools::install_github("shreyasmeher/conflibertR")
# Set up the Python backend (run once, then restart R)
library(conflibertR)
conflibert_install()Named Entity Recognition
Extract persons, organizations, locations, weapons, and temporal expressions from conflict text.
ner_results <- conflibert_ner(
"The United Nations deployed peacekeepers to eastern Congo
after rebel forces attacked Goma on Tuesday."
)
ner_results
# Vectorized: pass multiple texts at once
ner_batch <- conflibert_ner(c(
"NATO forces launched airstrikes near Tripoli in March 2024.",
"President Zelenskyy met with General Syrskyi in Kyiv."
))Binary Classification
Determine whether text describes a conflict event. Returns a label, confidence score, and per-class probabilities.
classify_results <- conflibert_classify(c(
"Armed militants stormed a military outpost killing twelve soldiers.",
"The European Central Bank held interest rates steady at 4.5 percent.",
"Protesters clashed with riot police outside the parliament building.",
"A new trade agreement was signed between Japan and Australia."
))
classify_resultsMultilabel Event Classification
Score text against four event categories (Armed Assault, Bombing or Explosion, Kidnapping, and Other), each evaluated independently.
event_results <- conflibert_multilabel(c(
"A car bomb exploded near the government ministry in Baghdad.",
"Gunmen kidnapped three aid workers travelling through the Sahel region."
))
event_resultsQuestion Answering
Extract answers directly from conflict-related passages.
context <- "On 15 September 2023, ethnic clashes erupted in Manipur, India.
The Indian Army deployed two battalions to restore order.
At least 60 people were killed and over 200 injured in the violence."
conflibert_qa(context, "How many people were killed?")
conflibert_qa(context, "Who was deployed to restore order?")
conflibert_qa(context, "Where did the clashes occur?")Benchmarking
Evaluate the pretrained binary classifier against your own labeled data.
data <- conflibert_example("binary")
bench <- conflibert_benchmark(data$test$text, data$test$label)
benchFine-Tuning
Train a custom binary classifier on labeled data. The package ships with example datasets.
data <- conflibert_example("binary")
result <- conflibert_finetune(
train = data$train,
dev = data$dev,
test = data$test,
model = "ConfliBERT",
task = "binary",
epochs = 3
)
cat("Accuracy:", result$metrics$accuracy, "\n")
cat("F1 Score:", result$metrics$f1, "\n")Multiclass classification is also supported:
mc_data <- conflibert_example("multiclass")
mc_result <- conflibert_finetune(
train = mc_data$train,
dev = mc_data$dev,
test = mc_data$test,
model = "ConfliBERT",
task = "multiclass",
epochs = 3
)Model Comparison
Compare ConfliBERT against general-purpose transformers on the same task.
comparison <- conflibert_compare(
train = data$train,
dev = data$dev,
test = data$test,
models = c("ConfliBERT", "BERT Base Uncased", "DistilBERT Base"),
task = "binary",
epochs = 3
)
comparisonSee all available base models: conflibert_models()
Function Reference
| Function | Task |
|---|---|
conflibert_ner() |
Named entity recognition |
conflibert_classify() |
Binary conflict classification |
conflibert_multilabel() |
Multilabel event type scoring |
conflibert_qa() |
Extractive question answering |
conflibert_benchmark() |
Evaluate pretrained classifier |
conflibert_finetune() |
Train a custom classifier |
conflibert_compare() |
Compare multiple architectures |
conflibert_example() |
Load bundled example datasets |
conflibert_models() |
List available base models |
Citation
@inproceedings{hu2022conflibert,
title={ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence},
author={Hu, Yibo and Hosseini, MohammadSaleh and Parolin, Erick Skorupa
and Osorio, Javier and Khan, Latifur and Brandt, Patrick and D'Orazio, Vito},
booktitle={Proceedings of NAACL-HLT 2022},
pages={5469--5482},
year={2022}
}