Veracity User Guide

Learn how to use the Veracity platform for citation checking.

1. Introduction

Purpose of This Document

This guide is intended for advanced users—editors, peer reviewers, and other “super users”—to understand the deeper philosophies behind Veracity’s citation checking tools and to interpret the results effectively. While Veracity offers powerful automation to streamline scholarly review, human judgment remains vital. This documentation will equip you with the context and best practices to derive maximum value from our system.

The Need for Citation Checking

In academic and professional fields such as science and law, citations are the bedrock of trust and traceability. Yet, research consistently shows that up to 25% of citations in the general scientific literature are inaccurate (Peoples & Østbye, BMJ, 2023). With the increased use of AI in text generation—where hallucination rates for citations can climb to 55% (Walters & Wilder, 2023)—the potential for further degradation in citation quality looms large.

Veracity addresses this growing concern by automating the process of checking and validating citations. Our aim is to raise the bar on quality and integrity, ensuring that authors, editors, and peer reviewers can confidently assess the veracity of referenced works.

2. Philosophy and Core Principles

  1. Integrity and Trust
    Veracity’s chief goal is to uphold scholarly integrity. We offer tools to ensure citations are both accurate and contextually appropriate, reinforcing the academic and professional trust chain.

  2. Augmenting Human Judgment
    Automation can reduce manual workload and highlight potential issues, but final decisions about the accuracy and relevance of a citation ultimately rest with human experts.

  3. Transparency and Accountability
    By flagging inconsistencies or errors, Veracity promotes a transparent environment where issues can be corrected early in the publication cycle, thereby elevating overall quality.

3. Understanding Veracity’s Citation Checking Process

Veracity’s process consists of several distinct but interrelated steps, each designed to detect and flag different types of citation issues.


3.1. Extraction

  1. Identification of In-Text Citations
    We scan the document to locate every citation within the main text. This includes numeric, author-year, and other citation styles.

  2. Bibliographic References
    We parse the references section (or bibliography) to capture titles, authors, DOIs, and other metadata.

  3. Normalizing Citation Data
    Our system converts all extracted elements into a standardized format, making subsequent comparisons more reliable and easier to interpret.

  4. Internal Consistency Checks

    • Citation-Reference Matching: Ensures each in-text citation correctly points to the intended reference in the bibliography.
    • Unique Labels and Identifiers: Verifies correct labeling of references (e.g., “Smith et al. 2004a vs. Smith et al. 2004b” in author-year systems).

Outcome: A structured dataset linking each in-text citation to its corresponding reference, ready for further analysis.


3.2. Source Resolution and Verification

  1. Source Matching
    Veracity queries third-party indices (e.g., Crossref, Semantic Scholar) and search engines (e.g., Google, Brave) to locate and confirm each cited source.

  2. Metadata Validation
    Crosschecks the author-provided metadata (title, authors, publication year, DOI) with what is found in external databases, flagging discrepancies.

  3. Retraction and Editorial Notices
    Checks the retraction status of cited works using services like Retraction Watch. Any editorial notices (e.g., corrections, expressions of concern) are also highlighted.

  4. Content Gathering
    Where possible, Veracity retrieves the full text or abstract of the cited source to enable deeper relevance and support analysis.

Outcome: Each cited reference is verified for existence, metadata accuracy, and editorial status. Any retractions or major inconsistencies are flagged for further review.


3.3. Relevance Check

Using both the extracted content and any additional data gathered, Veracity evaluates the relationship between each in-text statement and the cited source.

  1. Custom AI Models
    Proprietary machine learning models trained on curated citation-data sets to determine how well a reference supports or relates to the in-text claim.

  2. Scoring System

    • 1.0 (High Relevance): The cited source clearly supports or is directly relevant to the statement.
    • 0.0 (No Relevance): The source appears to have no relation to the statement’s subject matter.
    • Intermediate Scores (0.1–0.9): Indicate degrees of uncertainty or partial relevance.
  3. Examples

    • Clearly Relevant (Score: 1.0):
      • Statement: “The authors in [5] proposed a method to segment and classify brain tumors…”
      • Reference: Matches a paper discussing precisely that method.
    • No Relevance (Score: 0):
      • Statement: Discusses treatment principles for cerebral palsy.
      • Reference: Cites a blockchain and edge computing study.
    • Gray Areas: Might require human judgment for final confirmation.
  4. Context Limitations
    If full-text access is restricted, Veracity relies on title, abstract, and metadata to make a preliminary determination. Human reviewers can then investigate further.

Outcome: A structured relevance score for each citation, plus recommendations for further action or clarity where needed.


Below is a concise way to rename and organize your checks section under the heading “Summary of Checks.” This approach is clear to readers while preserving the idea of grouping checks by what they apply to (the entire document, individual citations, references, etc.).


Example Integration in Your Documentation

3.5. Summary of Checks

To help you quickly see all the checks Veracity performs, we’ve consolidated them here in one place, grouped by level of analysis. Some checks require the document to be fully parsed before running (e.g., whether all references match in-text citations); others can be run earlier in the workflow. Refer back to each step of the process (Extraction, Source Resolution, Relevance Check) to see where these checks typically happen.


3.5.1. Document-Level Checks

Check NameDescription
All Sources FoundEnsures that every reference has been matched to a real, identifiable source.
No Retracted ReferencesFlags any references linked to retracted articles or editorial notices.
All References Match MetadataVerifies that cited reference details (e.g., title, authors, year, DOI) align with the retrieved source metadata.
All References Cited in TextConfirms that every bibliographic entry is cited at least once in the text.
No High-Frequency Author CitationsIdentifies potential over-representation by authors cited above a set frequency threshold.
No Self-Citation Above ThresholdChecks whether self-citations exceed a predefined threshold.
No Ambiguous Matching ReferencesEnsures that each in-text citation uniquely links to a single reference.
All In-text Citations RelevantProvides an aggregate relevance assessment, confirming in-text citations point to pertinent sources.

Use these checks to get a high-level sense of the document’s overall integrity. For example, an abundance of self-citations can signal potential bias, while retracted references may indicate outdated or invalid research.


3.5.2. In-Text Citation Checks

Check NameDescription
In-text Citation Has Matching ReferenceConfirms that each citation in the text points to a unique entry in the bibliography.
In-text Citation Not AmbiguousVerifies that the citation unambiguously refers to exactly one source.
In-text Citation RelevantAssesses whether the cited source is contextually relevant to the statement being made.
In-text Citation Supports StatementDetermines whether the source contains specific evidence or arguments that back the claim.

These checks happen during or after Extraction and are refined by the Relevance Check. If Veracity flags a citation for irrelevance, consider whether the source truly supports the text’s claims or if a different citation is needed.


3.5.3. Reference Checks

Check NameDescription
Source FoundConfirms that a referenced source can be accessed via external databases or search engines.
No Editorial NoticesVerifies that the source is not marked as retracted or under editorial concern.
Matches Source MetadataCompares the metadata provided (title, authors, publication details) to external databases.
Reference Cited in TextEnsures that each bibliographic entry is used at least once in an in-text citation.

These checks typically finalize after Source Resolution, when Veracity attempts to locate and verify each reference. If metadata discrepancies arise, the user may need to confirm the correct citation details (e.g., updated DOIs, correct spelling of authors).


3.5.4. Author Checks

Check NameDescription
Author Representation CheckAnalyzes patterns in the reference list to detect if certain authors are cited an unusually high number of times.
Author Previous Retractions CheckIdentifies whether any cited authors have a history of retractions or corrections in their publication record.

These checks rely on fully identifying authors across all references. They help detect potential conflicts of interest or bias, such as a paper heavily self-citing or repeatedly citing authors with a track record of retractions.



3.4. Quick vs. Deep Checks

Veracity offers two modes of operation to suit varying resource constraints and use cases:

  • Quick Checks

    • Faster, lighter AI models and limited content analysis.
    • Ideal for large-scale screenings, quick triage, or preliminary editorial reviews.
  • Deep Checks

    • Leverages more sophisticated AI models and thorough content analysis.
    • Recommended for final peer reviews, high-stakes publications, or compliance-sensitive industries.

Outcome: Organizations can balance cost and efficiency against the depth of citation scrutiny desired.


4. Interpreting Veracity Results

When Veracity completes its analysis, it provides a structured report summarizing:

  1. Internal Consistency

    • How many citations are properly matched vs. unmatched or mislabeled.
  2. Source Verification

    • Which references were found, confirmed, and consistent; which were retracted or flagged.
  3. Relevance Scoring

    • A relevance score for each citation, indicating how well the cited reference supports the statement.
  4. Suggested Actions

    • High Confidence Issues (e.g., retracted source): Immediate correction recommended.
    • Potential Mismatch: Further review or author query.
    • Unclear Relevance: Might require additional context or an editorial decision.

Common Error Types

  • Citation Mismatch (Wrong Reference): The text claims one topic, but the reference is unrelated.
  • Metadata Discrepancy: The year or title does not match external records, raising questions about source accuracy.
  • Retracted Source: The cited work has an editorial notice that must be addressed.
  • Ambiguous Relevance: The source might be tangentially related or lacking direct support.

6. FAQs and Troubleshooting

  • Q: What if Veracity doesn’t find a reference?
    A: It may be an obscure publication or incorrectly formatted reference. Try manual checks or confirm the source with the author.

  • Q: Will Veracity fix my references automatically?
    A: Veracity flags potential issues, but authors remain responsible for making corrections.

  • Q: Does Veracity store or share my manuscript content?
    A: We respect privacy and confidentiality. See our Privacy Policy for details.