Research design is the master blueprint of any academic investigation. Before you upload a single document to ResearchLens, you need a clearly articulated research design that specifies what you are studying, why it matters, how you will gather evidence, and how you will interpret findings. The design choice is not stylistic — it is epistemological. It reflects your assumptions about reality, knowledge, and the legitimate ways of producing scholarly truth.
ResearchLens supports all four major research orientations used in social science, education, public health, military science, and the humanities. Each approach has rigorous standards, distinct workflows, and specific outputs that the platform is designed to produce.
Choosing the Right Method
Qualitative research is grounded in the assumption that reality is socially constructed and that meaning emerges through human interaction with the world. Rather than measuring phenomena, qualitative researchers seek to interpret them. The data are words, narratives, observations, and artifacts — not numbers.
The epistemological foundation is typically constructivism (Creswell & Poth, 2018) or interpretivism (Lincoln & Guba, 1985), which holds that the researcher and participant co-construct meaning. This means the researcher's positionality, reflexivity, and transparency are not flaws — they are methodological requirements that must be disclosed and addressed.
Thematic Analysis: Step-by-Step
Braun and Clarke's (2006) thematic analysis is the most widely used qualitative analytical framework and is the primary engine behind ResearchLens's qualitative coding module.
-
1
Familiarize Yourself with the DataRead and re-read your corpus of data. Note initial ideas. In ResearchLens, this means uploading your documents and reviewing the extracted text in the context pane before coding begins. Mark initial observations as memos or annotations.
-
2
Generate Initial CodesSystematically code interesting features of the data. Codes are short labels that capture the essence of a data segment. In ResearchLens, add your codes in the "Theme Codes" input field. Each code should be grounded in the data, not pre-imposed from theory (unless you are conducting a theory-driven deductive analysis).
-
3
Search for ThemesCluster related codes into potential themes. A theme captures something important about the data in relation to your research question. Themes must be substantive — they are not simply topics but patterns of shared meaning. ResearchLens will show you code co-occurrence and frequency distributions to assist this process.
-
4
Review ThemesRefine themes against the full dataset. A theme is valid only if there is sufficient data to support it and it is clearly distinguishable from other themes. Some themes may collapse; new ones may emerge. This is iterative, not linear.
-
5
Define and Name ThemesWrite a clear definition of each theme and choose a compelling name that captures its essence. The name should convey what the theme is about to a reader who has not seen your data.
-
6
Produce the Written ReportUse ResearchLens's APA Report Generator to produce a formatted report. Qualitative reports require vivid, analytic narrative that weaves together data extracts with interpretive commentary. Avoid description for its own sake — every data quote must be analytically justified.
Content analysis is a systematic, replicable technique for compressing many words of text into fewer content categories based on explicit rules of coding (Krippendorff, 2018). It bridges qualitative and quantitative traditions — the process of identifying codes and themes is interpretive, but the resulting data (frequencies, proportions, co-occurrences) are quantitative.
Content analysis is appropriate when you want to analyze communication artifacts: policy documents, news articles, social media posts, transcripts, training manuals, advertisements, or published research. The unit of analysis can be a word, phrase, sentence, paragraph, document, or image.
Content Analysis Workflow in ResearchLens
-
1
Define Your Unit of AnalysisDecide whether you are coding individual words, sentences, paragraphs, or whole documents. ResearchLens operates at the word/phrase level for frequency analysis and at the sentence/passage level for thematic coding. Document this decision in your Methods section — it must be replicable.
-
2
Develop Your Coding SchemeYour coding scheme maps content categories to text evidence. Categories must be exhaustive (every relevant unit can be coded), mutually exclusive (no unit belongs to two categories simultaneously), and reliably applicable by more than one coder. Enter your categories as theme codes in ResearchLens.
-
3
Run Frequency AnalysisResearchLens will compute keyword frequency, show the top N terms, and generate a frequency chart. Cross-reference with your coding scheme. High-frequency terms that fall outside your scheme may indicate gaps in your categories.
-
4
Calculate Inter-Rater ReliabilityIf two coders are applying the same scheme, measure agreement. Cohen's Kappa (κ) above .70 indicates acceptable reliability; above .80 is strong. ResearchLens provides export functions — use the CSV export to calculate κ in a spreadsheet or statistical tool. Document your IRR in the Methodology section.
-
5
Interpret and Report FindingsInterpret what the frequency distribution and coding results mean in relation to your research question. Content analysis findings require both description (what the data show) and interpretation (what the patterns mean). Use the ResearchLens APA report generator to structure these sections correctly.
Quantitative research operates within a post-positivist paradigm that assumes a single, objective reality that can be measured with precision. The goal is to test hypotheses derived from theory, measure relationships between variables, establish causation or correlation, and generalize findings to a broader population (Creswell & Creswell, 2023).
In quantitative research, rigor is established through validity (does your instrument measure what it claims?), reliability (does it produce consistent results?), and statistical power (is your sample large enough to detect real effects?). These are non-negotiable — ResearchLens's quantitative module is designed to help you surface these metrics from your survey CSV data.
Descriptive Statistics
- Mean, median, mode (central tendency)
- Standard deviation, variance (spread)
- Range, min, max, percentiles
- Frequency distributions and histograms
- Skewness and kurtosis for normality
Inferential Statistics
- t-tests (compare two group means)
- ANOVA (compare three or more groups)
- Chi-square (categorical relationships)
- Pearson/Spearman correlation
- Regression analysis (prediction)
Quantitative Analysis Workflow
-
1
Define Variables and HypothesesIdentify your independent variable (IV), dependent variable (DV), and any covariates. Write null and alternative hypotheses in formal notation: H₀: μ₁ = μ₂ vs. H₁: μ₁ ≠ μ₂. Enter these as Research Questions in ResearchLens — the platform's RQ Alignment Score will later tell you how strongly your data responded to each question.
-
2
Clean and Prepare Your DataRemove duplicates, handle missing values, check for outliers, and verify that all numeric fields are coded consistently. ResearchLens reads CSV files directly. Ensure column headers are clean — no special characters, spaces replaced with underscores, numeric Likert scales encoded as integers not text (e.g., 1–5, not "Strongly Agree").
-
3
Run Descriptive StatisticsRun frequency analysis to see distributions. Review the automated charts. Look for skewed distributions that may violate assumptions of parametric tests. A heavily skewed Likert distribution may require non-parametric alternatives (e.g., Mann-Whitney U instead of t-test).
-
4
Interpret the RQ Alignment ScoreResearchLens computes a Research Question Alignment Score (0–100) that measures how strongly your document or data content aligns with each research question. In quantitative work, treat this as a preliminary alignment check. Scores below 30 may indicate the data set does not adequately address the RQ and warrants methodological reconsideration.
-
5
Export and Complete Statistical AnalysisExport your cleaned dataset from ResearchLens as CSV, then run inferential statistics in SPSS, R, or Python. ResearchLens handles descriptive, frequency, and content-level quantitative analysis. Full inferential testing requires a dedicated statistical environment. Document the entire pipeline in your methodology section.
Mixed methods research is not simply using both a survey and an interview in the same project. It is a methodological paradigm in its own right, grounded in pragmatism, that intentionally integrates qualitative and quantitative data in ways that produce insights neither strand could generate alone (Creswell & Plano Clark, 2018). Integration — the philosophical core of mixed methods — must occur at the design level, data collection level, analysis level, or all three.
The choice of design is determined by the sequence of data collection (concurrent or sequential), the priority of each strand (equal weight or one strand dominant), and the point of integration (data collection, analysis, or interpretation). These decisions must be explicit in your dissertation or research proposal.
Core Mixed Methods Designs
Quantitative data is collected and analyzed first. Results that are surprising, significant, or unclear are then explained through qualitative follow-up. Priority sits with the quantitative strand. This is the most common design in dissertation research.
Example: Survey 500 veterans on retention rates (QUAN). Follow up with 12 interviews to explain why high-scoring regions unexpectedly had lower retention (qual).
Qualitative data is collected first to explore an understudied phenomenon. Findings then inform the development of a survey instrument or intervention that is tested quantitatively with a larger sample. Used when validated instruments do not yet exist for the population or construct.
Example: Interview 15 distance learning coordinators to identify technology adoption barriers (qual). Develop and administer a validated survey to 400 coordinators based on the emergent themes (QUAN).
Both strands are collected independently at the same time with equal priority. Results are compared, merged, or contrasted in the interpretation phase to create a fuller picture. Used when the researcher wants multiple perspectives on the same phenomenon simultaneously.
Example: Simultaneously survey 300 Army recruits on enlistment motivations (QUAN) and interview 20 recruits about their decision-making experience (qual). Merge to assess convergence and divergence.
Step-by-Step: From Upload to APA Report
NEW: Codebook Builder v2 — Full 8-Field Academic Export
NEW: Manual Coding Mode v2 — Interactive Two-Pane Workspace
NEW: Codebook Builder v2 — Full 8-Field Academic Export
NEW: Manual Coding Mode v2 — Full Interactive Workspace
Supported File Types and Their Use Cases
| File Type | Best For | Method Alignment | Notes |
|---|---|---|---|
| PDF (text-layer) | Academic articles, policy documents, reports, transcripts | Qualitative Content | Direct text extraction. Formatting stripped. Best quality extraction. |
| PDF (scanned) | Historical documents, handwritten notes, printed forms | Qualitative Content | OCR via Tesseract.js. Quality depends on scan resolution. 300 DPI minimum recommended. |
| DOCX | Interview transcripts, research memos, literature reviews | Qualitative Mixed | Full content including tables. Best format for research documents created in Word. |
| CSV | Survey data, Likert scales, behavioral logs, enrollment records | Quantitative Mixed | Numeric columns auto-detected. First row must be column headers. UTF-8 encoding required. |
The Research Integrity Checklist
- Declare your positionality — In qualitative and mixed methods research, disclose your relationship to the topic, population, and data. Your perspective shapes interpretation. This is not a flaw; undisclosed positionality is the flaw.
- Obtain IRB approval — Any research involving human participants (including analysis of their documents or survey responses) requires Institutional Review Board review. Complete this before uploading any participant data to ResearchLens.
- De-identify data before uploading — Remove names, student IDs, social security numbers, and any direct identifiers from documents before uploading. ResearchLens runs locally, but data hygiene is your responsibility as the researcher.
- Match your method to your epistemology — If you believe knowledge is constructed socially, using a purely quantitative design creates an internal contradiction. Your worldview, methodology, and methods must be philosophically consistent (Creswell & Poth, 2018).
- Triangulate your findings — Never rely on a single data source or method. Cross-reference your ResearchLens outputs with external literature, participant feedback, or peer review of your coding scheme.
- Audit your coding trail — Document every decision you make about code creation, modification, and deletion. ResearchLens's export functions allow you to capture snapshots of your analysis at multiple stages. Maintain these as appendices.
- Do not cherry-pick findings — Report disconfirming data, outliers, and alternative interpretations. Academic integrity requires presenting findings that challenge your hypothesis with the same rigor as findings that support it.
- Do not generalize beyond your design — Qualitative findings from 12 interview participants cannot be generalized to a population. Quantitative findings from a convenience sample cannot be generalized beyond the sample's characteristics. State limitations explicitly.
Methodology Comparison at a Glance
| Criterion | Qualitative | Content Analysis | Quantitative | Mixed Methods |
|---|---|---|---|---|
| Paradigm | Constructivism / Interpretivism | Both / Pragmatic | Post-positivism | Pragmatism |
| Data Type | Words, narratives | Text, communication artifacts | Numbers, variables | Both |
| Sample Size | Small, purposeful | Variable (corpus-based) | Large, representative | Depends on design |
| Rigor Standard | Trustworthiness (Lincoln & Guba) | IRR, systematic coding | Validity, reliability, power | Integration quality |
| ResearchLens Tool | Theme coder, context review | Frequency analysis, coding scheme | CSV analysis, RQ scoring | All modules combined |
| Typical Output | Themes, narratives, quotes | Frequency tables, coding maps | Statistics, charts, p-values | Integrated findings |
Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology, 3(2), 77–101. https://doi.org/10.1191/1478088706qp063oa
Creswell, J. W., & Creswell, J. D. (2023). Research design: Qualitative, quantitative, and mixed methods approaches (6th ed.). SAGE Publications.
Creswell, J. W., & Plano Clark, V. L. (2018). Designing and conducting mixed methods research (3rd ed.). SAGE Publications.
Creswell, J. W., & Poth, C. N. (2018). Qualitative inquiry and research design: Choosing among five approaches (4th ed.). SAGE Publications.
Krippendorff, K. (2018). Content analysis: An introduction to its methodology (4th ed.). SAGE Publications.
Lincoln, Y. S., & Guba, E. G. (1985). Naturalistic inquiry. SAGE Publications.
Merriam, S. B., & Tisdell, E. J. (2016). Qualitative research: A guide to design and implementation (4th ed.). Jossey-Bass.
Neuendorf, K. A. (2017). The content analysis guidebook (2nd ed.). SAGE Publications.
Patton, M. Q. (2015). Qualitative research and evaluation methods (4th ed.). SAGE Publications.
Yin, R. K. (2018). Case study research and applications: Design and methods (6th ed.). SAGE Publications.