ISTQB Certified Tester Specialist Level Testing with Generative AI (CT-GenAI)
Introduction to Generative AI for Software Testing
1.1 Generative AI Key Concepts and Foundations
Focuses on understanding the underlying technology of Generative AI, Large Language Models (LLMs), and their key terminology.
1.1.1 AI Spectrum: Symbolic AI, Classic ML, Deep Learning, and Generative AI
Symbolic AI: Uses rule-based systems to imitate human decision-making
Classic ML: Data-driven approach for tasks such as defect categorization
Deep Learning: Uses neural networks to automatically learn features from complex data
Generative AI (GenAI): Uses deep learning techniques to create new content (text, images, code) by learning patterns
1.1.2 Generative AI and LLMs Foundations
Based on the 'generative pre-trained transformer' (GPT) model, trained on vast, diverse datasets
SLMs (Small Language Models): Compact models with fewer parameters, designed for lightweight and focused GenAI solutions
Key LLM processing concepts:
Tokenization: Process of splitting text into smaller units called 'tokens'
Embeddings: Numerical representations (vectors) of tokens that encode their semantic relationships
Transformer Model: Neural network architecture that processes context of long text sequences
Non-deterministic behavior: LLMs may vary their output for the same input due to probabilistic nature
Context Window: The maximum amount of text (measured in tokens) the model can consider at once
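The embedding concept above can be illustrated with a toy sketch: semantically related tokens map to nearby vectors, and closeness is typically measured with cosine similarity. The three-dimensional vectors here are invented for illustration; real models use hundreds or thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors (1.0 = identical direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings": 'bug' and 'defect' point in similar directions, 'banana' does not.
bug = [0.9, 0.1, 0.2]
defect = [0.85, 0.15, 0.25]
banana = [0.1, 0.9, 0.7]
```

With these vectors, `cosine_similarity(bug, defect)` is higher than `cosine_similarity(bug, banana)`, which is how embeddings encode semantic relationships numerically.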
1.1.3 Foundation, Instruction-Tuned, and Reasoning LLMs
Foundation LLMs (Foundation Models): General-purpose models trained on broad data; typically require adaptation for specific tasks
Instruction-tuned LLMs: Fine-tuned with datasets pairing 'prompts' with 'expected responses'
Reasoning LLMs: Emphasize structured cognitive skills (e.g., logical inference, "chain-of-thought")
1.1.4 Multimodal LLMs and Vision-Language Models
Multimodal LLMs: Process multiple data modalities (text, images, sound, video)
Vision-Language Models (VLM): Specifically integrate visual (images) and textual information
Application in Testing: Can analyze visual elements (screenshots) alongside textual descriptions
1.2 Leveraging Generative AI in Software Testing: Key Principles
Explores LLM capabilities to automate and enhance testing tasks, and the different ways to integrate this technology (chatbots vs. applications).
1.2.1 Key LLM Capabilities for Testing Tasks
Requirements analysis and improvement: Identify ambiguities and inconsistencies
Test case creation support: Generate test cases and suggest test objectives
Test oracle generation: Help generate expected results
Test data generation: Generate synthetic datasets and set boundary values
Test automation support: Help generate and enhance test scripts
Test results analysis: Help analyze results and classify anomalies
Testware creation: Help create documents like test plans and defect reports
1.2.2 AI Chatbots and LLM-Powered Testing Applications
AI Chatbots: Provide a conversational interface (natural language) for direct LLM interaction
LLM-Powered Testing Applications: Integrate LLM capabilities into existing testing tools or frameworks
Prompt Engineering for Effective Software Testing
2.1 Effective Prompt Development
Covers the essential elements for designing clear and structured instructions (prompts) that guide Generative AI models to produce high-quality testing outputs.
2.1.1 Prompt Structure for Generative AI in Software Testing
6 typical components:
Role: Defines the perspective the GenAI should take
Context: Background information the model needs
Instruction: Clear and concise directives about the task
Input data: Information necessary for the task
Constraints: Limitations the LLM must follow
Output format: Specifications on how the response should look
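The six components above can be assembled programmatically. This is a minimal sketch; the helper name and the example field values are illustrative assumptions, not part of the syllabus.

```python
def build_prompt(role, context, instruction, input_data, constraints, output_format):
    """Assemble the six typical prompt components into a single prompt string."""
    return "\n".join([
        f"Role: {role}",
        f"Context: {context}",
        f"Instruction: {instruction}",
        f"Input data: {input_data}",
        f"Constraints: {constraints}",
        f"Output format: {output_format}",
    ])

prompt = build_prompt(
    role="You are an experienced software tester.",
    context="We are testing a login form with username and password fields.",
    instruction="Generate boundary-value test cases for the username field.",
    input_data="Username length: 3 to 20 characters.",
    constraints="Focus on input validation only; do not include security tests.",
    output_format="A numbered list, one test case per line.",
)
```

Keeping each component on its own labeled line makes prompts easier to review, version, and refine iteratively.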
2.1.2 Key Prompting Techniques for Software Testing
Prompt Chaining: Decompose a complex task into a series of intermediate steps
Few-shot Prompting: Provide the LLM with a few examples within the prompt
Zero-shot Prompting (Comparison): No examples are provided
One-shot Prompting (Comparison): Only one example is provided
Meta Prompting: Leverage the AI's ability to generate or refine its own prompts
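Prompt chaining, the first technique above, can be sketched as a pipeline in which each step's output feeds the next prompt. The `ask_llm` function is a stub standing in for any real LLM API call; it is an assumption for illustration only.

```python
def ask_llm(prompt):
    # Placeholder for a real LLM API call; returns a canned response for illustration.
    return f"[model response to: {prompt[:40]}...]"

def chained_test_design(requirement):
    """Prompt chaining: decompose test design into intermediate steps,
    feeding each step's output into the next prompt."""
    conditions = ask_llm(f"List test conditions for this requirement:\n{requirement}")
    cases = ask_llm(f"Derive test cases from these test conditions:\n{conditions}")
    data = ask_llm(f"Generate test data for these test cases:\n{cases}")
    return data
```

Breaking the task into steps lets a human verify each intermediate result, which is why the syllabus recommends chaining for complex tasks requiring precision.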
2.1.3 System Prompt and User Prompt
System Prompt: Defines the general personality, tone, operating rules, and constraints
User Prompt: The user's actual input or question
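In chat-style LLM APIs, the system/user distinction above is commonly expressed as a list of role-tagged messages. The structure below follows the widely used chat-message convention; the exact payload shape varies by provider.

```python
# The system prompt sets the global rules once; the user prompt carries the actual request.
messages = [
    {"role": "system",
     "content": "You are a test analyst. Use ISTQB terminology and never "
                "invent requirements that were not provided."},
    {"role": "user",
     "content": "Review this acceptance criterion for ambiguity: "
                "'The system should respond quickly.'"},
]
```

Separating the two lets an organization centrally control the system prompt (tone, constraints, guardrails) while testers vary only the user prompt.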
2.2 Applying Prompt Engineering Techniques to Testing Tasks
Details how to use prompt engineering in the main phases of the testing process: analysis, design, implementation, and control.
2.2.1 Test Analysis with Generative AI
Typical tasks: Identify potential defects in the test basis, generate test conditions, prioritize test conditions, suggest relevant testing techniques
2.2.2 Test Design and Implementation with Generative AI
Typical tasks: Test case generation, synthetic test data synthesis, automated test script generation
2.2.3 Automated Regression Testing with Generative AI
Typical tasks: Automated script implementation, impact analysis and test optimization, self-healing tests, automated test reports
2.2.4 Test Monitoring and Control with Generative AI
Typical tasks: Test monitoring and metric analysis, test control (reprioritization), test completion insights, enhanced visualization
2.2.5 Choosing Prompting Techniques for Software Testing
Prompt Chaining: Best for complex tasks requiring precision and human verification
Few-shot Prompting: Best for repetitive tasks or those requiring a specific output format
Meta Prompting: Best for flexible and dynamic tasks or for creating new prompts
2.3 Evaluating Generative AI Outputs and Refining Prompts
Focuses on how to measure the quality and usefulness of AI-generated testware and the iterative process for improving instructions using testing metrics.
2.3.1 Metrics for Evaluating Generative AI Outputs in Testing Tasks
Accuracy: Overall correctness of the output
Precision: Correctness of the output with respect to a specific objective
Recall: Ability to identify all relevant instances
Relevance and Contextual Fit: Whether the output is applicable and appropriate for the context
Diversity: Ensuring a wide range of inputs and scenarios are covered
Execution Success Rate: Proportion of generated scripts that can execute successfully
Time Efficiency: Time saved compared to manual effort
2.3.2 Techniques for Iteratively Evaluating and Refining Prompts
Iterative prompt modification: Start with a base prompt and gradually modify it
A/B testing of prompts: Create multiple prompt versions and evaluate which yields better results
Output analysis: Examine the output looking for inaccuracies
Integrate user feedback: Collect opinions from testers
Adjust prompt length and specificity: Experiment with different levels of detail
Managing Generative AI Risks in Software Testing
3.1 Hallucinations, Reasoning Errors, and Biases
Analyzes internal risks related to the generation of incorrect, illogical, or discriminatory content by the AI, and how to detect and mitigate them.
3.1.1 Hallucinations, Reasoning Errors, and Biases in Generative AI
Hallucinations: AI generates output that appears plausible but is factually incorrect or invented
Reasoning Errors: LLMs misinterpret logical structures
Biases: Stem from the training data
3.1.2 Identifying Hallucinations, Reasoning Errors, and Biases in LLM Output
Hallucination Detection: Cross-verification, domain expertise consultation, consistency checks
Reasoning Error Detection: Logical validation, output testing (executing generated tests)
Bias Detection: Reviewing whether generated testware precisely represents the test strategy
3.1.3 Mitigation Techniques for Hallucinations, Reasoning Errors, and Biases
Provide complete context, divide prompts into manageable segments (prompt chaining), use clear data formats, compare results across models
3.1.4 Mitigating Non-Deterministic Behavior of LLMs
Adjust 'Temperature' parameter: Lowering temperature (e.g., to 0) reduces randomness
Set random seeds: Improves reproducibility of outputs
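The temperature effect above can be demonstrated with the softmax function LLMs use to turn raw token scores into sampling probabilities; this is a simplified sketch of the mechanism, not a real model.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Convert raw model scores into probabilities; lower temperature
    sharpens the distribution toward the most likely token."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
hot = softmax_with_temperature(logits, 1.0)   # noticeable randomness
cold = softmax_with_temperature(logits, 0.1)  # top token dominates
```

At low temperature the top token's probability approaches 1, which is why setting temperature near 0 makes LLM outputs nearly deterministic.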
3.2 Data Privacy and Security Risks of Generative AI
Addresses threats to confidentiality, integrity, and availability of information when using AI systems, including "Shadow AI."
3.2.1 Key Data Privacy and Security Risks Associated with Generative AI
Data Privacy Concerns: Unintentional data exposure, lack of control over data usage, compliance risks
Security Risks: Vulnerable infrastructure, exploitation of LLM vulnerabilities, malicious input
3.2.2 Data Privacy and Vulnerabilities in GenAI Testing Processes and Tools
Attack Vectors: Data exfiltration, request manipulation, data poisoning, malicious code generation
3.2.3 Mitigation Strategies to Protect Privacy and Enhance Security
Data Protection Measures: Data minimization, anonymization and pseudonymization, secure storage, staff training
Additional Strategies: Systematic review of generated output, choice of a secure operating environment
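Pseudonymization, listed above, can be sketched as replacing identifying fields with stable hashed tokens before test data leaves the organization. The field names and record are illustrative; real schemes also need salting and key management.

```python
import hashlib

def pseudonymize(record, sensitive_fields):
    """Replace sensitive values with stable pseudonyms before sending
    test data to an external LLM (data minimization in practice)."""
    cleaned = dict(record)
    for field in sensitive_fields:
        if field in cleaned:
            digest = hashlib.sha256(str(cleaned[field]).encode()).hexdigest()[:8]
            cleaned[field] = f"{field}_{digest}"
    return cleaned

user = {"name": "Alice Smith", "email": "alice@example.com", "order_total": 99.90}
safe = pseudonymize(user, ["name", "email"])
```

The non-identifying field (`order_total`) stays usable for test generation, while the same input always maps to the same pseudonym, preserving referential consistency across records.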
3.3 Energy Consumption and Environmental Impact
Discusses the computational cost and carbon footprint associated with LLM training and inference, and the need for optimization.
3.3.1 Impact of GenAI Use on Energy Consumption and CO2 Emissions
Training and processing LLMs (inference) requires intensive use of computational resources, resulting in a substantial environmental load
3.4 AI Regulations, Standards, and Best Practice Frameworks
Presents the regulatory landscape and risk management frameworks to ensure ethical and compliant use of Generative AI in testing.
3.4.1 Relevant Regulations, Standards, and Frameworks for GenAI in Testing
ISO/IEC 42001:2023: Standard for managing AI systems
ISO/IEC 23053:2022: Framework for AI Systems using ML
EU AI Act: Regulation classifying applications by risk level
NIST AI Risk Management Framework (RMF): Guides for managing AI risks
LLM-Powered Testing Infrastructure
4.1 Architectural Approaches for LLM-Powered Testing Infrastructure
Describes the necessary architecture to integrate LLMs into the testing environment, including the use of Vector Databases and the RAG approach.
4.1.1 Key Architectural Concepts and Components
Definition: Infrastructure that integrates an LLM into the software testing process
Typical Architecture: Front-end, Back-end, LLM
Integration of multiple data sources: Relational databases and Vector databases
4.1.2 Retrieval-Augmented Generation (RAG)
RAG Definition: Technique that enhances LLMs by incorporating additional external data sources
Runtime process: 1. Retrieval of relevant data 'chunks'. 2. Generation of 'grounded' response
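The two-step runtime process above can be sketched end to end. Retrieval here uses naive keyword overlap for simplicity; real RAG systems retrieve chunks from a vector database by embedding similarity. The chunks and query are invented examples.

```python
# Step 1: retrieve the most relevant documentation chunks for the query.
chunks = [
    "Login requires a username of 3-20 characters.",
    "Passwords expire after 90 days.",
    "The reporting module exports CSV and PDF.",
]

def retrieve(query, chunks, top_k=2):
    """Rank chunks by word overlap with the query (stand-in for vector search)."""
    query_words = set(query.lower().split())
    def overlap(chunk):
        return len(query_words & set(chunk.lower().split()))
    return sorted(chunks, key=overlap, reverse=True)[:top_k]

# Step 2: ground the prompt in the retrieved chunks so the answer stays factual.
def grounded_prompt(query):
    context = "\n".join(retrieve(query, chunks))
    return f"Using only this context:\n{context}\n\nAnswer: {query}"

prompt = grounded_prompt("username rules for the login form")
```

Because the model is instructed to answer only from retrieved context, RAG reduces hallucinations and lets the LLM use project documentation it was never trained on.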
4.1.3 The Role of LLM-Powered Agents in Test Automation
LLM-Powered Agents: Specialized GenAI applications for semi-autonomous or autonomous task processing
Key difference from chatbots: Agents can "act" by invoking functions or "tools"
Autonomy Levels: Autonomous and Semi-autonomous Agents
Multi-agent Architectures: Collaborative system where multiple agents coordinate
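The key difference noted above, that agents can "act" by invoking tools, can be sketched as a dispatch loop over a tool registry. The tool names, stub functions, and routing logic are illustrative assumptions, not a real framework.

```python
# Stub tools standing in for real integrations (test runner, file access).
def run_pytest(path):
    return f"executed tests in {path}: 12 passed"

def read_log(path):
    return f"last 10 lines of {path}"

TOOLS = {"run_tests": run_pytest, "read_log": read_log}

def agent_step(action, argument):
    """Dispatch a model-chosen action to the matching tool.
    In a real agent, 'action' and 'argument' come from the LLM's output."""
    tool = TOOLS.get(action)
    if tool is None:
        return f"unknown tool: {action}"
    return tool(argument)

result = agent_step("run_tests", "tests/login")
```

A semi-autonomous agent would pause for human approval before each dispatch; an autonomous one would loop, feeding each tool result back to the LLM to decide the next action.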
4.2 Fine-Tuning and LLMOps: Operationalizing Generative AI
Explains advanced techniques (like Fine-Tuning) to adapt LLMs to specific domains and LLMOps practices for managing their lifecycle in production.
4.2.1 Fine-Tuning LLMs for Testing Tasks
Fine-tuning: Adapts a pre-trained Language Model to perform specific tasks or adjust to particular domains
Involves additional supervised training on a focused, labeled dataset
Challenges: Requires high-quality datasets, mitigating 'overfitting', managing 'opacity'
4.2.2 LLMOps in Deploying and Managing LLMs for Testing
LLMOps (Large Language Model Operations): Practices, tools, and processes to streamline the development, deployment, and maintenance of LLMs
Implementation Approaches: Using an AI chatbot, using an integrated GenAI testing tool, or In-house development
Deploying and Integrating Generative AI in Testing Organizations
5.1 Roadmap for Generative AI Adoption in Testing
Establishes a strategic plan, including mitigating "Shadow AI" risks and criteria for model selection, across the adoption phases.
5.1.1 Risks of "Shadow AI"
Shadow AI: Use of GenAI tools or systems without formal approval
Risks: Data security and privacy weaknesses, regulatory compliance issues, unclear intellectual property ownership
5.1.2 Key Aspects of a Generative AI Strategy in Testing
Define measurable objectives, select the correct LLMs/SLMs, ensure input data quality, establish training programs, collect metrics, and establish governance guidelines
5.1.3 Selection of LLMs/SLMs for Testing Tasks
Key criteria: Model performance, Fine-tuning potential, recurring cost, community and support
5.1.4 Phases in Generative AI Adoption in Testing
Phase 1: Discovery: Awareness, training, and initial experimentation
Phase 2: Initiation and use definition: Identify and prioritize use cases, evaluate infrastructure
Phase 3: Utilization and iteration: Full integration, continuous monitoring and measurement
5.2 Managing Change When Adopting Generative AI
Details the evolution of roles (Tester and Test Manager), essential skills required, and capability building within testing teams.
5.2.1 Essential Skills and Knowledge for Testing with GenAI
Key skills: Mastering prompt engineering techniques, understanding the model, developing test review methods, knowledge of risks, data security implications
5.2.2 Building GenAI Capabilities in Testing Teams
Practical approach with various LLMs, structured learning paths, internal Communities of Practice (CoP), sharing prompt libraries
5.2.3 Evolution of Testing Processes in AI-Enabled Organizations
Tester Evolution: Shifts to an AI-assisted testing specialist. Tasks: AI output review, prompt refinement
Test Manager Evolution: Updated responsibilities include developing an AI-based testing strategy, AI-based risk management, and leading hybrid teams